November 2018
Intermediate to advanced
300 pages
7h 42m
English
In this chapter, we will work with a book ratings dataset (Ziegler et al., 2005) that was collected in a four-week crawl. It contains data on 278,858 members of the Book-Crossing website and 1,157,112 ratings, both implicit and explicit, referring to 271,379 distinct ISBNs. User data is anonymized, but with demographic information. The dataset is taken from Improving Recommendation Lists Through Topic Diversification, Cai-Nicolas Ziegler, Sean M. McNee, Joseph A. Konstan, Georg Lausen: Proceedings of the 14th International World Wide Web Conference (WWW '05), May 10-14, 2005, Chiba, Japan (http://www2.informatik.uni-freiburg.de/~cziegler/BX/).
The Book-Crossing dataset is comprised of three files, as follows: