View book-crossing-ratings-1.0 (public)

2011-09-14 15:17 by mgashler | Version 1 | Rating Empty StarEmpty StarEmpty StarEmpty StarEmpty StarEmpty Star
Rating
Empty StarEmpty StarEmpty StarEmpty StarEmpty StarEmpty Star Overall (based on 0 votes)
Empty StarEmpty StarEmpty StarEmpty StarEmpty StarEmpty Star Interesting
Empty StarEmpty StarEmpty StarEmpty StarEmpty StarEmpty Star Documentation
Summary

Ratings of books by readers

License
ODbL
Dependencies
Tags
collaborative-filtering recommendation
Attribute Types
Integer
Download
# Instances: 1149780 / # Attributes: 3
HDF5 (13.2 MB) XML CSV ARFF LibSVM Matlab Octave

Files are converted on demand and the process can take up to a minute. Please wait until download begins.

Completeness of this item currently: 100%.
You can edit this item to add more meta information and make use of the site's premium features.
Original Data Format
arff
Name
book-crossing
Version mldata
0
Comment

This data was obtained from http://www.informatik.uni-freiburg.de/~cziegler/BX/

Collected by Cai-Nicolas Ziegler in a 4-week crawl (August / September 2004) from the Book-Crossing community with kind permission from Ron Hornbaker, CTO of Humankind Systems. Contains 278,858 users (anonymized but with demographic information) providing 1,149,780 ratings (explicit / implicit) about 271,379 books. (Note from the guy that converted it to arff format: I'm not sure where those values came from. The data does not seem to match them.)

[ ! ] Freely available for research use when acknowledged with the following reference (further details on the dataset are given in this publication):

* Improving Recommendation Lists Through Topic Diversification,
  Cai-Nicolas Ziegler, Sean M. McNee, Joseph A. Konstan, Georg Lausen; Proceedings of the 14th International World Wide Web Conference (WWW '05), May 10-14, 2005, Chiba, Japan. To appear.

  Download: [ PDF Pre-Print ]

As a courtesy, if you use the data, I would appreciate knowing your name, what research group you are in, and the publications that may result.

Names
userid,bookid,rating,
Types
  1. numeric
  2. numeric
  3. numeric
Data (first 10 data points)
    userid bookid rating
    0 0 0
    1 1 5
    2 2 0
    3 3 3
    3 4 6
    4 5 0
    5 6 8
    6 7 6
    7 8 7
    8 9 10
    ... ... ...
Description

Collected by Cai-Nicolas Ziegler in a 4-week crawl (August / September 2004) from the Book-Crossing community with kind permission from Ron Hornbaker, CTO of Humankind Systems. (See http://www.informatik.uni-freiburg.de/~cziegler/BX/)

To give attribution, as required by the ODbL license, please cite the following reference: Improving Recommendation Lists Through Topic Diversification, Cai-Nicolas Ziegler, Sean M. McNee, Joseph A. Konstan, Georg Lausen; Proceedings of the 14th International World Wide Web Conference (WWW '05), May 10-14, 2005, Chiba, Japan.

URLs
http://www.informatik.uni-freiburg.de/~cziegler/BX/
Publications
    Data Source
    http://www.informatik.uni-freiburg.de/~cziegler/BX/
    Measurement Details

    To simplify analysis, the original user IDs have been replaced with integer values from 0 to 105282 and the original book ISBN numbers have been replaced with integer values from 0 to 339865. In both cases, all integer values in those ranges occur at least once. The data in its original form is available at http://www.informatik.uni-freiburg.de/~cziegler/BX/, and additional meta-data is available with it.

    Usage Scenario

    Evaluating the effectiveness of a collaborative-filtering recommendation system.

    revision 1
    by mgashler on 2011-09-14 15:17

    No one has posted any comments yet. Perhaps you would like to be the first?

    Leave a comment

    To post a comment, please sign in.

    This item was downloaded 16812 times and viewed 7586 times.

    No Tasks yet on dataset book-crossing-ratings-1.0

    Submit a new Task for this Data item

    Data

    Sort by

    Disclaimer

    We are acting in good faith to make datasets submitted for the use of the scientific community available to everybody, but if you are a copyright holder and would like us to remove a dataset please inform us and we will do it as soon as possible.

    Data | Task | Method | Challenge

    Acknowledgements

    This project is supported by PASCAL (Pattern Analysis, Statistical Modelling and Computational Learning)
    PASCAL Logo
    http://www.pascal-network.org/.