SyskillWebert-Bands
OpenML380MaRDI QIDQ6033121FDOQ6033121RO-CrateQ6033121
OpenML dataset with id 380
Michael Pazzani
Full work available at URL: https://api.openml.org/data/v1/download/1663741/SyskillWebert-Bands.arff
Upload date: 27 September 2014
Dataset Characteristics
Number of classes: 3
Number of features: 3 (numeric: 0, symbolic: 1 and in total binary: 0 )
Number of instances: 61
Number of instances with missing values: 0
Number of missing values: 0
Author: Michael Pazzani (pazzani@ics.uci.edu) Source: UCI- 1999 Please cite:
Syskill and Webert Web Page Ratings This database contains the HTML source of web pages plus the ratings of a single user on these web pages. The web pages are on four separate subjects (Bands- recording artists; Goats; Sheep; and BioMedical)
The HTML source of a web page is given. Users looked at each web page and indicated on a 3 point scale (hot medium cold) 50-100 pages per domain. However, this is realistic because we want to learn user profiles from as few examples as possible so that users have an incentive to rate pages.
The problem is to predict user ratings for web pages (within a subject category). The accuracy of predicting ratings is reported in early publications. Later publications used the precision at top N or the F-measure.
Past Usage Pazzani M., Billsus, D. (1997). Learning and Revising User Profiles: The identification of interesting web sites. Machine Learning 27, 313-331
Pazzani, M., Muramatsu J., Billsus, D. (1996). Syskill & Webert: Identifying interesting web sites. Proceedings of the National Conference on Artificial Intelligence, Portland, OR.
ROCrate
What is a RO-Crate?
A RO-Crate is a standardized research object package used to bundle data together with rich machine-readable metadata. Each RO-Crate contains:
- the files belonging to the dataset (e.g. CSVs, images, code, documentation)
- a ro-crate-metadata.json file describing the content, provenance, and context
- persistent identifiers and references to related research objects (e.g. software, publications)
This ensures that the dataset can be easily reused, cited, validated, and interpreted in a reproducible manner. More information can be found here.
Download
You can download a RO-Crate for this dataset here: Download RO-Crate
HINT: The RO-Crate is created dynamically, so it could take up to 30 seconds until the downloads starts.
This page was built for dataset: SyskillWebert-Bands