SyskillWebert-Sheep

Dataset:6033116



OpenML ID: 376
MaRDI QID: Q6033116

OpenML dataset with id 376

No author found.

Full work available at URL: https://api.openml.org/data/v1/download/1663734/SyskillWebert-Sheep.arff

Upload date: 27 September 2014


Dataset Characteristics

Number of classes: 2
Number of features: 3 (numeric: 0, symbolic: 1, binary in total: 1)
Number of instances: 65
Number of instances with missing values: 0
Number of missing values: 0

Author: Michael Pazzani (pazzani@ics.uci.edu)
Source: UCI - 1999
Please cite:

Syskill and Webert Web Page Ratings

This database contains the HTML source of web pages plus the ratings of a single user on these web pages. The web pages cover four separate subjects: Bands (recording artists), Goats, Sheep, and BioMedical.

The HTML source of each web page is given. Each page was viewed and rated on a 3-point scale (hot, medium, cold), with 50-100 pages rated per domain. This is a small number of examples, but it is realistic: we want to learn user profiles from as few examples as possible so that users have an incentive to rate pages.

The problem is to predict user ratings for web pages (within a subject category). The accuracy of predicting ratings is reported in early publications. Later publications used the precision at top N or the F-measure.
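The three evaluation measures mentioned above can be sketched as follows. This is a minimal illustration and not code from the cited publications. The toy labels and scores are made up, and the binary encoding (1 for a page rated "hot", 0 otherwise) is an assumption chosen to match the two classes listed under Dataset Characteristics.

```python
from typing import Sequence


def accuracy(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Fraction of pages whose predicted rating equals the true rating."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


def precision_at_n(y_true: Sequence[int], scores: Sequence[float], n: int) -> float:
    """Fraction of the n highest-scored pages that are truly positive."""
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    top = ranked[:n]
    return sum(y_true[i] for i in top) / len(top)


def f_measure(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Harmonic mean of precision and recall for the positive class."""
    tp = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))
    fp = sum(t == 0 and p == 1 for t, p in zip(y_true, y_pred))
    fn = sum(t == 1 and p == 0 for t, p in zip(y_true, y_pred))
    if tp == 0:
        return 0.0  # no true positives: precision and recall are both zero
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical toy data (not taken from the dataset):
# 1 = page rated "hot", 0 = any other rating (assumed encoding).
y_true = [1, 0, 1, 0, 0, 1]
# Hypothetical classifier scores; higher means "more likely hot".
scores = [0.9, 0.8, 0.7, 0.4, 0.3, 0.2]
# Threshold the scores at 0.5 to obtain hard predictions.
y_pred = [int(s >= 0.5) for s in scores]

print("accuracy:", accuracy(y_true, y_pred))
print("precision at top 3:", precision_at_n(y_true, scores, 3))
print("F-measure:", f_measure(y_true, y_pred))
```

Accuracy treats every page equally, whereas precision at top N and the F-measure focus on the pages predicted to be interesting, which is usually the more relevant question when recommending pages to a user.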

Past Usage

Pazzani, M., Billsus, D. (1997). Learning and Revising User Profiles: The identification of interesting web sites. Machine Learning 27, 313-331.

Pazzani, M., Muramatsu, J., Billsus, D. (1996). Syskill & Webert: Identifying interesting web sites. Proceedings of the National Conference on Artificial Intelligence, Portland, OR.