SyskillWebert-BioMedical
Dataset:6033113
OpenML dataset with id 374
No author found.
Full work available at URL: https://api.openml.org/data/v1/download/1663733/SyskillWebert-BioMedical.arff
Upload date: 27 September 2014
Dataset Characteristics
Number of classes: 3
Number of features: 3 (numeric: 0, symbolic: 1 and in total binary: 0 )
Number of instances: 131
Number of instances with missing values: 0
Number of missing values: 0
Author: Michael Pazzani (pazzani@ics.uci.edu) Source: UCI- 1999 Please cite:
Syskill and Webert Web Page Ratings This database contains the HTML source of web pages plus the ratings of a single user on these web pages. The web pages are on four separate subjects (Bands- recording artists; Goats; Sheep; and BioMedical)
The HTML source of a web page is given. Users looked at each web page and indicated on a 3 point scale (hot medium cold) 50-100 pages per domain. However, this is realistic because we want to learn user profiles from as few examples as possible so that users have an incentive to rate pages.
The problem is to predict user ratings for web pages (within a subject category). The accuracy of predicting ratings is reported in early publications. Later publications used the precision at top N or the F-measure.
Past Usage Pazzani M., Billsus, D. (1997). Learning and Revising User Profiles: The identification of interesting web sites. Machine Learning 27, 313-331
Pazzani, M., Muramatsu J., Billsus, D. (1996). Syskill & Webert: Identifying interesting web sites. Proceedings of the National Conference on Artificial Intelligence, Portland, OR.
This page was built for dataset: SyskillWebert-BioMedical