20_newsgroups.drift
OpenML274MaRDI QIDQ6033061FDOQ6033061RO-CrateQ6033061
OpenML dataset with id 274
Ken Lang
Full work available at URL: https://api.openml.org/data/v1/download/11347/20_newsgroups.drift.arff
Upload date: 2 May 2014
Dataset Characteristics
Number of classes: 2
Number of features: 1,002 (numeric: 0, symbolic: 1,001 and in total binary: 1,001 )
Number of instances: 399,940
Number of instances with missing values: 0
Number of missing values: 0
ROCrate
What is a RO-Crate?
A RO-Crate is a standardized research object package used to bundle data together with rich machine-readable metadata. Each RO-Crate contains:
- the files belonging to the dataset (e.g. CSVs, images, code, documentation)
- a ro-crate-metadata.json file describing the content, provenance, and context
- persistent identifiers and references to related research objects (e.g. software, publications)
This ensures that the dataset can be easily reused, cited, validated, and interpreted in a reproducible manner. More information can be found here.
Download
You can download a RO-Crate for this dataset here: Download RO-Crate
HINT: The RO-Crate is created dynamically, so it could take up to 30 seconds until the downloads starts.
This page was built for dataset: 20_newsgroups.drift