adult

From MaRDI portal
Dataset:6036968



OpenML43898MaRDI QIDQ6036968

OpenML dataset with id 43898

No author found.

Full work available at URL: https://api.openml.org/data/v1/download/22102806/adult.arff

Upload date: 31 May 2022


Dataset Characteristics

Number of classes: 2
Number of features: 15 (numeric: 6, symbolic: 9 and in total binary: 2 )
Number of instances: 48,790
Number of instances with missing values: 3,615
Number of missing values: 6,456

Predict whether income exceeds $50K/yr based on census data. Also known as Census Income dataset. Train and test sets combined. Null values represented with question mark is replaced with na. 52 duplicate values found and dropped