tae
OpenML48MaRDI QIDQ6032903FDOQ6032903RO-CrateQ6032903
OpenML dataset with id 48
Wei-Yin Loh
Full work available at URL: https://api.openml.org/data/v1/download/48/tae.arff
Upload date: 6 April 2014
Dataset Characteristics
Number of classes: 3
Number of features: 6 (numeric: 3, symbolic: 3 and in total binary: 2 )
Number of instances: 151
Number of instances with missing values: 0
Number of missing values: 0
Author: Source: Unknown - Please cite:
1. Title: Teaching Assistant Evaluation
2. Sources:
(a) Collector: Wei-Yin Loh (Department of Statistics, UW-Madison)
(b) Donor: Tjen-Sien Lim (limt@stat.wisc.edu)
(b) Date: June 7, 1997
3. Past Usage:
1. Loh, W.-Y. & Shih, Y.-S. (1997). Split Selection Methods for
Classification Trees, Statistica Sinica 7: 815-840.
2. Lim, T.-S., Loh, W.-Y. & Shih, Y.-S. (1999). A Comparison of
Prediction Accuracy, Complexity, and Training Time of
Thirty-three Old and New Classification Algorithms. Machine
Learning. Forthcoming.
(ftp://ftp.stat.wisc.edu/pub/loh/treeprogs/quest1.7/mach1317.pdf or
(http://www.stat.wisc.edu/~limt/mach1317.pdf)
4. Relevant Information:
The data consist of evaluations of teaching performance over three
regular semesters and two summer semesters of 151 teaching assistant
(TA) assignments at the Statistics Department of the University of
Wisconsin-Madison. The scores were divided into 3 roughly equal-sized
categories ("low", "medium", and "high") to form the class variable.
5. Number of Instances: 151
6. Number of Attributes: 6 (including the class attribute)
7. Attribute Information:
1. Whether of not the TA is a native English speaker (binary)
1=English speaker, 2=non-English speaker
2. Course instructor (categorical, 25 categories)
3. Course (categorical, 26 categories)
4. Summer or regular semester (binary) 1=Summer, 2=Regular
5. Class size (numerical)
6. Class attribute (categorical) 1=Low, 2=Medium, 3=High
8. Missing Attribute Values: None
Information about the dataset CLASSTYPE: nominal CLASSINDEX: last
ROCrate
What is a RO-Crate?
A RO-Crate is a standardized research object package used to bundle data together with rich machine-readable metadata. Each RO-Crate contains:
- the files belonging to the dataset (e.g. CSVs, images, code, documentation)
- a ro-crate-metadata.json file describing the content, provenance, and context
- persistent identifiers and references to related research objects (e.g. software, publications)
This ensures that the dataset can be easily reused, cited, validated, and interpreted in a reproducible manner. More information can be found here.
Download
You can download a RO-Crate for this dataset here: Download RO-Crate
HINT: The RO-Crate is created dynamically, so it could take up to 30 seconds until the downloads starts.
This page was built for dataset: tae