tae

From MaRDI portal



OpenML ID: 48 | MaRDI QID: Q6032903 | FDO: Q6032903 | RO-Crate: Q6032903

OpenML dataset with id 48

Wei-Yin Loh

Full work available at URL: https://api.openml.org/data/v1/download/48/tae.arff

Upload date: 6 April 2014



Dataset Characteristics

Number of classes: 3
Number of features: 6 (numeric: 3, symbolic: 3; of these, binary: 2)
Number of instances: 151
Number of instances with missing values: 0
Number of missing values: 0

Author:
Source: Unknown
Please cite:

1. Title: Teaching Assistant Evaluation

2. Sources:
   (a) Collector: Wei-Yin Loh (Department of Statistics, UW-Madison)
   (b) Donor:     Tjen-Sien Lim (limt@stat.wisc.edu)
   (c) Date:      June 7, 1997

3. Past Usage:
   1. Loh, W.-Y. & Shih, Y.-S. (1997). Split Selection Methods for 
      Classification Trees, Statistica Sinica 7: 815-840.
   2. Lim, T.-S., Loh, W.-Y. & Shih, Y.-S. (1999). A Comparison of
      Prediction Accuracy, Complexity, and Training Time of
      Thirty-three Old and New Classification Algorithms. Machine
      Learning. Forthcoming.
      (ftp://ftp.stat.wisc.edu/pub/loh/treeprogs/quest1.7/mach1317.pdf or
      http://www.stat.wisc.edu/~limt/mach1317.pdf)

4. Relevant Information:
   The data consist of evaluations of teaching performance over three
   regular semesters and two summer semesters of 151 teaching assistant
   (TA) assignments at the Statistics Department of the University of
   Wisconsin-Madison. The scores were divided into 3 roughly equal-sized
   categories ("low", "medium", and "high") to form the class variable.
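The source does not say how the cut points between "low", "medium", and "high" were chosen. As a rough illustration only, the sketch below assumes the split was made at the terciles of the raw scores. The score values and the function name are hypothetical and are not taken from the dataset.

```python
import statistics
from collections import Counter

def tercile_labels(scores):
    """Label each score "low", "medium", or "high" using tercile cut points.

    Assumption: the cut points are the two terciles of the scores. The
    dataset documentation only says the categories are roughly equal-sized.
    """
    lower_cut, upper_cut = statistics.quantiles(scores, n=3)
    labels = []
    for score in scores:
        if score <= lower_cut:
            labels.append("low")
        elif score <= upper_cut:
            labels.append("medium")
        else:
            labels.append("high")
    return labels

# Made-up evaluation scores, for illustration only.
example_scores = [4, 9, 1, 7, 3, 8, 2, 6, 5]
example_labels = tercile_labels(example_scores)
label_counts = Counter(example_labels)
```

With tied scores the three groups can differ in size, which is consistent with the "roughly equal-sized" wording above.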

5. Number of Instances: 151

6. Number of Attributes: 6 (including the class attribute)

7. Attribute Information:
  
   1. Whether or not the TA is a native English speaker (binary)
      1=English speaker, 2=non-English speaker
   2. Course instructor (categorical, 25 categories)
   3. Course (categorical, 26 categories)
   4. Summer or regular semester (binary) 1=Summer, 2=Regular
   5. Class size (numerical)
   6. Class attribute (categorical) 1=Low, 2=Medium, 3=High

8. Missing Attribute Values: None

Information about the dataset
CLASSTYPE: nominal
CLASSINDEX: last
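
The integer codes in the attribute list can be mapped back to readable labels. The following is a minimal sketch, assuming each record is a comma-separated row of six integers in the order given under "Attribute Information", with the class attribute last. The field names are hypothetical (the source only numbers the attributes), and the two sample rows are made up for illustration rather than copied from the dataset.

```python
import csv
import io

# Code-to-label mappings taken from the attribute list in the description.
NATIVE_SPEAKER = {1: "English speaker", 2: "non-English speaker"}
SEMESTER = {1: "Summer", 2: "Regular"}
SCORE_CLASS = {1: "Low", 2: "Medium", 3: "High"}

# Hypothetical field names; the class attribute comes last (CLASSINDEX: last).
FIELDS = ["native_speaker", "instructor", "course",
          "semester", "class_size", "score_class"]

def decode_row(row):
    """Turn one row of six integer codes into a dict with readable labels."""
    values = [int(v) for v in row]
    if len(values) != len(FIELDS):
        raise ValueError(f"expected {len(FIELDS)} values, got {len(values)}")
    record = dict(zip(FIELDS, values))
    record["native_speaker"] = NATIVE_SPEAKER[record["native_speaker"]]
    record["semester"] = SEMESTER[record["semester"]]
    record["score_class"] = SCORE_CLASS[record["score_class"]]
    # Instructor and course stay as category codes: the source gives no names.
    return record

# Illustrative rows only, not actual records from tae.arff.
SAMPLE_ROWS = "2,7,11,2,30,1\n1,4,2,1,12,3\n"
records = [decode_row(row) for row in csv.reader(io.StringIO(SAMPLE_ROWS))]
```

Instructor and course are left as integer codes on purpose. They are categorical identifiers, so they should not be treated as ordered numeric quantities in a model.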






ROCrate

What is an RO-Crate?

An RO-Crate is a standardized research object package used to bundle data together with rich machine-readable metadata. Each RO-Crate contains:

  • the files belonging to the dataset (e.g. CSVs, images, code, documentation)
  • a ro-crate-metadata.json file describing the content, provenance, and context
  • persistent identifiers and references to related research objects (e.g. software, publications)

This packaging allows the dataset to be reused, cited, validated, and interpreted in a reproducible manner.
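
To make the role of ro-crate-metadata.json more concrete, here is a minimal sketch of reading such a file with the Python standard library. It assumes the RO-Crate 1.1 layout, in which the file is JSON-LD with a flat "@graph" list and a metadata descriptor entity whose "about" property points at the root dataset. The example metadata is hand-written and hypothetical; the crate generated for this dataset may contain different entities and properties.

```python
import json

# Hypothetical, hand-written metadata; not the actual crate for this dataset.
EXAMPLE_METADATA = """
{
  "@context": "https://w3id.org/ro/crate/1.1/context",
  "@graph": [
    {"@id": "ro-crate-metadata.json",
     "@type": "CreativeWork",
     "conformsTo": {"@id": "https://w3id.org/ro/crate/1.1"},
     "about": {"@id": "./"}},
    {"@id": "./",
     "@type": "Dataset",
     "name": "tae",
     "hasPart": [{"@id": "tae.arff"}]},
    {"@id": "tae.arff",
     "@type": "File",
     "name": "tae.arff"}
  ]
}
"""

def index_entities(metadata):
    """Index every entity in the flat @graph list by its @id."""
    return {entity["@id"]: entity for entity in metadata["@graph"]}

def root_dataset(metadata):
    """Find the root dataset via the metadata descriptor's "about" link."""
    entities = index_entities(metadata)
    descriptor = entities["ro-crate-metadata.json"]
    return entities[descriptor["about"]["@id"]]

def listed_files(metadata):
    """Return the @id of each entity the root dataset lists under hasPart."""
    root = root_dataset(metadata)
    return [part["@id"] for part in root.get("hasPart", [])]

metadata = json.loads(EXAMPLE_METADATA)
root = root_dataset(metadata)
files = listed_files(metadata)
```

Resolving the root through the descriptor, rather than hard-coding "./", follows how the RO-Crate 1.1 specification identifies the root data entity.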


