haberman

From MaRDI portal
Dataset:6032899



OpenML43MaRDI QIDQ6032899

OpenML dataset with id 43

No author found.

Full work available at URL: https://api.openml.org/data/v1/download/43/haberman.arff

Upload date: 6 April 2014


Dataset Characteristics

Number of classes: 2
Number of features: 4 (numeric: 2, symbolic: 2 and in total binary: 1 )
Number of instances: 306
Number of instances with missing values: 0
Number of missing values: 0

Author: Source: Unknown - Please cite:

1. Title: Haberman's Survival Data

2. Sources:
   (a) Donor:   Tjen-Sien Lim (limt@stat.wisc.edu)
   (b) Date:    March 4, 1999

3. Past Usage:
   1. Haberman, S. J. (1976). Generalized Residuals for Log-Linear
      Models, Proceedings of the 9th International Biometrics
      Conference, Boston, pp. 104-122.
   2. Landwehr, J. M., Pregibon, D., and Shoemaker, A. C. (1984),
      Graphical Models for Assessing Logistic Regression Models (with
      discussion), Journal of the American Statistical Association 79:
      61-83.
   3. Lo, W.-D. (1993). Logistic Regression Trees, PhD thesis,
      Department of Statistics, University of Wisconsin, Madison, WI.

4. Relevant Information:
   The dataset contains cases from a study that was conducted between
   1958 and 1970 at the University of Chicago's Billings Hospital on
   the survival of patients who had undergone surgery for breast
   cancer.

5. Number of Instances: 306

6. Number of Attributes: 4 (including the class attribute)

7. Attribute Information:
   1. Age of patient at time of operation (numerical)
   2. Patient's year of operation (year - 1900, numerical)
   3. Number of positive axillary nodes detected (numerical)
   4. Survival status (class attribute)
         1 = the patient survived 5 years or longer
         2 = the patient died within 5 year

8. Missing Attribute Values: None
Information about the dataset
CLASSTYPE: nominal
CLASSINDEX: last