sylva_prior
OpenML dataset with id 1040
No author found.
Full work available at URL: https://api.openml.org/data/v1/download/53923/sylva_prior.arff
Upload date: 6 October 2014
Dataset Characteristics
Number of classes: 2
Number of features: 109 (numeric: 108, symbolic: 1 and in total binary: 1 )
Number of instances: 14,395
Number of instances with missing values: 0
Number of missing values: 0
Author: Source: Unknown - Date unknown Please cite:
Datasets from the Agnostic Learning vs. Prior Knowledge Challenge (http://www.agnostic.inf.ethz.ch)
Note: Derived from the covertype dataset
Dataset from: http://www.agnostic.inf.ethz.ch/datasets.php
Modified by TunedIT (converted to ARFF format)
SYLVA is the ecology database
The task of SYLVA is to classify forest cover types. The forest cover type for 30 x 30 meter cells is obtained from US Forest Service (USFS) Region 2 Resource Information System (RIS) data. We brought it back to a two-class classification problem (classifying Ponderosa pine vs. everything else). The "agnostic learning track" data consists in 216 input variables. Each pattern is composed of 4 records: 2 true records matching the target and 2 records picked at random. Thus 1/2 of the features are distracters. The "prior knowledge track" data is identical to the "agnostic learning track" data, except that the distracters are removed and the identity of the features is revealed. For that track, the forest cover original ids are revealed for training data.
Data type: non-sparse
Number of features: 108
Number of examples and check-sums:
Pos_ex Neg_ex Tot_ex Check_sum
Train 805 12281 13086 118996108.00
Valid 81 1228 1309 11904801.00
This page was built for dataset: sylva_prior