sylva_prior

OpenML dataset with id 1040

No author found.

Full work available at URL: https://api.openml.org/data/v1/download/53923/sylva_prior.arff

Upload date: 6 October 2014

Dataset Characteristics

Number of classes: 2
Number of features: 109 (numeric: 108, symbolic: 1 and in total binary: 1 )
Number of instances: 14,395
Number of instances with missing values: 0
Number of missing values: 0

Description

Author: Source: Unknown - Date unknown Please cite:

Datasets from the Agnostic Learning vs. Prior Knowledge Challenge (http://www.agnostic.inf.ethz.ch)

Note: Derived from the covertype dataset

Dataset from: http://www.agnostic.inf.ethz.ch/datasets.php

Modified by TunedIT (converted to ARFF format)

SYLVA is the ecology database

The task of SYLVA is to classify forest cover types. The forest cover type for 30 x 30 meter cells is obtained from US Forest Service (USFS) Region 2 Resource Information System (RIS) data. We brought it back to a two-class classification problem (classifying Ponderosa pine vs. everything else). The "agnostic learning track" data consists in 216 input variables. Each pattern is composed of 4 records: 2 true records matching the target and 2 records picked at random. Thus 1/2 of the features are distracters. The "prior knowledge track" data is identical to the "agnostic learning track" data, except that the distracters are removed and the identity of the features is revealed. For that track, the forest cover original ids are revealed for training data.

Data type: non-sparse Number of features: 108 Number of examples and check-sums: Pos_ex Neg_ex Tot_ex Check_sum Train 805 12281 13086 118996108.00 Valid 81 1228 1309 11904801.00

This page was built for dataset: sylva_prior