letter

From MaRDI portal
Dataset:6032859



OpenML6MaRDI QIDQ6032859

OpenML dataset with id 6

No author found.

Full work available at URL: https://api.openml.org/data/v1/download/6/letter.arff

Upload date: 6 April 2014


Dataset Characteristics

Number of classes: 26
Number of features: 17 (numeric: 16, symbolic: 1 and in total binary: 0 )
Number of instances: 20,000
Number of instances with missing values: 0
Number of missing values: 0

Author: David J. Slate Source: UCI - 01-01-1991 Please cite: P. W. Frey and D. J. Slate. "Letter Recognition Using Holland-style Adaptive Classifiers". Machine Learning 6(2), 1991

1. TITLE:

 Letter Image Recognition Data 

   The objective is to identify each of a large number of black-and-white
   rectangular pixel displays as one of the 26 capital letters in the English
   alphabet.  The character images were based on 20 different fonts and each
   letter within these 20 fonts was randomly distorted to produce a file of
   20,000 unique stimuli.  Each stimulus was converted into 16 primitive
   numerical attributes (statistical moments and edge counts) which were then
   scaled to fit into a range of integer values from 0 through 15.  We
   typically train on the first 16000 items and then use the resulting model
   to predict the letter category for the remaining 4000.  See the article
   cited above for more details.