Kuzushiji-MNIST

From MaRDI portal
Dataset:6035774



OpenML41982MaRDI QIDQ6035774

OpenML dataset with id 41982

No author found.

Full work available at URL: https://api.openml.org/data/v1/download/21388379/Kuzushiji-MNIST.arff

Upload date: 23 July 2019
Copyright license: Creative Commons Attribution-ShareAlike 4.0 International



Dataset Characteristics

Number of classes: 10
Number of features: 785 (numeric: 784, symbolic: 1 and in total binary: 0 )
Number of instances: 70,000
Number of instances with missing values: 0
Number of missing values: 0

Much of machine learning research focuses on producing models which perform well on benchmark tasks, in turn improving our understanding of the challenges associated with those tasks. From the perspective of ML researchers, the content of the task itself is largely irrelevant, and thus there have increasingly been calls for benchmark tasks to more heavily focus on problems which are of social or cultural relevance. In this work, we introduce Kuzushiji-MNIST, a dataset which focuses on Kuzushiji (cursive Japanese), as well as two larger, more challenging datasets, Kuzushiji-49 and Kuzushiji-Kanji. Through these datasets, we wish to engage themachine learning community into the world of classical Japanese literature.



This page was built for dataset: Kuzushiji-MNIST