JapaneseVowels (Q6033114): Difference between revisions

Latest revision as of 12:27, 16 April 2024

OpenML dataset with id 375

Language	Label	Description	Also known as
English	JapaneseVowels	OpenML dataset with id 375

Statements

instance of

data set

0 references

dataset version identifier

1

0 references

description

**Author**: Mineichi Kudo, Jun Toyama, Masaru Shimbo
**Source**: [UCI](https://archive.ics.uci.edu/ml/datasets/Japanese+Vowels)
**Please cite**:

**Japanese vowels**
This dataset records 640 time series of 12 LPC cepstrum coefficients taken from nine male speakers.

The data were collected to evaluate our newly developed classifier for multidimensional curves (multidimensional time series). Nine male speakers uttered two Japanese vowels /ae/ successively. For each utterance, we applied 12-degree linear prediction analysis, with the analysis parameters described below, to obtain a discrete-time series of 12 LPC cepstrum coefficients. One utterance by a speaker therefore forms a time series of 7 to 29 points, and each point has 12 features (the 12 coefficients).

Similar data are available for the utterances /ei/, /iu/, /uo/ and /oa/ in addition to /ae/. Please contact the donor if you are interested in using these data.

There are 640 time series in total. We used one set of 270 time series for training and the other set of 370 time series for testing.

Analysis parameters:
* Sampling rate: 10 kHz
* Frame length: 25.6 ms
* Shift length: 6.4 ms
* Degree of LPC coefficients: 12

Each line holds the 12 LPC cepstrum coefficients of one analysis frame, in increasing order and separated by spaces. Lines are organized into blocks separated by blank lines. Each block contains 7 to 29 lines and corresponds to a single utterance of /ae/ with 7 to 29 frames.

Each speaker is a set of consecutive blocks. In ae.train there are 30 blocks for each speaker: blocks 1-30 represent speaker 1, blocks 31-60 represent speaker 2, and so on up to speaker 9. In ae.test, speakers 1 to 9 have the following numbers of blocks: 31 35 88 44 29 24 40 50 29. Thus, blocks 1-31 represent speaker 1 (31 utterances of /ae/), blocks 32-66 represent speaker 2 (35 utterances of /ae/), and so on.

**Past Usage**

M. Kudo, J. Toyama and M. Shimbo. (1999). "Multidimensional Curve Classification Using Passing-Through Regions". Pattern Recognition Letters, Vol. 20, No. 11-13, pages 1103-1111.

If you publish any work using the dataset, please inform the donor. Use for commercial purposes requires donor permission.

References

1. http://ips9.main.eng.hokudai.ac.jp/index_e.html
2. mailto:mine@main.eng.hokudai.ac.jp
3. mailto:jun@main.eng.hokudai.ac.jp
4. mailto:shimbo@main.eng.hokudai.ac.jp
5. http://kdd.ics.uci.edu/
6. http://www.ics.uci.edu/
7. http://www.uci.edu/

0 references
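
The block layout given in the description (one frame per line, blocks separated by blank lines, speakers stored as consecutive runs of blocks) can be read with a short parser. The following is a minimal sketch, not the donors' own tooling. It assumes the original UCI text files ae.train and ae.test rather than the ARFF file, and the names `parse_blocks`, `speaker_labels`, `TRAIN_COUNTS` and `TEST_COUNTS` are hypothetical helpers introduced here for illustration.

```python
def parse_blocks(text):
    """Split raw file text into blocks (utterances).

    Each block is a list of frames; each frame is a list of floats
    (12 coefficients per frame in the real files). Blank lines
    separate blocks.
    """
    blocks, current = [], []
    for line in text.splitlines():
        if line.strip():
            current.append([float(value) for value in line.split()])
        elif current:
            blocks.append(current)
            current = []
    if current:  # the last block may not be followed by a blank line
        blocks.append(current)
    return blocks


def speaker_labels(blocks_per_speaker):
    """Expand per-speaker block counts into one 1-based label per block."""
    labels = []
    for speaker, count in enumerate(blocks_per_speaker, start=1):
        labels.extend([speaker] * count)
    return labels


# Block counts per speaker, taken from the description above.
TRAIN_COUNTS = [30] * 9
TEST_COUNTS = [31, 35, 88, 44, 29, 24, 40, 50, 29]
```

With the real files, `parse_blocks(open("ae.test").read())` would be expected to return 370 blocks, which can be paired with `speaker_labels(TEST_COUNTS)` by position. The file path is an assumption.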

author name string

Mineichi Kudo

object has role

creator

0 references

Jun Toyama

object has role

creator

0 references

Masaru Shimbo

object has role

creator

0 references

upload date

27 September 2014

0 references

full work available at URL

https://api.openml.org/data/v1/download/52415/JapaneseVowels.arff

0 references

https://archive.ics.uci.edu/ml/datasets/Japanese+Vowels

0 references

default target attribute

speaker

0 references

https://www.sciencedirect.com/science/article/pii/S016786559900077X

0 references

checksum

8b59d000ae5310afb26ec66e97fc7724

determination method

MD5

0 references

number of binary features

0

0 references

number of classes

9

0 references

number of features

15

0 references

number of instances

9,961

0 references

number of instances with missing values

0

0 references

number of missing values

0

0 references

number of numeric features

14

0 references

number of symbolic features

1

0 references

file format

ARFF

0 references

MaRDI profile type

MaRDI dataset profile

0 references

Identifiers

OpenML dataset ID

375

0 references

Sitelinks

Mathematics (1 entry)

mardi Dataset:6033114

Revision as of 10:11, 15 April 2024, by Importer (bot, 7,068,092 edits): Created a new Item
Latest revision as of 12:27, 16 April 2024, by Import240416010454 (10,906 edits): Added link to MaRDI item (links / mardi / name: Dataset:6033114)