A robust method for speech emotion recognition based on infinite Student's \(t\)-mixture model (Q1665771): Difference between revisions

Summary: Speech emotion classification method, proposed in this paper, is based on Student's \(t\)-mixture model with infinite component number (iSMM) and can directly conduct effective recognition for various kinds of speech emotion samples. Compared with the traditional GMM (Gaussian mixture model), speech emotion model based on Student's \(t\)-mixture can effectively handle speech sample outliers that exist in the emotion feature space. Moreover, \(t\)-mixture model could keep robust to atypical emotion test data. In allusion to the high data complexity caused by high-dimensional space and the problem of insufficient training samples, a global latent space is joined to emotion model. Such an approach makes the number of components divided infinite and forms an iSMM emotion model, which can automatically determine the best number of components with lower complexity to complete various kinds of emotion characteristics data classification. Conducted over one spontaneous (FAU Aibo Emotion Corpus) and two acting (DES and EMO-DB) universal speech emotion databases which have high-dimensional feature samples and diversiform data distributions, the iSMM maintains better recognition performance than the comparisons. Thus, the effectiveness and generalization to the high-dimensional data and the outliers are verified. Hereby, the iSMM emotion model is verified as a robust method with the validity and generalization to outliers and high-dimensional emotion characters.

0 references

describes a project that uses

OpenEAR

0 references

MaRDI profile type

MaRDI publication profile

0 references

full work available at URL

https://doi.org/10.1155/2015/475810

0 references

cites work

Survey on speech emotion recognition: features, classification schemes, and databases

0 references

Analytic calculations for the EM algorithm for multivariate skew-\(t\) mixture models

0 references

Q4363980

0 references

Hidden Markov Models With Stick-Breaking Priors

0 references

Extending mixtures of multivariate \(t\)-factor analyzers

0 references

Kullback-leibler approximation of spectral density functions

0 references

Identifiers

zbMATH Open document ID

1394.68347

0 references

DOI

10.1155/2015/475810

0 references

Mathematics Subject Classification ID

0 references

0 references

0 references

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:1665771

@@ Property / cites work @@
+Survey on speech emotion recognition: features, classification schemes, and databases
+Normal rank
@@ Property / cites work @@
+Analytic calculations for the EM algorithm for multivariate skew-\(t\) mixture models
+Normal rank
@@ Property / cites work @@
+Q4363980
@@ Property / cites work: Q4363980 / rank @@
+Normal rank
@@ Property / cites work @@
+Hidden Markov Models With Stick-Breaking Priors
@@ Property / cites work: Hidden Markov Models With Stick-Breaking Priors / rank @@
+Normal rank
@@ Property / cites work @@
+Extending mixtures of multivariate \(t\)-factor analyzers
+Normal rank
@@ Property / cites work @@
+Kullback-leibler approximation of spectral density functions
+Normal rank