Randomly initialized EM algorithm for two-component Gaussian mixture achieves near optimality in \(O(\sqrt{n})\) iterations (Q2113265)

From MaRDI portal

Jump to:navigation, search

scientific article

Language	Label	Description	Also known as
English	Randomly initialized EM algorithm for two-component Gaussian mixture achieves near optimality in \(O(\sqrt{n})\) iterations	scientific article

Statements

scholarly article

0 references

Randomly initialized EM algorithm for two-component Gaussian mixture achieves near optimality in \(O(\sqrt{n})\) iterations (English)

0 references

0 references

Mathematical Statistics and Learning

0 references

publication date

11 March 2022

0 references

full work available at URL

https://arxiv.org/abs/1908.10935

0 references

Summary: We analyze the classical EM algorithm for parameter estimation in the symmetric two-component Gaussian mixtures in \(d\) dimensions. We show that, even in the absence of any separation between components, provided that the sample size satisfies \(n=\Omega (d \log^4 d)\), the randomly initialized EM algorithm converges to an estimate in at most \(O(\sqrt{n})\) iterations with high probability, which is at most \(O((d/n)^{1/4} \log n)\) in Euclidean distance from the true parameter and within logarithmic factors of the minimax rate of \((d/n)^{1/4}\). Both the nonparametric statistical rate and the sublinear convergence rate are direct consequences of the zero Fisher information in the worst case. Refined pointwise guarantees beyond worst-case analysis and convergence to the MLE are also shown under mild conditions. This improves the previous result of \textit{S. Balakrishnan} et al. [Ann. Stat. 45, No. 1, 77--120 (2017; Zbl 1367.62052)], which requires strong conditions on both the separation of the components and the quality of the initialization, and that of \textit{C. Daskalakis}, \textit{C. Tzamos} and \textit{M. Zampetakis} [``Ten steps of EM suffice for mixtures of two Gaussians'', Proc. Mach. Learn. Res. (PMLR) 65, 704--710 (2017)], which requires sample splitting and restarting the EM iteration.

0 references

zbMATH Keywords

EM algorithm

0 references

Gaussian mixture

0 references

minimax rates

0 references

convergence rate

0 references

random initialization

0 references

MaRDI profile type

MaRDI publication profile

0 references

Statistical guarantees for the EM algorithm: from population to sample-based analysis

0 references

Choosing starting values for the EM algorithm for getting the highest likelihood in multivariate Gaussian mixture models

0 references

Sparse PCA: optimal rates and adaptive estimation

0 references

Gradient descent with random initialization: fast global convergence for nonconvex phase retrieval

0 references

Trust Region Methods

0 references

0 references

Singularity, misspecification and the convergence rate of EM

0 references

On the rate of convergence in Wasserstein distance of the empirical measure

0 references

Rates of convergence for the Gaussian mixture sieve.

0 references

Entropies and rates of convergence for maximum likelihood and Bayes estimation for mixtures of normal densities.

0 references

Convergence rates of parameter estimation for some weakly identifiable finite mixtures

0 references

Choosing initial values for the EM algorithm for finite mixtures

0 references

Adaptive estimation of a quadratic functional by model selection.

0 references

The landscape of empirical risk for nonconvex losses

0 references

Dissipation of Information in Channels With Input Constraints

0 references

Mixture Densities, Maximum Likelihood and the EM Algorithm

0 references

On the nonparametric maximum likelihood estimator for Gaussian location mixture densities with application to Gaussian denoising

0 references

0 references

The transportation cost from the uniform measure to the empirical measure in dimension \(\geq 3\)

0 references

0 references

Asymptotic Statistics

0 references

Functional Properties of Minimum Mean-Square Error and Mutual Information

0 references

Optimal estimation of Gaussian mixtures via denoised method of moments

0 references

Information-theoretic determination of minimax rates of convergence

0 references

0 references

Identifiers

zbMATH Open document ID

0 references

0 references

Mathematics Subject Classification ID

0 references

0 references

zbMATH DE Number

0 references

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:2113265

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Item:Q2113265&oldid=36777028"