Estimation of the number of components of nonparametric multivariate finite mixture models

From MaRDI portal
Publication:2054486

DOI10.1214/20-AOS2032zbMATH Open1486.62085arXiv1908.03656OpenAlexW3202852176MaRDI QIDQ2054486FDOQ2054486


Authors: Caleb Kwon, Eric Mbakop Edit this on Wikidata


Publication date: 3 December 2021

Published in: The Annals of Statistics (Search for Journal in Brave)

Abstract: We propose a novel estimator for the number of components (denoted by M) in a K-variate non-parametric finite mixture model, where the analyst has repeated observations of Kgeq2 variables that are independent given a finitely supported unobserved variable. Under a mild assumption on the joint distribution of the observed and latent variables, we show that an integral operator T, that is identified from the data, has rank equal to M. Using this observation, and the fact that singular values are stable under perturbations, the estimator of M that we propose is based on a thresholding rule which essentially counts the number of singular values of a consistent estimator of T that are greater than a data-driven threshold. We prove that our estimator of M is consistent, and establish non-asymptotic results which provide finite sample performance guarantees for our estimator. We present a Monte Carlo study which shows that our estimator performs well for samples of moderate size.


Full work available at URL: https://arxiv.org/abs/1908.03656




Recommendations




Cites Work


Cited In (10)





This page was built for publication: Estimation of the number of components of nonparametric multivariate finite mixture models

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q2054486)