Investigation of parameter uncertainty in clustering using a Gaussian mixture model via jackknife, bootstrap and weighted likelihood bootstrap
From MaRDI portal
Publication:2282602
Abstract: Mixture models are a popular tool in model-based clustering. Such a model is often fitted by a procedure that maximizes the likelihood, such as the EM algorithm. At convergence, the maximum likelihood parameter estimates are typically reported, but in most cases little emphasis is placed on the variability associated with these estimates. In part this may be due to the fact that standard errors are not directly calculated in the model-fitting algorithm, either because they are not required to fit the model, or because they are difficult to compute. The examination of standard errors in model-based clustering is therefore typically neglected. The widely used R package mclust has recently introduced bootstrap and weighted likelihood bootstrap methods to facilitate standard error estimation. This paper provides an empirical comparison of these methods (along with the jackknife method) for producing standard errors and confidence intervals for mixture parameters. These methods are illustrated and contrasted in both a simulation study and in the traditional Old Faithful data set and Thyroid data set.
Recommendations
- Bootstrap validation of the estimated parameters in mixture models used for clustering
- Standard errors of fitted component means of normal mixture
- Weighted likelihood mixture modeling and model-based clustering
- Mixture Models, Robustness, and the Weighted Likelihood Methodology
- How Many Clusters? Which Clustering Method? Answers Via Model-Based Cluster Analysis
Cites work
- scientific article; zbMATH DE number 3886919 (Why is no real title available?)
- scientific article; zbMATH DE number 4104359 (Why is no real title available?)
- scientific article; zbMATH DE number 3782216 (Why is no real title available?)
- scientific article; zbMATH DE number 3567782 (Why is no real title available?)
- scientific article; zbMATH DE number 509150 (Why is no real title available?)
- scientific article; zbMATH DE number 708500 (Why is no real title available?)
- scientific article; zbMATH DE number 1059776 (Why is no real title available?)
- scientific article; zbMATH DE number 1104922 (Why is no real title available?)
- scientific article; zbMATH DE number 849934 (Why is no real title available?)
- A Look at Some Data on the Old Faithful Geyser
- A Three-step Method for Choosing the Number of Bootstrap Repetitions
- A handbook of statistical analyses using R.
- A note on the delete-d jackknife variance estimators
- A sequentially constructed design for estimating a nonlinear parametric function
- Bootstrapping generalized linear models
- Computing empirical likelihood from the bootstrap
- Estimating the Propagation Rate of a Viral Infection of Potato Plants via Mixtures of Regressions
- Estimating the dimension of a model
- Extremum estimation and numerical derivatives
- Finite mixture models
- Fitting finite mixtures of generalized linear regressions in \textsf{R}
- How Many Clusters? Which Clustering Method? Answers Via Model-Based Cluster Analysis
- Incorrect asymptotic size of subsampling procedures based on post-consistent model selection estimators
- Jackknife, bootstrap and other resampling methods in regression analysis
- MODEL SELECTION AND INFERENCE: FACTS AND FICTION
- Maximum likelihood estimation of the multivariate normal mixture model
- Missing Data, Imputation, and the Bootstrap
- Model-Based Clustering, Classification, and Density Estimation Using mclust in R
- Model-Based Clustering, Discriminant Analysis, and Density Estimation
- Model-based clustering and classification with non-normal mixture distributions
- NOTES ON BIAS IN ESTIMATION
- Nonparametric estimates of standard error: The jackknife, the bootstrap and other methods
- On non-singular information matrices and local identifiability
- Sieve bootstrap for time series
- Standard errors of fitted component means of normal mixture
- The jackknife estimate of variance
Cited in
(8)- Model-based clustering of count processes
- Efficient sampling from the PKBD distribution
- Early identification of an impending rockslide location via a spatially-aided Gaussian mixture model
- Calibrated model-based evidential clustering using bootstrapping
- The jackknife-like method for assessing uncertainty of point estimates for Bayesian estimation in a finite Gaussian mixture model
- The anatomy of sorting -- evidence from Danish data
- Bootstrap validation of the estimated parameters in mixture models used for clustering
- Assessing the variability of posterior probabilities in Gaussian model-based clustering
This page was built for publication: Investigation of parameter uncertainty in clustering using a Gaussian mixture model via jackknife, bootstrap and weighted likelihood bootstrap
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q2282602)