Discriminative variable selection for clustering with the sparse Fisher-EM algorithm
Abstract: Interest in variable selection for clustering has grown recently, driven by the increasing need to cluster high-dimensional data. In particular, variable selection eases both the clustering and the interpretation of the results. Existing approaches have demonstrated the effectiveness of variable selection for clustering but turn out to be either very time-consuming or not sparse enough in high-dimensional spaces. This work proposes to select the discriminative variables by introducing sparsity in the loading matrix of the Fisher-EM algorithm. This clustering method was recently proposed for the simultaneous visualization and clustering of high-dimensional data. It is based on a latent mixture model which fits the data into a low-dimensional discriminative subspace. Three different approaches are proposed in this work to introduce sparsity in the orientation matrix of the discriminative subspace through \(\ell_1\)-type penalizations. Experimental comparisons with existing approaches on simulated and real-world data sets demonstrate the value of the proposed methodology. An application to the segmentation of hyperspectral images of the planet Mars is also presented.
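As a rough illustration of why an \(\ell_1\)-type penalty on a loading matrix leads to variable selection, the sketch below applies entrywise soft-thresholding (the proximal operator of the \(\ell_1\) norm) to a small matrix and reports which variables keep a nonzero loading. This is a generic textbook device, not one of the three procedures proposed in the paper; the function names, the matrix `U`, and the penalty value are all hypothetical.

```python
def soft_threshold(value, penalty):
    """Proximal operator of penalty * |value|: shrink toward zero, clip at zero."""
    if value > penalty:
        return value - penalty
    if value < -penalty:
        return value + penalty
    return 0.0


def sparsify_loadings(loadings, penalty):
    """Soft-threshold every entry of a p x d loading matrix (list of rows)."""
    return [[soft_threshold(u, penalty) for u in row] for row in loadings]


def selected_variables(loadings):
    """Indices of variables (rows) that keep at least one nonzero loading."""
    return [j for j, row in enumerate(loadings) if any(u != 0.0 for u in row)]


# Hypothetical loading matrix: 4 variables (rows), 2 discriminative axes (columns).
# The numbers are made up for illustration only.
U = [
    [0.75, -0.5],
    [0.125, 0.0625],
    [-0.25, 0.875],
    [0.0625, -0.125],
]

U_sparse = sparsify_loadings(U, penalty=0.25)
print(selected_variables(U_sparse))  # prints [0, 2]
```

Variables whose loadings are all small in magnitude (rows 1 and 3 here) are set exactly to zero and drop out of the subspace, which is the sense in which a sparse loading matrix performs variable selection.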
Recommendations
- Simultaneous model-based clustering and visualization in the Fisher discriminative subspace
- Variable Selection for Model-Based High-Dimensional Clustering and Its Application to Microarray Data
- Variable selection in model-based clustering and discriminant analysis with a regularization approach
- scientific article; zbMATH DE number 5251644
- Variable selection for clustering and classification
Cites work
- scientific article; zbMATH DE number 3126094
- scientific article; zbMATH DE number 47593
- A framework for feature selection in clustering
- A penalized matrix decomposition, with applications to sparse principal components and canonical correlation analysis
- An Optimal Set of Discriminant Vectors
- Dimensionally reduced mixtures of regression models
- Discriminative variable selection for clustering with the sparse Fisher-EM algorithm
- Estimating the number of clusters in a data set via the gap statistic
- Heteroscedastic factor mixture analysis
- High-Dimensional Discriminant Analysis
- High-dimensional data clustering
- Least angle regression. (With discussion)
- Letter to the Editor
- Modelling high-dimensional data by mixtures of factor analyzers
- On the "degrees of freedom" of the lasso
- Penalized factor mixture analysis for variable selection in clustered data
- Penalized model-based clustering
- Penalized model-based clustering with application to variable selection
- Procrustes Problems
- Regularization and Variable Selection Via the Elastic Net
- Simultaneous model-based clustering and visualization in the Fisher discriminative subspace
- Sparse linear discriminant analysis with applications to high dimensional low sample size data
- Theoretical and practical considerations on the convergence properties of the Fisher-EM algorithm
- Variable Selection for Clustering with Gaussian Mixture Models
- Variable Selection for Model-Based Clustering
- Variable Selection for Model-Based High-Dimensional Clustering and Its Application to Microarray Data
- Variable selection in model-based clustering: a general variable role modeling
Cited in (21)
- Model-based clustering of high-dimensional data: a review
- Bayesian inference for infinite asymmetric Gaussian mixture with feature selection
- A Bayesian Fisher-EM algorithm for discriminative Gaussian subspace clustering
- Model-based clustering
- Variable selection methods for model-based clustering
- Multivariate response and parsimony for Gaussian cluster-weighted models
- Discriminative variable selection for clustering with the sparse Fisher-EM algorithm
- On the estimation of the latent discriminative subspace in the Fisher-EM algorithm
- Principles of experimental design for big data analysis
- Variable selection in model-based clustering and discriminant analysis with a regularization approach
- Sparse and geometry-aware generalisation of the mutual information for joint discriminative clustering and feature selection
- Cluster analysis with cellwise trimming and applications for the robust clustering of curves
- Sparse matrices in data analysis
- The discriminative functional mixture model for a comparative analysis of bike sharing systems
- Consistency of variational Bayes inference for estimation and model selection in mixtures
- A mixture of generalized hyperbolic factor analyzers
- TLS-EM algorithm of mixture density models for exponential families
- Simultaneous model-based clustering and visualization in the Fisher discriminative subspace
- A hierarchical Bayesian approach for examining heterogeneity in choice decisions
- Factor probabilistic distance clustering (FPDC): a new clustering method
- On variable selection in matrix mixture modelling
This page was built for publication: Discriminative variable selection for clustering with the sparse Fisher-EM algorithm
MaRDI item: Q2259731