High-dimensional data clustering
From MaRDI portal
Abstract: Clustering in high-dimensional spaces is a difficult problem which is recurrent in many domains, for example in image analysis. The difficulty is due to the fact that high-dimensional data usually live in different low-dimensional subspaces hidden in the original space. This paper presents a family of Gaussian mixture models designed for high-dimensional data which combine the ideas of dimension reduction and parsimonious modeling. These models give rise to a clustering method based on the Expectation-Maximization algorithm which is called High-Dimensional Data Clustering (HDDC). In order to correctly fit the data, HDDC estimates the specific subspace and the intrinsic dimension of each group. Our experiments on artificial and real datasets show that HDDC outperforms existing methods for clustering high-dimensional data.
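The abstract's central idea, that each group has its own low-dimensional subspace and intrinsic dimension, can be illustrated with a small NumPy sketch. It is not the paper's implementation: the function names `fit_group_subspace` and `log_density`, the cumulative-variance threshold used to choose the dimension, and the single shared noise variance outside the subspace are assumptions made for illustration. For one group of points, the sketch eigendecomposes the empirical covariance, keeps the leading directions, and averages the remaining eigenvalues into one noise variance.

```python
import numpy as np


def fit_group_subspace(X, var_threshold=0.95):
    """Estimate mean, subspace basis, variances and dimension for one group.

    Assumption (not from the source): the intrinsic dimension d is the
    smallest number of eigen-directions explaining `var_threshold` of the
    total variance.
    """
    p = X.shape[1]
    mu = X.mean(axis=0)
    cov = np.cov(X, rowvar=False)                 # p x p empirical covariance
    eigvals, eigvecs = np.linalg.eigh(cov)        # eigenvalues in ascending order
    eigvals = np.clip(eigvals[::-1], 0.0, None)   # descending, non-negative
    eigvecs = eigvecs[:, ::-1]
    ratio = np.cumsum(eigvals) / eigvals.sum()
    d = int(np.searchsorted(ratio, var_threshold)) + 1
    d = min(d, p - 1)                             # keep at least one noise direction
    a = eigvals[:d]                               # variances inside the subspace
    b = max(eigvals[d:].mean(), 1e-12)            # shared noise variance outside it
    return mu, eigvecs[:, :d], a, b, d


def log_density(x, mu, Q, a, b):
    """Gaussian log-density with covariance Q diag(a) Q^T + b (I - Q Q^T)."""
    p, d = Q.shape
    y = x - mu
    proj = Q.T @ y                                # coordinates in the subspace
    resid = max(y @ y - proj @ proj, 0.0)         # squared distance to the subspace
    quad = np.sum(proj**2 / a) + resid / b
    logdet = np.sum(np.log(a)) + (p - d) * np.log(b)
    return -0.5 * (quad + logdet + p * np.log(2.0 * np.pi))


# Toy data (hypothetical): a 2-dimensional group embedded in 10 dimensions
# plus small isotropic noise.
rng = np.random.default_rng(0)
basis, _ = np.linalg.qr(rng.normal(size=(10, 2)))
latent = rng.normal(size=(500, 2)) * np.array([3.0, 2.0])
X = latent @ basis.T + 0.1 * rng.normal(size=(500, 10))

mu, Q, a, b, d = fit_group_subspace(X)
print("estimated intrinsic dimension:", d)
```

In an EM-based clustering loop, a function like `log_density` would supply each group's term in the E-step and a responsibility-weighted variant of `fit_group_subspace` would serve as the M-step; the sketch leaves out the weighting and the loop itself.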
Recommendations
- scientific article; zbMATH DE number 5280158
- Clustering high dimensional massive scientific datasets
- Clustering High-Dimensional Data via Feature Selection
- The challenges of clustering high dimensional data
- scientific article; zbMATH DE number 2112095
- Introduction to clustering large and high-dimensional data
Cites work
- scientific article; zbMATH DE number 3126094
- scientific article; zbMATH DE number 3567782
- scientific article; zbMATH DE number 1059776
- 10.1162/153244303322753616
- A classification EM algorithm for clustering and two stochastic versions
- A maximum likelihood methodology for clusterwise linear regression
- A mixture model for the classification of three-way proximity data
- A nonlinear PCA based on manifold approximation
- ARPACK Users' Guide
- An Algorithm for Simultaneous Orthogonal Transformation of Several Positive Definite Symmetric Matrices to Nearly Diagonal Form
- Automatische Klassifikation
- Detection and Characterization of Cluster Substructure I. Linear Structure: Fuzzy c-Lines
- Dimensionality reduction in quadratic discriminant analysis
- Discriminant Analysis with Singular Covariance Matrices: Methods and Applications to Spectroscopic Data
- Effect of dimensionality on discrimination
- Estimating Mixtures of Normal Distributions and Switching Regressions
- Estimating the dimension of a model
- Finite mixture models
- High-dimensional data clustering
- Model-Based Clustering, Discriminant Analysis, and Density Estimation
- Model-Based Gaussian and Non-Gaussian Clustering
- Modelling high-dimensional data by mixtures of factor analyzers
- On feature selection, curse-of-dimensionality and error probability in discriminant analysis
- Principal Curves
- Principal component analysis
- Probabilistic models in cluster analysis
- The Discrimination Subspace Model
- Variable Selection for Model-Based Clustering
Cited in
- Finite mixtures of matrix normal distributions for classifying three-way data
- Model-based clustering, classification, and discriminant analysis of data with mixed type
- High-dimensional data clustering
- Image super-resolution with PCA reduced generalized Gaussian mixture models in materials science
- Inverse regression approach to robust nonlinear high-to-low dimensional mapping
- Gaussian mixture model with an extended ultrametric covariance structure
- A mixture of common skew-\(t\) factor analysers
- Flexible clustering of high-dimensional data via mixtures of joint generalized hyperbolic distributions
- High-dimensional clustering via random projections
- Initializing the EM algorithm in Gaussian mixture models with an unknown number of components
- The generic subspace clustering model
- Efficient mixture model for clustering of sparse high dimensional binary data
- Large values of the clustering coefficient
- Learning from partially supervised data using mixture models and belief functions
- Mixtures of generalized hyperbolic distributions and mixtures of skew-t distributions for model-based clustering with incomplete data
- Model-based clustering of high-dimensional data: a review
- Model-based clustering of longitudinal data
- Flexible mixture modeling via the multivariate t distribution with the Box-Cox transformation: an alternative to the skew-t distribution
- Flexible mixture regression with the generalized hyperbolic distribution
- Hyper-spectral data clustering method based upon the sensitive subspace
- scientific article; zbMATH DE number 5280158
- Clustering gene expression time course data using mixtures of multivariate \(t\)-distributions
- Stable and visualizable Gaussian parsimonious clustering models
- A Bayesian Fisher-EM algorithm for discriminative Gaussian subspace clustering
- A hidden Markov model applied to the protein 3D structure analysis
- Model-based clustering of high-dimensional data streams with online mixture of probabilistic PCA
- Parsimonious skew mixture models for model-based clustering and classification
- Model-based clustering
- scientific article; zbMATH DE number 5817573
- scientific article; zbMATH DE number 1832310
- Clustering analysis of multivariate data: a weighted spatial ranks-based approach
- Functional data clustering via information maximization
- scientific article; zbMATH DE number 7578295
- Clustering and classification via cluster-weighted factor analyzers
- Functional data clustering: a survey
- Analyzing state-dependent model-data comparison in multi-regime systems
- Parsimonious ultrametric Gaussian mixture models
- Variable selection methods for model-based clustering
- Is-ClusterMPP: clustering algorithm through point processes and influence space towards high-dimensional data
- Variational Bayes approximations for clustering via mixtures of normal inverse Gaussian distributions
- Clustering high dimension, low sample size data using the maximal data piling distance
- Clustering of high values in random fields
- Model-based clustering for multivariate functional data
- A distance-relatedness dynamic model for clustering high dimensional data of arbitrary shapes and densities
- Robust discriminative clustering with sparse regularizers
- The parsimonious Gaussian mixture models with partitioned parameters and their application in clustering
- Model-based clustering with missing not at random data
- Discriminative variable selection for clustering with the sparse Fisher-EM algorithm
- Adaptive mixture discriminant analysis for supervised learning with unobserved classes
- High-dimensional mixture models for unsupervised image denoising (HDMI)
- In the pursuit of sparseness: a new rank-preserving penalty for a finite mixture of factor analyzers
- Theoretical and practical considerations on the convergence properties of the Fisher-EM algorithm
- Statistical modeling of dissimilarity increments for \(d\)-dimensional data: application in partitional clustering
- Mixture model averaging for clustering
- A hierarchical modeling approach for clustering probability density functions
- Parameter-wise co-clustering for high-dimensional data
- Variable Selection for Clustering with Gaussian Mixture Models
- Dimensionally reduced mixtures of regression models
- Projective clustering based on Parzen window technique
- A dual subspace parsimonious mixture of matrix normal distributions
- Aspects in classification learning -- review of recent developments in learning vector quantization
- Clustering of imbalanced high-dimensional media data
- Clustering longitudinal data for growth curve modelling by Gibbs sampler and information criterion
- Reducing data dimension for cluster detection
- Application of affinity propagation for prototype sample detection, with application to face recognition
- Cluster analysis with cellwise trimming and applications for the robust clustering of curves
- Subspace clustering of high-dimensional data: a predictive approach
- Tensor envelope mixture model for simultaneous clustering and multiway dimension reduction
- Translated Poisson mixture model for stratification learning
- Variational learning of a Dirichlet process of generalized Dirichlet distributions for simultaneous clustering and feature selection
- Optimal operator space pursuit: a framework for video sequence data analysis
- Using conditional independence for parsimonious model-based Gaussian clustering
- Editorial: Statistical learning methods including dimensionality reduction
- Addressing overfitting and underfitting in Gaussian model-based clustering
- High dimensional data clustering from a dynamical systems point of view
- Estimating common principal components in high dimensions
- Model-based clustering of time series in group-specific functional subspaces
- Frugal Gaussian clustering of huge imbalanced datasets through a bin-marginal approach
- scientific article; zbMATH DE number 1945788
- The discriminative functional mixture model for a comparative analysis of bike sharing systems
- A mixture of generalized hyperbolic factor analyzers
- A feature group weighting method for subspace clustering of high-dimensional data
- Latent simplex position model: high dimensional multi-view clustering with uncertainty quantification
- Hierarchical subspace clustering
- A classification method for binary predictors combining similarity measures and mixture models
- Functional data clustering by projection into latent generalized hyperbolic subspaces
- Kernel discriminant analysis and clustering with parsimonious Gaussian process models
- Compressive learning for patch-based image denoising
- Divisive clustering of high dimensional data streams
- Simultaneous model-based clustering and visualization in the Fisher discriminative subspace
- A joint latent factor analyzer and functional subspace model for clustering multivariate functional data
- Clustering and forecasting multiple functional time series
- Heteroscedastic factor mixture analysis
- Subspace clustering for the finite mixture of generalized hyperbolic distributions
- A new family of multivariate heavy-tailed distributions with variable marginal amounts of tailweight: application to robust clustering
- Mini-batch learning of exponential family finite mixture models
- Holo-entropy based categorical data hierarchical clustering
- Dimensionally reduced model-based clustering through mixtures of factor mixture analyzers
- Sparse optimal discriminant clustering
- Group-wise shrinkage estimation in penalized model-based clustering
This page was built for publication: High-dimensional data clustering (MaRDI item Q1020836)