High-dimensional data clustering
From MaRDI portal
Abstract: Clustering in high-dimensional spaces is a difficult problem which is recurrent in many domains, for example in image analysis. The difficulty is due to the fact that high-dimensional data usually live in different low-dimensional subspaces hidden in the original space. This paper presents a family of Gaussian mixture models designed for high-dimensional data which combine the ideas of dimension reduction and parsimonious modeling. These models give rise to a clustering method based on the Expectation-Maximization algorithm which is called High-Dimensional Data Clustering (HDDC). In order to correctly fit the data, HDDC estimates the specific subspace and the intrinsic dimension of each group. Our experiments on artificial and real datasets show that HDDC outperforms existing methods for clustering high-dimensional data
Recommendations
- scientific article; zbMATH DE number 5280158
- Clustering high dimensional massive scientific datasets
- Clustering High-Dimensional Data via Feature Selection
- The challenges of clustering high dimensional data
- scientific article; zbMATH DE number 2112095
- Introduction to clustering large and high-dimensional data.
Cites work
- scientific article; zbMATH DE number 3126094 (Why is no real title available?)
- scientific article; zbMATH DE number 3567782 (Why is no real title available?)
- scientific article; zbMATH DE number 1059776 (Why is no real title available?)
- 10.1162/153244303322753616
- A classification EM algorithm for clustering and two stochastic versions
- A maximum likelihood methodology for clusterwise linear regression
- A mixture model for the classification of three-way proximity data
- A nonlinear PCA based on manifold approximation
- ARPACK Users' Guide
- An Algorithm for Simultaneous Orthogonal Transformation of Several Positive Definite Symmetric Matrices to Nearly Diagonal Form
- Automatische Klassifikation
- Detection and Characterization of Cluster Substructure I. Linear Structure: Fuzzy c-Lines
- Dimensionality reduction in quadratic discriminant analysis
- Discriminant Analysis with Singular Covariance Matrices: Methods and Applications to Spectroscopic Data
- Effect of dimensionality on discrimination
- Estimating Mixtures of Normal Distributions and Switching Regressions
- Estimating the dimension of a model
- Finite mixture models
- High-dimensional data clustering
- Model-Based Clustering, Discriminant Analysis, and Density Estimation
- Model-Based Gaussian and Non-Gaussian Clustering
- Modelling high-dimensional data by mixtures of factor analyzers
- On feature selection, curse-of-dimensionality and error probability in discriminant analysis
- Principal Curves
- Principal component analysis.
- Probabilistic models in cluster analysis
- The Discrimination Subspace Model
- Variable Selection for Model-Based Clustering
Cited in
(only showing first 100 items - show all)- Location and scale mixtures of Gaussians with flexible tail behaviour: properties, inference and application to multivariate clustering
- Mixture model averaging for clustering
- Model-based classification via mixtures of multivariate \(t\)-distributions
- Reducing data dimension for cluster detection
- Clustering and forecasting multiple functional time series
- scientific article; zbMATH DE number 5280158 (Why is no real title available?)
- Mixtures of modified \(t\)-factor analyzers for model-based clustering, classification, and discriminant analysis
- Inverse regression approach to robust nonlinear high-to-low dimensional mapping
- Divisive clustering of high dimensional data streams
- Model-based clustering for multivariate functional data
- Variational learning of a Dirichlet process of generalized Dirichlet distributions for simultaneous clustering and feature selection
- Theoretical and practical considerations on the convergence properties of the Fisher-EM algorithm
- Translated Poisson mixture model for stratification learning
- Model-based clustering of longitudinal data
- Simultaneous model-based clustering and visualization in the Fisher discriminative subspace
- Statistical modeling of dissimilarity increments for \(d\)-dimensional data: application in partitional clustering
- Adaptive mixture discriminant analysis for supervised learning with unobserved classes
- Greedy clustering of count data through a mixture of multinomial PCA
- Model-based clustering
- A hidden Markov model applied to the protein 3D structure analysis
- Analyzing state-dependent model-data comparison in multi-regime systems
- Robust supervised classification with mixture models: learning from data with uncertain labels
- A classification method for binary predictors combining similarity measures and mixture models
- A mixture of generalized hyperbolic factor analyzers
- Cluster analysis with cellwise trimming and applications for the robust clustering of curves
- Subspace clustering of high-dimensional data: a predictive approach
- Variable selection methods for model-based clustering
- Robust discriminative clustering with sparse regularizers
- A hierarchical modeling approach for clustering probability density functions
- Using conditional independence for parsimonious model-based Gaussian clustering
- Variational Bayes approximations for clustering via mixtures of normal inverse Gaussian distributions
- scientific article; zbMATH DE number 5817573 (Why is no real title available?)
- Mixtures of generalized hyperbolic distributions and mixtures of skew-\(t\) distributions for model-based clustering with incomplete data
- Estimating common principal components in high dimensions
- Variable Selection for Clustering with Gaussian Mixture Models
- A feature group weighting method for subspace clustering of high-dimensional data
- High-dimensional mixture models for unsupervised image denoising (HDMI)
- Model-based clustering of time series in group-specific functional subspaces
- Dimensionally reduced mixtures of regression models
- A new family of multivariate heavy-tailed distributions with variable marginal amounts of tailweight: application to robust clustering
- Parsimonious skew mixture models for model-based clustering and classification
- Model-based clustering of high-dimensional data: a review
- Learning from partially supervised data using mixture models and belief functions
- Discriminative variable selection for clustering with the sparse Fisher-EM algorithm
- A Bayesian Fisher-EM algorithm for discriminative Gaussian subspace clustering
- Initializing the EM algorithm in Gaussian mixture models with an unknown number of components
- Model-based clustering of high-dimensional data streams with online mixture of probabilistic PCA
- Kernel discriminant analysis and clustering with parsimonious Gaussian process models
- Model-based clustering, classification, and discriminant analysis via mixtures of multivariate \(t\)-distributions
- Functional data clustering: a survey
- scientific article; zbMATH DE number 1945788 (Why is no real title available?)
- Clustering and classification via cluster-weighted factor analyzers
- The discriminative functional mixture model for a comparative analysis of bike sharing systems
- Clustering gene expression time course data using mixtures of multivariate \(t\)-distributions
- Flexible mixture modeling via the multivariate \(t\) distribution with the Box-Cox transformation: an alternative to the skew-\(t\) distribution
- Finite mixtures of matrix normal distributions for classifying three-way data
- Stable and visualizable Gaussian parsimonious clustering models
- A distance-relatedness dynamic model for clustering high dimensional data of arbitrary shapes and densities
- Heteroscedastic factor mixture analysis
- Dimensionally reduced model-based clustering through mixtures of factor mixture analyzers
- Clustering of high values in random fields
- Clustering of imbalanced high-dimensional media data
- Model-based clustering, classification, and discriminant analysis of data with mixed type
- Aspects in classification learning -- review of recent developments in learning vector quantization
- Sparse optimal discriminant clustering
- Clustering high dimension, low sample size data using the maximal data piling distance
- High-dimensional data clustering
- Large values of the clustering coefficient
- Compressive learning for patch-based image denoising
- Flexible mixture regression with the generalized hyperbolic distribution
- Addressing overfitting and underfitting in Gaussian model-based clustering
- The generic subspace clustering model
- Tensor envelope mixture model for simultaneous clustering and multiway dimension reduction
- Model-based clustering of functional data via mixtures of \(t\) distributions
- Clustering analysis of multivariate data: a weighted spatial ranks-based approach
- Mini-batch learning of exponential family finite mixture models
- Image super-resolution with PCA reduced generalized Gaussian mixture models in materials science
- Gaussian mixture model with an extended ultrametric covariance structure
- scientific article; zbMATH DE number 1832310 (Why is no real title available?)
- Functional data clustering by projection into latent generalized hyperbolic subspaces
- The parsimonious Gaussian mixture models with partitioned parameters and their application in clustering
- Model-based clustering with missing not at random data
- Clustering longitudinal data for growth curve modelling by Gibbs sampler and information criterion
- A joint latent factor analyzer and functional subspace model for clustering multivariate functional data
- Latent simplex position model: high dimensional multi-view clustering with uncertainty quantification
- Functional data clustering via information maximization
- On clustering uncertain and structured data with Wasserstein barycenters and a geodesic criterion for the number of clusters
- PCA reduced Gaussian mixture models with applications in superresolution
- Is-ClusterMPP: clustering algorithm through point processes and influence space towards high-dimensional data
- Application of affinity propagation for prototype sample detection, with application to face recognition
- Frugal Gaussian clustering of huge imbalanced datasets through a bin-marginal approach
- Parsimonious ultrametric Gaussian mixture models
- A mixture of common skew-\(t\) factor analysers
- Editorial: Statistical learning methods including dimensionality reduction
- Optimal operator space pursuit: a framework for video sequence data analysis
- High-dimensional clustering via random projections
- Clustering boundary pattern discovery for high dimensional space based on matrix model
- Flexible clustering of high-dimensional data via mixtures of joint generalized hyperbolic distributions
- Hierarchical subspace clustering
- Group-wise shrinkage estimation in penalized model-based clustering
This page was built for publication: High-dimensional data clustering
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q1020836)