Model-based clustering using copulas with applications
Abstract: The majority of model-based clustering techniques are based on multivariate Normal models and their variants. In this paper copulas are used for the construction of flexible families of models for clustering applications. The use of copulas in model-based clustering offers two direct advantages over current methods: (i) the appropriate choice of copulas provides the ability to obtain a range of exotic shapes for the clusters, and (ii) the explicit choice of marginal distributions for the clusters allows the modelling of multivariate data of various modes (either discrete or continuous) in a natural way. This paper introduces and studies the framework of copula-based finite mixture models for clustering applications. Estimation in the general case can be performed using the standard EM algorithm, and, depending on the mode of the data, more efficient procedures are provided that can fully exploit the copula structure. The closure properties of the mixture models under marginalization are discussed, and for continuous, real-valued data parametric rotations in the sample space are introduced, with a parallel discussion on parameter identifiability depending on the choice of copulas for the components. The exposition of the methodology is accompanied and motivated by the analysis of real and artificial data.
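To make the idea in the abstract concrete, the following is a minimal sketch of a copula-based mixture component and of the E-step of EM. It is not the paper's implementation. It assumes a bivariate Gaussian copula, continuous margins given as frozen SciPy distributions, and fixed parameters; the function names (`gaussian_copula_logpdf`, `component_logpdf`, `e_step`) are hypothetical and chosen only for illustration.

```python
import numpy as np
from scipy import stats

def gaussian_copula_logpdf(u, rho):
    """Log-density of a bivariate Gaussian copula at points u in (0, 1)^2."""
    z = stats.norm.ppf(u)  # normal scores, shape (n, 2)
    cov = np.array([[1.0, rho], [rho, 1.0]])
    joint = stats.multivariate_normal(mean=[0.0, 0.0], cov=cov).logpdf(z)
    # Copula density = joint normal density / product of standard normal margins.
    return joint - stats.norm.logpdf(z).sum(axis=1)

def component_logpdf(x, margins, rho):
    """Sklar decomposition for continuous margins:
    log f(x) = log c(F_1(x_1), F_2(x_2)) + sum_j log f_j(x_j)."""
    u = np.column_stack([m.cdf(x[:, j]) for j, m in enumerate(margins)])
    u = np.clip(u, 1e-12, 1 - 1e-12)  # keep the normal scores finite
    log_margins = sum(m.logpdf(x[:, j]) for j, m in enumerate(margins))
    return gaussian_copula_logpdf(u, rho) + log_margins

def e_step(x, weights, components):
    """Posterior membership probabilities for each observation.
    `components` is a list of (margins, rho) pairs (an assumed layout)."""
    log_w = np.log(weights)[None, :] + np.column_stack(
        [component_logpdf(x, margins, rho) for margins, rho in components])
    log_w -= log_w.max(axis=1, keepdims=True)  # stabilise before exponentiating
    resp = np.exp(log_w)
    return resp / resp.sum(axis=1, keepdims=True)

# Two clusters, each with a normal margin and a gamma margin.
components = [
    ([stats.norm(0, 1), stats.gamma(a=2.0)], 0.5),
    ([stats.norm(5, 1), stats.gamma(a=2.0, scale=3.0)], -0.3),
]
x = np.array([[0.5, 1.0], [5.0, 6.0]])
resp = e_step(x, np.array([0.5, 0.5]), components)
print(resp.argmax(axis=1))  # hard cluster assignment per observation
```

A full EM fit would alternate this E-step with an M-step that updates the mixing weights, the marginal parameters, and the copula parameter of each component. Discrete margins would need rectangle probabilities of the copula rather than its density, which this sketch does not cover.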
Cites work
- scientific article; zbMATH DE number 997340
- scientific article; zbMATH DE number 1134711
- A Primer on Copulas for Count Data
- A copula-based algorithm for discovering patterns of dependent observations
- A new family of multivariate heavy-tailed distributions with variable marginal amounts of tailweight: application to robust clustering
- An introduction to copulas.
- Approximations to Multivariate Normal Rectangle Probabilities Based on Conditional Expectations
- Bayesian inference for finite mixtures of univariate and multivariate skew-normal and skew-\(t\) distributions
- Clustering student skill set profiles in a unit hypercube using mixtures of multivariate betas
- Copula analysis of mixture models
- Dimension reduction for model-based clustering via mixtures of shifted asymmetric Laplace distributions
- Finite mixture models
- Finite mixtures of multivariate Poisson distributions with application
- Finite mixtures of multivariate skew \(t\)-distributions: some recent and new results
- Flexible mixture modelling using the multivariate skew-\(t\)-normal distribution
- Likelihood inference for Archimedean copulas in high dimensions under known margins
- Maximum likelihood estimation via the ECM algorithm: A general framework
- Methods for merging Gaussian mixture components
- Mixtures of modified \(t\)-factor analyzers for model-based clustering, classification, and discriminant analysis
- Model-Based Clustering, Classification, and Density Estimation Using mclust in R
- Model-Based Gaussian and Non-Gaussian Clustering
- Model-based clustering of Gaussian copulas for mixed data
- Model-based clustering, classification, and discriminant analysis of data with mixed type
- Pair copula constructions for multivariate discrete data
- The meta-elliptical distributions with given marginals
- Using Multinomial Mixture Models to Cluster Internet Traffic
- Vines -- a new graphical model for dependent random variables.
- maxLik: a package for maximum likelihood estimation in R
Cited in (31)
- Copula-based bivariate finite mixture regression models with an application for insurance claim count data
- Model based clustering for mixed data: clustMD
- Model-based clustering for spatiotemporal data on air quality monitoring
- An overview of skew distributions in model-based clustering
- Copula density estimation by finite mixture of parametric copula densities
- Unsupervised fuzzy model-based Gaussian clustering
- Multivariate models for dependent clusters of variables with conditional independence given aggregation variables
- Dissimilarity functions for rank-invariant hierarchical clustering of continuous variables
- A semiparametric and location-shift copula-based mixture model
- Vine copulas for mixed data: multi-view clustering for mixed data beyond meta-Gaussian dependencies
- Model-based clustering
- A copula-based algorithm for discovering patterns of dependent observations
- Classical and Bayesian inference of a mixture of bivariate exponentiated exponential model
- Mixture copulas with discrete margins and their application to imbalanced data
- Clustering student skill set profiles in a unit hypercube using mixtures of multivariate betas
- Football tracking data: a copula-based hidden Markov model for classification of tactics in football
- Variable selection methods for model-based clustering
- Clustering dependencies via mixtures of copulas
- Clustering dependent observations with copula functions
- A family of parsimonious mixtures of multivariate Poisson-lognormal distributions for clustering multivariate count data
- CD-vine model for capturing complex dependence
- scientific article; zbMATH DE number 7218962
- Mixture of hidden Markov models for accelerometer data
- Copula analysis of mixture models
- Model-based clustering of Gaussian copulas for mixed data
- Finite normal mixture copulas for multivariate discrete data modeling
- Distance Metrics and Clustering Methods for Mixed-type Data
- Mixtures of Gaussian copula factor analyzers for clustering high dimensional data
- scientific article; zbMATH DE number 2034573
- Spectral Clustering, Bayesian Spanning Forest, and Forest Process
- A simple proof of Pitman-Yor's Chinese restaurant process from its stick-breaking representation