Piecewise regression mixture for simultaneous functional data clustering and optimal segmentation
From MaRDI portal
Publication:2628065
Abstract: This paper introduces a novel mixture model-based approach for simultaneous clustering and optimal segmentation of functional data which are curves presenting regime changes. The proposed model consists in a finite mixture of piecewise polynomial regression models. Each piecewise polynomial regression model is associated with a cluster, and within each cluster, each piecewise polynomial component is associated with a regime (i.e., a segment). We derive two approaches for learning the model parameters. The former is an estimation approach and consists in maximizing the observed-data likelihood via a dedicated expectation-maximization (EM) algorithm. A fuzzy partition of the curves in K clusters is then obtained at convergence by maximizing the posterior cluster probabilities. The latter however is a classification approach and optimizes a specific classification likelihood criterion through a dedicated classification expectation-maximization (CEM) algorithm. The optimal curve segmentation is performed by using dynamic programming. In the classification approach, both the curve clustering and the optimal segmentation are performed simultaneously as the CEM learning proceeds. We show that the classification approach is the probabilistic version that generalizes the deterministic K-means-like algorithm proposed in H'ebrail et al. (2010). The proposed approach is evaluated using simulated curves and real-world curves. Comparisons with alternatives including regression mixture models and the K-means like algorithm for piecewise regression demonstrate the effectiveness of the proposed approach.
Recommendations
- Functional data clustering via piecewise constant nonparametric density estimation
- Model-based clustering and segmentation of time series with changes in regime
- A new Dirichlet process for mining dynamic patterns in functional data
- Functional data clustering via hypothesis testing \(k\)-means
- Simultaneous curve registration and clustering for functional data
Cites work
- scientific article; zbMATH DE number 3810727 (Why is no real title available?)
- scientific article; zbMATH DE number 3567782 (Why is no real title available?)
- scientific article; zbMATH DE number 194758 (Why is no real title available?)
- scientific article; zbMATH DE number 1928654 (Why is no real title available?)
- scientific article; zbMATH DE number 2118472 (Why is no real title available?)
- A Case Study of two Clustering Methods based on Maximum Likelihood
- A Segmentation/Clustering Model for the Analysis of Array CGH Data
- A classification EM algorithm for clustering and two stochastic versions
- Adaptive mixture discriminant analysis for supervised learning with unobserved classes
- Approximation of Curves by Line Segments
- Choosing starting values for the EM algorithm for getting the highest likelihood in multivariate Gaussian mixture models
- Clustering for Sparsely Sampled Functional Data
- Estimating the dimension of a model
- Finite mixture models
- Finite mixture models and model-based clustering
- Finite mixtures of multivariate skew \(t\)-distributions: some recent and new results
- Functional data analysis.
- Initializing \(K\)-means batch clustering: A critical evaluation of several techniques
- Local statistical modeling via a cluster-weighted approach with elliptical distributions
- Mixtures of skew-\(t\) factor analyzers
- Mixtures of spatial spline regressions for clustering and classification
- Model based clustering of high-dimensional binary data
- Model-Based Clustering, Discriminant Analysis, and Density Estimation
- Model-Based Gaussian and Non-Gaussian Clustering
- Model-based biclustering of clickstream data
- Model-based clustering and classification with non-normal mixture distributions
- Model-based clustering and segmentation of time series with changes in regime
- Model-based clustering for multivariate functional data
- Model-based clustering of high-dimensional data: a review
- On the approximation of curves by line segments using dynamic programming
- On-Line Inference for Multiple Changepoint Problems
- Simultaneous curve registration and clustering for functional data
- Special issue on ``New trends on model-based clustering and classification. Preface by the guest editors
- Statistical analysis of finite mixture distributions
- The EM Algorithm and Extensions, 2E
- The efficiency of a linear discriminant function based on unclassified initial samples
- The generalized linear mixed cluster-weighted model
- Time series clustering with ARMA mixtures
- Time series modeling by a regression approach based on a latent process
- Variable selection for clustering and classification
Cited in
(3)
This page was built for publication: Piecewise regression mixture for simultaneous functional data clustering and optimal segmentation
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q2628065)