Quantile-based clustering

DOI10.1214/19-EJS1640MaRDI QIDQ107146zbMATH OpenFDO

Authors Christian Hennig, Cinzia Viroli, Laura Anderlucci, Christian Hennig, Cinzia Viroli

Publication date 1 January 2019

Published in Electronic Journal of Statistics (Search for Journal in Brave)

Full work available at URL https://arxiv.org/abs/1806.10403, https://projecteuclid.org/euclid.ejs/1575428462

fixed partition model high dimensional clustering nonparametric mixture quantile discrepancy

Nonparametric regression and quantile regression (62G08) Classification and discrimination; cluster analysis (statistical aspects) (62H30)

Abstract: A new cluster analysis method,

K

-quantiles clustering, is introduced.

K

-quantiles clustering can be computed by a simple greedy algorithm in the style of the classical Lloyd's algorithm for

K

-means. It can be applied to large and high-dimensional datasets. It allows for within-cluster skewness and internal variable scaling based on within-cluster variation. Different versions allow for different levels of parsimony and computational efficiency. Although

K

-quantiles clustering is conceived as nonparametric, it can be connected to a fixed partition model of generalized asymmetric Laplace-distributions. The consistency of

K

-quantiles clustering is proved, and it is shown that

K

-quantiles clusters correspond to well separated mixture components in a nonparametric mixture. In a simulation,

K

-quantiles clustering is compared with a number of popular clustering methods with good results. A high-dimensional microarray dataset is clustered by

K

-quantiles.

Recommendations

Cites work

Cited in

(6)

Describes a project that uses

Uses Software

This page was built for publication: Quantile-based clustering

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q107146)