Clustering and feature selection using sparse principal component analysis

DOI10.1007/S11081-008-9057-ZMaRDI QIDQ374668zbMATH OpenOpenAlexFDO

Authors Ronny Luss, Alexandre d'Aspremont

Publication date 24 October 2013

Published in Optimization and Engineering (Search for Journal in Brave)

Full work available at URL https://arxiv.org/abs/0707.0701

clustering feature selection semidefinite programming sparse principal component analysis

Factor analysis and principal components; correspondence analysis (62H25) Numerical optimization and variational techniques (65K10) Genetics and epigenetics (92D10) Semidefinite programming (90C22)

Abstract: In this paper, we study the application of sparse principal component analysis (PCA) to clustering and feature selection problems. Sparse PCA seeks sparse factors, or linear combinations of the data variables, explaining a maximum amount of variance in the data while having only a limited number of nonzero coefficients. PCA is often used as a simple clustering technique and sparse factors allow us here to interpret the clusters in terms of a reduced set of variables. We begin with a brief introduction and motivation on sparse PCA and detail our implementation of the algorithm in d'Aspremont et al. (2005). We then apply these results to some classic clustering and feature selection problems arising in biology.

Recommendations

Cites work

Cited in

(22)

Describes a project that uses

Uses Software

This page was built for publication: Clustering and feature selection using sparse principal component analysis

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q374668)