On clustering histograms with \(k\)-means by using mixed \(\alpha\)-divergences (Q296474)
From MaRDI portal
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | On clustering histograms with \(k\)-means by using mixed \(\alpha\)-divergences |
scientific article |
Statements
On clustering histograms with \(k\)-means by using mixed \(\alpha\)-divergences (English)
0 references
15 June 2016
0 references
Summary: Clustering sets of histograms has become popular thanks to the success of the generic method of bag-of-\(X\) used in text categorization and in visual categorization applications. In this paper, we investigate the use of a parametric family of distortion measures, called the \(\alpha\)-divergences, for clustering histograms. Since it usually makes sense to deal with symmetric divergences in information retrieval systems, we symmetrize the \(\alpha\)-divergences using the concept of mixed divergences. First, we present a novel extension of \(k\)-means clustering to mixed divergences. Second, we extend the \(k\)-means++ seeding to mixed \(\alpha\)-divergences and report a guaranteed probabilistic bound. Finally, we describe a soft clustering technique for mixed \(\alpha\)-divergences.
0 references
bag-of-\(X\)
0 references
\(\alpha\)-divergence
0 references
Jeffreys divergence
0 references
centroid
0 references
\(k\)-means clustering
0 references
\(k\)-means seeding
0 references