On clustering histograms with \(k\)-means by using mixed \(\alpha\)-divergences (Q296474)
From MaRDI portal
Property / author: Shun-ichi Amari
Property / Mathematics Subject Classification ID: 62H30
Property / Mathematics Subject Classification ID: 62B10
Property / zbMATH DE Number: 6593656
Revision as of 20:57, 27 June 2023
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | On clustering histograms with \(k\)-means by using mixed \(\alpha\)-divergences | scientific article | |
Statements
On clustering histograms with \(k\)-means by using mixed \(\alpha\)-divergences (English)
15 June 2016
Summary: Clustering sets of histograms has become popular thanks to the success of the generic method of bag-of-\(X\) used in text categorization and in visual categorization applications. In this paper, we investigate the use of a parametric family of distortion measures, called the \(\alpha\)-divergences, for clustering histograms. Since it usually makes sense to deal with symmetric divergences in information retrieval systems, we symmetrize the \(\alpha\)-divergences using the concept of mixed divergences. First, we present a novel extension of \(k\)-means clustering to mixed divergences. Second, we extend the \(k\)-means++ seeding to mixed \(\alpha\)-divergences and report a guaranteed probabilistic bound. Finally, we describe a soft clustering technique for mixed \(\alpha\)-divergences.
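The summary does not state the divergences explicitly. As a minimal sketch only, the following Python assumes one common convention: the \(\alpha\)-divergence between positive histograms \(p\) and \(q\) is \(\frac{4}{1-\alpha^2}\sum_i\bigl(\frac{1-\alpha}{2}p_i+\frac{1+\alpha}{2}q_i-p_i^{(1-\alpha)/2}q_i^{(1+\alpha)/2}\bigr)\) for \(\alpha\neq\pm 1\), and a mixed divergence is the weighted sum \(\lambda D(p:q)+(1-\lambda)D(q:r)\). The function names `alpha_divergence` and `mixed_alpha_divergence` and the parameter `lam` are hypothetical and not taken from the article.

```python
def alpha_divergence(p, q, alpha):
    """Alpha-divergence D_alpha(p : q) between positive histograms.

    Assumed convention (not stated in the summary); requires alpha != +/-1
    and strictly positive bins.
    """
    if abs(alpha) == 1:
        raise ValueError("alpha = +/-1 are limit cases not handled in this sketch")
    if len(p) != len(q):
        raise ValueError("histograms must have the same number of bins")
    a = (1.0 - alpha) / 2.0
    b = (1.0 + alpha) / 2.0
    total = sum(a * pi + b * qi - (pi ** a) * (qi ** b) for pi, qi in zip(p, q))
    return 4.0 / (1.0 - alpha ** 2) * total


def mixed_alpha_divergence(p, q, r, alpha, lam):
    """Mixed divergence lam * D_alpha(p : q) + (1 - lam) * D_alpha(q : r)."""
    return lam * alpha_divergence(p, q, alpha) + (1.0 - lam) * alpha_divergence(q, r, alpha)


if __name__ == "__main__":
    p = [0.2, 0.3, 0.5]
    q = [0.4, 0.4, 0.2]
    # With r = p and lam = 0.5 the mixed divergence symmetrizes D_alpha.
    print(mixed_alpha_divergence(p, q, p, alpha=0.5, lam=0.5))
    print(mixed_alpha_divergence(q, p, q, alpha=0.5, lam=0.5))
```

Under this convention, setting `r = p` and `lam = 0.5` gives a value that is unchanged when `p` and `q` are swapped, which is the sense in which a mixed divergence can symmetrize an asymmetric one.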
bag-of-\(X\)
\(\alpha\)-divergence
Jeffreys divergence
centroid
\(k\)-means clustering
\(k\)-means seeding