On clustering histograms with \(k\)-means by using mixed \(\alpha\)-divergences (Q296474): Difference between revisions

From MaRDI portal
Importer (talk | contribs)
Created a new Item
 
Importer (talk | contribs)
Changed an Item
Property / author
 
Property / author: Shun-ichi Amari / rank
 
Normal rank
Property / review text
 
Summary: Clustering sets of histograms has become popular thanks to the success of the generic method of bag-of-\(X\) used in text categorization and in visual categorization applications. In this paper, we investigate the use of a parametric family of distortion measures, called the \(\alpha\)-divergences, for clustering histograms. Since it usually makes sense to deal with symmetric divergences in information retrieval systems, we symmetrize the \(\alpha\)-divergences using the concept of mixed divergences. First, we present a novel extension of \(k\)-means clustering to mixed divergences. Second, we extend the \(k\)-means++ seeding to mixed \(\alpha\)-divergences and report a guaranteed probabilistic bound. Finally, we describe a soft clustering technique for mixed \(\alpha\)-divergences.
Property / review text: Summary: Clustering sets of histograms has become popular thanks to the success of the generic method of bag-of-\(X\) used in text categorization and in visual categorization applications. In this paper, we investigate the use of a parametric family of distortion measures, called the \(\alpha\)-divergences, for clustering histograms. Since it usually makes sense to deal with symmetric divergences in information retrieval systems, we symmetrize the \(\alpha\)-divergences using the concept of mixed divergences. First, we present a novel extension of \(k\)-means clustering to mixed divergences. Second, we extend the \(k\)-means++ seeding to mixed \(\alpha\)-divergences and report a guaranteed probabilistic bound. Finally, we describe a soft clustering technique for mixed \(\alpha\)-divergences. / rank
 
Normal rank
Property / Mathematics Subject Classification ID
 
Property / Mathematics Subject Classification ID: 62H30 / rank
 
Normal rank
Property / Mathematics Subject Classification ID
 
Property / Mathematics Subject Classification ID: 62B10 / rank
 
Normal rank
Property / zbMATH DE Number
 
Property / zbMATH DE Number: 6593656 / rank
 
Normal rank
Property / zbMATH Keywords
 
bag-of-\(X\)
Property / zbMATH Keywords: bag-of-\(X\) / rank
 
Normal rank
Property / zbMATH Keywords
 
\(\alpha\)-divergence
Property / zbMATH Keywords: \(\alpha\)-divergence / rank
 
Normal rank
Property / zbMATH Keywords
 
Jeffreys divergence
Property / zbMATH Keywords: Jeffreys divergence / rank
 
Normal rank
Property / zbMATH Keywords
 
centroid
Property / zbMATH Keywords: centroid / rank
 
Normal rank
Property / zbMATH Keywords
 
\(k\)-means clustering
Property / zbMATH Keywords: \(k\)-means clustering / rank
 
Normal rank
Property / zbMATH Keywords
 
\(k\)-means seeding
Property / zbMATH Keywords: \(k\)-means seeding / rank
 
Normal rank

Revision as of 20:57, 27 June 2023

scientific article
Language Label Description Also known as
English
On clustering histograms with \(k\)-means by using mixed \(\alpha\)-divergences
scientific article

    Statements

    On clustering histograms with \(k\)-means by using mixed \(\alpha\)-divergences (English)
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    15 June 2016
    0 references
    Summary: Clustering sets of histograms has become popular thanks to the success of the generic method of bag-of-\(X\) used in text categorization and in visual categorization applications. In this paper, we investigate the use of a parametric family of distortion measures, called the \(\alpha\)-divergences, for clustering histograms. Since it usually makes sense to deal with symmetric divergences in information retrieval systems, we symmetrize the \(\alpha\)-divergences using the concept of mixed divergences. First, we present a novel extension of \(k\)-means clustering to mixed divergences. Second, we extend the \(k\)-means++ seeding to mixed \(\alpha\)-divergences and report a guaranteed probabilistic bound. Finally, we describe a soft clustering technique for mixed \(\alpha\)-divergences.
    0 references
    bag-of-\(X\)
    0 references
    \(\alpha\)-divergence
    0 references
    Jeffreys divergence
    0 references
    centroid
    0 references
    \(k\)-means clustering
    0 references
    \(k\)-means seeding
    0 references

    Identifiers