On clustering histograms with \(k\)-means by using mixed \(\alpha\)-divergences (Q296474)

From MaRDI portal





scientific article; zbMATH DE number 6593656
Language Label Description Also known as
default for all languages
No label defined
    English
    On clustering histograms with \(k\)-means by using mixed \(\alpha\)-divergences
    scientific article; zbMATH DE number 6593656

      Statements

      On clustering histograms with \(k\)-means by using mixed \(\alpha\)-divergences (English)
      0 references
      0 references
      0 references
      0 references
      0 references
      15 June 2016
      0 references
      Summary: Clustering sets of histograms has become popular thanks to the success of the generic method of bag-of-\(X\) used in text categorization and in visual categorization applications. In this paper, we investigate the use of a parametric family of distortion measures, called the \(\alpha\)-divergences, for clustering histograms. Since it usually makes sense to deal with symmetric divergences in information retrieval systems, we symmetrize the \(\alpha\)-divergences using the concept of mixed divergences. First, we present a novel extension of \(k\)-means clustering to mixed divergences. Second, we extend the \(k\)-means++ seeding to mixed \(\alpha\)-divergences and report a guaranteed probabilistic bound. Finally, we describe a soft clustering technique for mixed \(\alpha\)-divergences.
      0 references
      bag-of-\(X\)
      0 references
      \(\alpha\)-divergence
      0 references
      Jeffreys divergence
      0 references
      centroid
      0 references
      \(k\)-means clustering
      0 references
      \(k\)-means seeding
      0 references

      Identifiers