A partition based method for finding highly correlated pairs (Q615683)

From MaRDI portal





scientific article; zbMATH DE number 5832979
Language Label Description Also known as
default for all languages
No label defined
    English
    A partition based method for finding highly correlated pairs
    scientific article; zbMATH DE number 5832979

      Statements

      A partition based method for finding highly correlated pairs (English)
      0 references
      0 references
      0 references
      6 January 2011
      0 references
      Summary: The problem of finding highly correlated pairs is to output all item pairs whose (Pearson) correlation coefficients are greater than a user-specified correlation threshold. Effective discovery of such item pairs is of primary importance in many real data mining applications. Algorithm and Taper algorithm are special cases of our new algorithm with respect to the number of segments. Experimental results on real datasets demonstrate the feasibility and superiority of our algorithm. Recently, the Taper algorithm is developed to discover the set of highly correlated item pairs. In this paper, we present a generalised Taper algorithm to find strongly correlated pairs between items by partitioning the collection of transactions into different segments, so as to achieve better pruning effect and less running time. Consequently, it can be proved that both are naive.
      0 references
      correlation
      0 references
      association rules
      0 references
      Pearson correlation coefficients
      0 references
      transactional databases
      0 references
      data mining
      0 references
      partition
      0 references
      highly correlated pairs
      0 references

      Identifiers