Multipass algorithms for mining association rules in text databases (Q1606558)

From MaRDI portal





scientific article; zbMATH DE number 1771463
Language Label Description Also known as
default for all languages
No label defined
    English
    Multipass algorithms for mining association rules in text databases
    scientific article; zbMATH DE number 1771463

      Statements

      Multipass algorithms for mining association rules in text databases (English)
      0 references
      0 references
      0 references
      25 July 2002
      0 references
      In this paper, we propose two new algorithms for mining association rules between words in text databases. The characteristics of text databases are quite different from those of retail transaction databases, and existing mining algorithms cannot handle text databases efficiently because of the large number of itemsets (i.e., words) that need to be counted. Two well-known mining algorithms, Apriori algorithm and Direct Hashing and Pruning (DHP) algorithm, are evaluated in the context of mining text databases, and are compared with the new proposed algorithms named Multipass-Apriori (M-Apriori) and Multipass-DHP (M-DHP). It has been shown that the proposed algorithms have better performance for large text databases.
      0 references
      Association rules
      0 references
      Data mining
      0 references
      Performance analysis
      0 references
      Text database
      0 references

      Identifiers

      0 references
      0 references
      0 references
      0 references
      0 references
      0 references
      0 references