Pagewise preview ]

CategoryValue
Available viahttp://dbpubs.stanford.edu/pub/1999-10
Submitted on 26th of February 2000
Author Fujiwara, S.; Motwani, R.; Ullman, J.
Title Dynamic Miss COunting Algorithms: Finding Implication and Similarity Rules With COnfidence Pruning
Date of publication 1999
Citation S. Fujiwara,R. Motwani,J. Ullman: Dynamic Miss COunting Algorithms: Finding Implication and Similarity Rules With COnfidence Pruning. to appear in ICDE, 1999
Language English
Project Information Integration
Type Conference or Journal Paper
Subject group Data Mining
Abstract Dynamic Miss-Counting algorithms are proposed, which nd all implication and similarity rules with condence pruning but without support pruning. To handle data sets with a large number of columns, we propose dynamic pruning techniques that can be applied during data scanning. DMC counts the numbers of rows in which each pair of columns disagree instead of counting the number of hits. DMC deletes a candidate as soon as the number of misses exceeds the maximum number of misses allowed for that pair . We also propose several optimization techniques that reduce the required memory size signicantly. We evaluated our algorithms by using 4 data sets, i.e., Web access logs, Web page-link graph, News documents, and a Dictionary. These data sets have between 74,000 and 700,000 items. Experiments show that DMC can nd high-condence rules for such a large data sets efciently
Keywords Data mining, association rule
Fulltext source
  • Postscript (ps, ps.gz, ps.zip)
  • PDF (pdf, pdf.gz, pdf.zip)
  • Plain text (text, text.gz, text.zip)
  • Management of the document bysiroker@db.stanford.edu

    Pagewise preview ]


    Stanford InfoLab Publication Server