Skip to main navigation Skip to search Skip to main content

An efficient algorithm for mining large item sets

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

It propose Online Mining Algorithm (OMA) which online discover large item sets. Without pre-setting a default threshold, the OMA algorithm achieves its efficiency and threshold-flexibility by calculating item-sets' counts. It is unnecessary and independent of the default threshold and can flexibly adapt to any user's input threshold. In addition, we propose Cluster-Based Association Rule Algorithm (CARA) creates cluster tables to aid discovery of large item sets. It only requires a single scan of the database, followed by contrasts with the partial cluster tables. It not only prunes considerable amounts of data reducing the time needed to perform data scans and requiring less contrast, but also ensures the correctness of the mined results. By using the CARA algorithm to create cluster tables in advance, each CPU can be utilized to process a cluster table; thus large item sets can be immediately mined even when the database is very large.

Original languageEnglish
Title of host publicationCITSA 2006 - 3rd Int. Conf. on Cybernetics and Information Technol., Systems and Applications, Jointly with the 4th Int. Conf. on Computing, Communications and Control Technologies, CCCT 2006 - Proc.
PublisherInternational Institute of Informatics and Systemics, IIIS
Pages151-154
Number of pages4
ISBN (Print)9806560841, 9789806560840
StatePublished - 2006
Event3rd International Conference on Cybernetics and Information Technologies, Systems and Applications, CITSA 2006, Jointly with the 4th International Conference on Computing, Communications and Control Technologies, CCCT 2006 - Orlando, FL, United States
Duration: 20 Jul 200623 Jul 2006

Publication series

NameCITSA 2006 - 3rd Int. Conf. on Cybernetics and Information Technol., Systems and Applications, Jointly with the 4th Int. Conf. on Computing, Communications and Control Technologies, CCCT 2006 - Proc.
Volume2

Conference

Conference3rd International Conference on Cybernetics and Information Technologies, Systems and Applications, CITSA 2006, Jointly with the 4th International Conference on Computing, Communications and Control Technologies, CCCT 2006
Country/TerritoryUnited States
CityOrlando, FL
Period20/07/0623/07/06

Keywords

  • Association rules
  • Data mining

Fingerprint

Dive into the research topics of 'An efficient algorithm for mining large item sets'. Together they form a unique fingerprint.

Cite this