Skip to main navigation Skip to search Skip to main content

An efficient algorithm for mining large item sets

  • School of Computer Science and Technology (School of Software), Harbin Institute of Technology Weihai

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

It propose Online Mining Algorithm ( OMA) which online discover large item sets. Without presetting a default threshold, the OMA algorithm achieves its efficiency and threshold-flexibility by calculating item-sets' counts. It is unnecessary and independent of the default threshold and can flexibly adapt to any user's input threshold. In addition, we propose Cluster-Based Association Rule Algorithm (CARA) creates cluster tables to aid discovery of large item sets. It only requires a single scan of the database, followed by contrasts with the partial cluster tables. It not only prunes considerable amounts of data reducing the time needed to perform data scans and requiring less contrast, but also ensures the correctness of the mined results. By using the CARA algorithm to create cluster tables in advance, each CPU can be utilized to process a cluster table; thus large item sets can be immediately mined even when the database is very large.

Original languageEnglish
Title of host publicationProceedings - 5th International Conference on Fuzzy Systems and Knowledge Discovery, FSKD 2008
Pages561-564
Number of pages4
DOIs
StatePublished - 2008
Externally publishedYes
Event5th International Conference on Fuzzy Systems and Knowledge Discovery, FSKD 2008 - Jinan, Shandong, China
Duration: 18 Oct 200820 Oct 2008

Publication series

NameProceedings - 5th International Conference on Fuzzy Systems and Knowledge Discovery, FSKD 2008
Volume2

Conference

Conference5th International Conference on Fuzzy Systems and Knowledge Discovery, FSKD 2008
Country/TerritoryChina
CityJinan, Shandong
Period18/10/0820/10/08

Keywords

  • Association rules
  • Data mining
  • Large item sets

Fingerprint

Dive into the research topics of 'An efficient algorithm for mining large item sets'. Together they form a unique fingerprint.

Cite this