Skip to main navigation Skip to search Skip to main content

Efficient Skyline Frequent-Utility Itemset Mining Algorithm on Massive Data (Extended abstract)

  • School of Computer Science and Technology, Harbin Institute of Technology

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Itemset mining is a crucial technology for extracting interesting patterns that meet predefined thresholds from transaction databases, such as frequent itemset mining (FIM) and high-utility itemset mining (HUIM). Despite their significance, few studies have explored the simultaneous consideration of both support and utility. This gap arises from the theoretical complexity of integrating these dimensions and the practical difficulty of setting appropriate thresholds for both. To overcome this limitation, we introduce Skyline frequent-utility itemset mining (SFUIM), a method that examines frequent and high-utility itemsets without requiring predefined thresholds. Nevertheless, SFUIM faces significant challenges due to its expansive search space and intensive computational requirements. In this paper, we propose a PSI algorithm and its enhanced version PSI*, which confines calculations to specific partitions by prefix-based partitioning. Experiments demonstrate that PSI* outperforms state-of-the-art methods, especially on large-scale datasets.

Original languageEnglish
Title of host publicationProceedings - 2025 IEEE 41st International Conference on Data Engineering, ICDE 2025
PublisherIEEE Computer Society
Pages4750-4751
Number of pages2
ISBN (Electronic)9798331536039
DOIs
StatePublished - 2025
Externally publishedYes
Event41st IEEE International Conference on Data Engineering, ICDE 2025 - Hong Kong, China
Duration: 19 May 202523 May 2025

Publication series

NameProceedings - International Conference on Data Engineering
ISSN (Print)1084-4627
ISSN (Electronic)2375-0286

Conference

Conference41st IEEE International Conference on Data Engineering, ICDE 2025
Country/TerritoryChina
CityHong Kong
Period19/05/2523/05/25

Keywords

  • Frequent-utility itemset
  • large-scale data
  • skyline

Fingerprint

Dive into the research topics of 'Efficient Skyline Frequent-Utility Itemset Mining Algorithm on Massive Data (Extended abstract)'. Together they form a unique fingerprint.

Cite this