Skip to main navigation Skip to search Skip to main content

bSAX: A Novel Sketch for Efficient Data Series Similarity Search

  • Han Hu
  • , Jiye Qiu
  • , Hongzhi Wang*
  • , Bin Liang
  • , Songling Zou
  • *Corresponding author for this work
  • Harbin Institute of Technology

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In contemporary applications of data analysis, data series similarity search holds great significance. A substantial body of research design data series indices for exact similarity search. The iSAX family, employing iSAX sketch to represent a SAX collection, stands as a pivotal research direction. However, we find a critical flaw in the iSAX sketch that its representation of SAX collections leads to a significant deterioration in the lower bound distance, thereby impacting the pruning efficiency and search performance of the index. To address the limitation, we propose a novel sketch, called bSAX. bSAX, by leveraging boundary information from SAX summarizations, offers a tighter lower bound distance than iSAX. Moreover, we design a novel index for data series, with a cost model involving compressed information loss. Conducting comprehensive experimental comparisons, we validate the superior performance of bSAX in the similarity search.

Original languageEnglish
Title of host publicationDatabase Systems for Advanced Applications - 29th International Conference, DASFAA 2024, Proceedings
EditorsMakoto Onizuka, Chuan Xiao, Jae-Gil Lee, Yongxin Tong, Yoshiharu Ishikawa, Kejing Lu, Sihem Amer-Yahia, H.V. Jagadish
PublisherSpringer Science and Business Media Deutschland GmbH
Pages442-458
Number of pages17
ISBN (Print)9789819755684
DOIs
StatePublished - 2024
Event29th International Conference on Database Systems for Advanced Applications, DASFAA 2024 - Gifu, Japan
Duration: 2 Jul 20245 Jul 2024

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume14854 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference29th International Conference on Database Systems for Advanced Applications, DASFAA 2024
Country/TerritoryJapan
CityGifu
Period2/07/245/07/24

Keywords

  • Data series.
  • Similarity search
  • Summarization

Fingerprint

Dive into the research topics of 'bSAX: A Novel Sketch for Efficient Data Series Similarity Search'. Together they form a unique fingerprint.

Cite this