Skip to main navigation Skip to search Skip to main content

Clustered chain path index for XML document: Efficiently processing branch queries

  • Hongqiang Wang*
  • , Jianzhong Li
  • , Hongzhi Wang*
  • *Corresponding author for this work
  • School of Computer Science and Technology, Harbin Institute of Technology

Research output: Contribution to journalArticlepeer-review

Abstract

Branch query processing is a core operation of XML query processing. In recent years, a number of stack based twig join algorithms have been proposed to process twig queries based on tag stream index. However, in tag stream index, each element is labeled separately without considering the similarity among elements. Besides, algorithms based on tag stream index perform inefficiently on large document. This paper proposes a novel index, named Clustered Chain Path Index, based on a novel labeling scheme. This index provides efficient support for processing branch queries. It also has the same cardinality as 1-index against tree structured XML document. Based on CCPI, efficient algorithms, KMP-Match-Path and Related-Path-Segment-Join, are proposed to process queries efficiently. Analysis and experimental results show that proposed query processing algorithms based on CCPI outperform other algorithms and have good scalability.

Original languageEnglish
Pages (from-to)153-168
Number of pages16
JournalWorld Wide Web
Volume11
Issue number1
DOIs
StatePublished - Mar 2008
Externally publishedYes

Keywords

  • 1-index
  • CCPI
  • Clustered chain path
  • Index
  • TwigStack
  • XML

Fingerprint

Dive into the research topics of 'Clustered chain path index for XML document: Efficiently processing branch queries'. Together they form a unique fingerprint.

Cite this