Abstract
Branch query processing is a core operation of XML query processing. In recent years, a number of stack based twig join algorithms have been proposed to process twig queries based on tag stream index. However, in tag stream index, each element is labeled separately without considering the similarity among elements. Besides, algorithms based on tag stream index perform inefficiently on large document. This paper proposes a novel index, named Clustered Chain Path Index, based on a novel labeling scheme. This index provides efficient support for processing branch queries. It also has the same cardinality as 1-index against tree structured XML document. Based on CCPI, efficient algorithms, KMP-Match-Path and Related-Path-Segment-Join, are proposed to process queries efficiently. Analysis and experimental results show that proposed query processing algorithms based on CCPI outperform other algorithms and have good scalability.
| Original language | English |
|---|---|
| Pages (from-to) | 153-168 |
| Number of pages | 16 |
| Journal | World Wide Web |
| Volume | 11 |
| Issue number | 1 |
| DOIs | |
| State | Published - Mar 2008 |
| Externally published | Yes |
Keywords
- 1-index
- CCPI
- Clustered chain path
- Index
- TwigStack
- XML
Fingerprint
Dive into the research topics of 'Clustered chain path index for XML document: Efficiently processing branch queries'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver