Efficient evaluation of generalized path pattern queries on XML data

conference contribution
posted on 2024-10-31, 16:37 authored by Xiaoying Wu, S Souldatos, Dimitri Theodoratos, T Dalamagas, Timoleon Sellis
Finding the occurrences of structural patterns in XML data is a key operation in XML query processing. Existing algorithms for this operation focus almost exclusively on path-patterns or tree-patterns. Requirements for flexible querying of XML data have recently motivated the introduction of query languages that allow a partial specification of path-patterns in a query. In this paper, we focus on the efficient evaluation of partial path queries, a generalization of path pattern queries. Our approach explicitly deals with repeated labels (that is, multiple occurrences of the same label in a query). We show that partial path queries can be represented as rooted dags for which a topological ordering of the nodes exists. We present three algorithms for the efficient evaluation of these queries under the indexed streaming evaluation model. The first exploits a structural summary of the data to generate a set of path-patterns that together are equivalent to a partial path query. To evaluate these path-patterns, we extend PathStack so that it can work on path-patterns with repeated labels. The second extracts a spanning tree from the query dag, uses a stack-based algorithm to find the matches of the root-to-leaf paths in the tree, and merge-joins the matches to compute the answer. Finally, the third exploits multiple pointers of stack entries and a topological ordering of the query dag to apply a stack-based holistic technique. An analysis of the algorithms and an extensive experimental evaluation show that the holistic algorithm outperforms the other two.
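As a rough illustration of the rooted-dag representation mentioned in the abstract, the sketch below computes one topological ordering of a small query dag using Kahn's algorithm. It is not the paper's algorithm: the function name `topological_order`, the edge-list encoding, and the node ids (`a1`, `a2`, which stand for two query nodes sharing the repeated label `a`) are all assumptions made for this example.

```python
from collections import deque


def topological_order(edges, root):
    """Return one topological ordering of a rooted query dag (Kahn's algorithm).

    `edges` is a list of (parent, child) pairs; this encoding is an assumption
    of the sketch, not something prescribed by the paper.
    """
    # Build successor lists and in-degree counts for every node.
    succ = {root: []}
    indeg = {root: 0}
    for parent, child in edges:
        succ.setdefault(parent, []).append(child)
        succ.setdefault(child, [])
        indeg.setdefault(parent, 0)
        indeg[child] = indeg.get(child, 0) + 1

    # Nodes without incoming edges are ready; in a rooted dag that is the root.
    ready = deque(n for n in succ if indeg[n] == 0)
    order = []
    while ready:
        node = ready.popleft()
        order.append(node)
        for nxt in succ[node]:
            indeg[nxt] -= 1
            if indeg[nxt] == 0:
                ready.append(nxt)

    # If some node was never released, the graph has a cycle and is not a dag.
    if len(order) != len(succ):
        raise ValueError("query graph contains a cycle")
    return order


# Hypothetical query dag: ids carry an index so that the label "a" can repeat.
edges = [("r", "a1"), ("r", "b1"), ("a1", "a2"), ("b1", "a2"), ("a2", "c1")]
order = topological_order(edges, "r")
print(order)
```

In this ordering every query node appears after all of its parents, which is the property a stack-based evaluation that processes query nodes in order would rely on.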

Related Materials

  1. DOI - Is published in 10.1145/1367497.1367610
  2. ISBN - Is published in 9781605580852 (urn:isbn:9781605580852)

Start page

835

End page

844

Total pages

10

Outlet

Proceedings of the 17th international conference on World Wide Web (WWW 2008)

Editors

Jinpeng Huai and Robin Chen

Name of conference

17th international conference on World Wide Web (WWW 2008)

Publisher

ACM

Place published

New York, USA

Start date

2008-04-21

End date

2008-04-25

Language

English

Copyright

© ACM

Former Identifier

2006036021

Esploro creation date

2020-06-22

Fedora creation date

2013-03-12
