RMIT University
Browse

ELCA evaluation for keyword search on probabilistic XML data

journal contribution
posted on 2024-11-01, 17:52 authored by Rui Zhou, Chengfei Liu, Jianxin Li, Jeffrey Yu
As probabilistic data management is becoming one of the main research focuses and keyword search is turning into a more popular query means, it is natural to think how to support keyword queries on probabilistic XML data. With regards to keyword query on deterministic XML documents, ELCA (Exclusive Lowest Common Ancestor) semantics allows more relevant fragments rooted at the ELCAs to appear as results and is more popular compared with other keyword query result semantics (such as SLCAs). In this paper, we investigate how to evaluate ELCA results for keyword queries on probabilistic XML documents. After defining probabilistic ELCA semantics in terms of possible world semantics, we propose an approach to compute ELCA probabilities without generating possible worlds. Then we develop an efficient stack-based algorithm that can find all probabilistic ELCA results and their ELCA probabilities for a given keyword query on a probabilistic XML document. Finally, we experimentally evaluate the proposed ELCA algorithm and compare it with its SLCA counterpart in aspects of result probability, time and space efficiency, and scalability.

History

Related Materials

  1. 1.
    DOI - Is published in 10.1007/s11280-012-0166-4
  2. 2.
    ISSN - Is published in 1386145X

Journal

World Wide Web

Volume

16

Issue

2

Start page

171

End page

193

Total pages

23

Publisher

Springer

Place published

United States

Language

English

Copyright

© Springer Science+Business Media, LLC 2012

Former Identifier

2006050441

Esploro creation date

2020-06-22

Fedora creation date

2015-02-12

Usage metrics

    Scholarly Works

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC