RMIT University
Browse

XML keyword search with promising result type recommendations

journal contribution
posted on 2024-11-01, 16:49 authored by Jianxin Li, Chengfei Liu, Rui Zhou, Wei Wang
Keyword search enables inexperienced users to easily search XML database with no specific knowledge of complex structured query languages and XML data schemas. Existing work has addressed the problem of selecting data nodes that match keywords and connecting them in a meaningful way, e.g., SLCA and ELCA. However, it is time-consuming and unnecessary to serve all the connected subtrees to the users because in general the users are only interested in part of the relevant results. In this paper, we propose a new keyword search approach which basically utilizes the statistics of underlying XML data to decide the promising result types and then quickly retrieves the corresponding results with the help of selected promising result types. To guarantee the quality of the selected promising result types, we measure the correlations between result types and a keyword query by analyzing the distribution of relevant keywords and their structures within the XML data to be searched. In addition, relevant result types can be efficiently computed without keyword query evaluation and any schema information. To directly return top-k keyword search results that conform to the suggested promising result types, we design two new algorithms to adapt to the structural sensitivity of the keyword nodes over the keyword search results. Lastly, we implement all proposed approaches and present the relevant experimental results to show the effectiveness of our approach.

History

Related Materials

  1. 1.
    DOI - Is published in 10.1007/s11280-012-0198-9
  2. 2.
    ISSN - Is published in 1386145X

Journal

World Wide Web

Volume

17

Start page

127

End page

159

Total pages

33

Publisher

Springer New York LLC

Place published

United States

Language

English

Copyright

© 2013 Springer Science+Business Media New York.

Former Identifier

2006050310

Esploro creation date

2020-06-22

Fedora creation date

2015-02-04

Usage metrics

    Scholarly Works

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC