RMIT University
Browse

Context-based diversification for keyword queries over XML data

journal contribution
posted on 2024-11-01, 16:56 authored by Jianxin Li, Chengfei Liu, Jeffrey Yu
While keyword query empowers ordinary users to search vast amount of data, the ambiguity of keyword query makes it difficult to effectively answer keyword queries, especially for short and vague keyword queries. To address this challenging problem, in this paper we propose an approach that automatically diversifies XML keyword search based on its different contexts in the XML data. Given a short and vague keyword query and XML data to be searched, we firstly derive keyword search candidates of the query by a simple feature selection model. And then, we design an effective XML keyword search diversification model to measure the quality of each candidate. After that, two efficient algorithms are proposed to incrementally compute top-k qualified query candidates as the diversified search intentions. Two selection criteria are targeted: the k selected query candidates are most relevant to the given query while they have to cover maximal number of distinct results. At last, a comprehensive evaluation on real and synthetic datasets demonstrates the effectiveness of our proposed diversification model and the efficiency of our algorithms.

History

Related Materials

  1. 1.
    DOI - Is published in 10.1109/TKDE.2014.2334297
  2. 2.
    ISSN - Is published in 10414347

Journal

IEEE Transactions on Knowledge and Data Engineering

Volume

27

Issue

3

Start page

660

End page

672

Total pages

13

Publisher

IEEE

Place published

United States

Language

English

Copyright

© 2013 IEEE

Former Identifier

2006050110

Esploro creation date

2020-06-22

Fedora creation date

2015-02-12

Usage metrics

    Scholarly Works

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC