RMIT University
Browse

Discovering key concepts in verbose queries

conference contribution
posted on 2024-10-31, 15:33 authored by Michael Bendersky, Bruce Croft
Current search engines do not, in general, perform well with longer, more verbose queries. One of the main issues in processing these queries is identifying the key concepts that will have the most impact on effectiveness. In this paper, we develop and evaluate a technique that uses query-dependent, corpus-dependent, and corpus-independent features for automatic extraction of key concepts from verbose queries. We show that our method achieves higher accuracy in the identification of key concepts than standard weighting methods such as inverse document frequency. Finally, we propose a probabilistic model for integrating the weighted key concepts identified by our method into a query, and demonstrate that this integration significantly improves retrieval effectiveness for a large set of natural language description queries derived from TREC topics on several newswire and web collections.

History

Start page

491

End page

498

Total pages

8

Outlet

Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2008)

Editors

Sung-Hyon Myaeng, Douglas W. Oard, Fabrizio Sebastiani, Tat-Seng Chua and Mun-Kew leong

Name of conference

31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2008)

Publisher

ACM

Place published

New York, USA

Start date

2008-07-20

End date

2008-07-24

Language

English

Copyright

Copyright 2008 ACM.

Former Identifier

2006024294

Esploro creation date

2020-06-22

Fedora creation date

2013-03-12

Usage metrics

    Scholarly Works

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC