RMIT University
Browse

Modeling term associations for ad-hoc retrieval performance within language modeling framework

conference contribution
posted on 2024-10-31, 15:35 authored by Xing Wei, Bruce Croft
Previous research has shown that using term associations could improve the effectiveness of information retrieval (IR) systems. However, most of the existing approaches focus on query reformulation. Document reformulation has just begun to be studied recently. In this paper, we study how to utilize term association measures to do document modeling, and what types of measures are effective in document language models. We propose a probabilistic term association measure, compare it to some traditional methods, such as the similarity co-efficient and window-based methods, in the language modeling (LM) framework, and show that significant improvements over query likelihood (QL) retrieval can be obtained. We also compare the method with state-of-the-art document modeling techniques based on latent mixture models.

History

Related Materials

  1. 1.
    DOI - Is published in 10.1007/978-3-540-71496-5_8
  2. 2.
    ISBN - Is published in 9783540714941 (urn:isbn:9783540714941)

Start page

52

End page

63

Total pages

12

Outlet

Proceedings of the 29th European Conference in Information Retrieval Research (ECIR2007)

Editors

Giambattista Amati, Claudio Carpineto and Giovanni Romano

Name of conference

29th European Conference in Information Retrieval Research (ECIR2007)

Publisher

Springer

Place published

Berlin, Germany

Start date

2007-04-02

End date

2007-04-05

Language

English

Copyright

© Springer-Verlag Berlin Heidelberg 2007

Former Identifier

2006024267

Esploro creation date

2020-06-22

Fedora creation date

2013-02-19

Usage metrics

    Scholarly Works

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC