RMIT University
Browse

A Markov random field model for term dependencies

conference contribution
posted on 2024-10-31, 15:32 authored by Donald Metzler, Bruce Croft
This paper develops a general, formal framework for modeling term dependencies via Markov random fields. The model allows for arbitrary text features to be incorporated as evidence. In particular, we make use of features based on occurrences of single terms, ordered phrases, and unordered phrases. We explore full independence, sequential dependence, and full dependence variants of the model. A novel approach is developed to train the model that directly maximizes the mean average precision rather than maximizing the likelihood of the training data. Ad hoc retrieval experiments are presented on several newswire and web collections, including the GOV2 collection used at the TREC 2004 Terabyte Track. The results show significant improvements are possible by modeling dependencies, especially on the larger web collections.

History

Start page

472

End page

479

Total pages

8

Outlet

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval (SIGIR '05)

Name of conference

28th annual international ACM SIGIR conference on Research and development in information retrieval (SIGIR '05)

Publisher

ACM

Place published

New York, USA

Start date

2005-08-15

End date

2005-08-19

Language

English

Copyright

Copyright 2005 ACM

Former Identifier

2006024168

Esploro creation date

2020-06-22

Fedora creation date

2013-02-19

Usage metrics

    Scholarly Works

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC