RMIT University
Browse

Investigating retrieval performance with manually-built topic models

conference contribution
posted on 2024-10-31, 15:42 authored by Xing Wei, Bruce Croft
Modeling text with topics is currently a popular research area in both Machine Learning and Information Retrieval (IR). Most of this research has focused on automatic methods though there are many hand-crafted topic resources available online. In this paper we investigate retrieval performance with topic models constructed manually based on a hand-crafted directory resource. The original query is smoothed on the manually selected topic model, which can also be viewed as an ¿ideal¿ user context model. Experiments with these topic models on the TREC retrieval tasks show that this type of topic model alone provides little benefit, and the overall performance is not as good as relevance modeling (which is an automatic query modification model). However, smoothing the query with topic models outperforms relevance models for a subset of the queries and automatic selection from these two models for particular queries gives better results overall than relevance models. We further demonstrate some improvements over relevance models with automatically built topic models based on the directory resource.

History

Start page

333

End page

349

Total pages

17

Outlet

Proceedings of the RIAO '07 Large Scale Semantic Access to Content (Text, Image, Video, and Sound)

Editors

David Evans, Sadaoki Furui, Chantal Soulé-Dupuy

Name of conference

RIAO '07 Large Scale Semantic Access to Content (Text, Image, Video, and Sound)

Publisher

Le Centre De Hautes Etudes Internationales D'Informatique Documentaire

Place published

Paris, France

Start date

2007-05-30

End date

2007-06-01

Language

English

Copyright

© Le Centre De Hautes Etudes Internationales D'Informatique Documentaire

Former Identifier

2006024271

Esploro creation date

2020-06-22

Fedora creation date

2012-11-01

Usage metrics

    Scholarly Works

    Keywords

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC