RMIT University
Browse

Query structuring and expansion with two-stage term dependence for Japanese web retrieval

journal contribution
posted on 2024-11-01, 09:12 authored by Koji Eguchi, Bruce Croft
In this paper, we propose a new term dependence model for information retrieval, which is based on a theoretical framework using Markov random fields. We assume two types of dependencies of terms given in a query: (i) long-range dependencies that may appear for instance within a passage or a sentence in a target document, and (ii) short-range dependencies that may appear for instance within a compound word in a target document. Based on this assumption, our two-stage term dependence model captures both long-range and short-range term dependencies differently, when more than one compound word appear in a query. We also investigate how query structuring with term dependence can improve the performance of query expansion using a relevance model. The relevance model is constructed using the retrieval results of the structured query with term dependence to expand the query. We show that our term dependence model works well, particularly when using query structuring with compound words, through experiments using a 100-gigabyte test collection of web documents mostly written in Japanese. We also show that the performance of the relevance model can be significantly improved by using the structured query with our term dependence model.

History

Journal

Information Retrieval

Volume

12

Issue

3

Start page

251

End page

274

Total pages

24

Publisher

Springer Netherlands

Place published

Netherlands

Language

English

Copyright

© 2009 Springer Science+Business Media, LLC

Former Identifier

2006024119

Esploro creation date

2020-06-22

Fedora creation date

2012-02-03

Usage metrics

    Scholarly Works

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC