RMIT University
Browse

sMARE: a new paradigm to evaluate and understand query performance prediction methods

journal contribution
posted on 2024-11-02, 19:49 authored by Guglielmo Faggioli, Oleg ZendelOleg Zendel, Shane CulpepperShane Culpepper, Nicola Ferro, Falk ScholerFalk Scholer
Query performance prediction (QPP) has been studied extensively in the IR community over the last two decades. A by-product of this research is a methodology to evaluate the effectiveness of QPP techniques. In this paper, we re-examine the existing evaluation methodology commonly used for QPP, and propose a new approach. Our key idea is to model QPP performance as a distribution instead of relying on point estimates. To obtain such distribution, we exploit the scaled Absolute Ranking Error (sARE) measure, and its mean the scaled Mean Absolute Ranking Error (sMARE). Our work demonstrates important statistical implications, and overcomes key limitations imposed by the currently used correlation-based point-estimate evaluation approaches. We also explore the potential benefits of using multiple query formulations and ANalysis Of VAriance (ANOVA) modeling in order to measure interactions between multiple factors. The resulting statistical analysis combined with a novel evaluation framework demonstrates the merits of modeling QPP performance as distributions, and enables detailed statistical ANOVA models for comparative analyses to be created.

Funding

New approaches to interactive sessional search for complex tasks

Australian Research Council

Find out more...

History

Journal

Information Retrieval Journal

Volume

25

Issue

2

Start page

94

End page

122

Total pages

29

Publisher

Springer

Place published

Germany

Language

English

Copyright

© The Author(s) 2022. Open Access This article is licensed under a Creative Commons Attribution 4.0 International License

Former Identifier

2006115219

Esploro creation date

2022-10-30

Usage metrics

    Scholarly Works

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC