RMIT University
Browse

Choices in batch information retrieval evaluation

conference contribution
posted on 2024-10-31, 17:08 authored by Falk ScholerFalk Scholer, Alistair Moffat, Paul Thomas
Web search tools are used on a daily basis by billions of people. The commercial providers of these services spend large amounts of money measuring their own effectiveness and benchmarking against their competitors; nothing less than their corporate survival is at stake. Techniques for offline or "batch" evaluation of search quality have received considerable attention, spanning ways of constructing relevance judgments; ways of using them to generate numeric scores; and ways of inferring system "superiority" from sets of such scores. Our purpose in this paper is consider these mechanisms as a chain of inter-dependent activities, in order to explore some of the ramifications of alternative components. By disaggregating the different activities, and asking what the ultimate objective of the measurement process is, we provide new insights into evaluation approaches, and are able to suggest new combinations that might prove fruitful avenues for exploration. Our observations are examined with reference to data collected from a user study covering 34 users undertaking a total of six search tasks each, using two systems of markedly different quality. We hope to encourage broader awareness of the many factors that go into an evaluation of search effectiveness, and of the implications of these choices, and encourage researchers to carefully report all aspects of the evaluation process when describing their system performance experiments.

History

Related Materials

  1. 1.
    DOI - Is published in 10.1145/2537734.2537745
  2. 2.
    ISBN - Is published in 9781450325240 (urn:isbn:9781450325240)

Start page

74

End page

81

Total pages

8

Outlet

Proceedings of the 18th Australasian Document Computing Symposium

Editors

Shane Culpepper, Guido Zuccon, Laurianne Sitbon

Name of conference

ADCS '13

Publisher

ACM

Place published

New York, United Sates

Start date

2013-12-05

End date

2013-12-06

Language

English

Copyright

© 2013 ACM

Former Identifier

2006043358

Esploro creation date

2020-06-22

Fedora creation date

2014-01-20

Usage metrics

    Scholarly Works

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC