RMIT University
Browse

The good and the bad system: Does the test collection predict users' effectiveness?

conference contribution
posted on 2024-10-31, 10:18 authored by Azzah Al-Maskari, Mark SandersonMark Sanderson, Paul Clough, Eija Airio
Test collections are extensively used in the evaluation of information retrieval systems. Crucial to their use is the degree to which results from them predict user effectiveness. At first, past studies did not substantiate a relationship between system and user effectiveness; more recently, however, correlations have begun to emerge. The results of this paper strengthen and extend those findings. We introduce a novel methodology for investigating the relationship, which shows great success in establishing a significant correlation between system and user effectiveness. It is shown that users behave differently and discern differences between pairs of systems that have a very small absolute difference in test collection effectiveness. Our results strengthen the use of test collections in IR evaluation, confirming that users' effectiveness can be predicted successfully.

History

Related Materials

  1. 1.
    DOI - Is published in 10.1145/1390334.1390347
  2. 2.
    ISBN - Is published in 9781605581644 (urn:isbn:9781605581644)

Start page

59

End page

66

Total pages

8

Outlet

Proceedings of the 31st Annual International ACM SIGIR Conference

Editors

Sung-Hyon Myaeng, Douglas W. Oard, Fabrizio Sebastiani, Tat-Seng Chua and Mun-Kew Leong

Name of conference

31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval

Publisher

ACM

Place published

Singapore

Start date

2008-07-20

End date

2008-07-24

Language

English

Copyright

© 2008 ACM

Former Identifier

2006021714

Esploro creation date

2020-06-22

Fedora creation date

2013-03-12

Usage metrics

    Scholarly Works

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC