RMIT University

Test collection based evaluation of information retrieval systems

journal contribution
posted on 2024-11-01, 08:34 authored by Mark Sanderson
The use of test collections and evaluation measures to assess the effectiveness of information retrieval systems has its origins in work dating back to the early 1950s. In the nearly 60 years since that work started, the use of test collections has become a de facto standard of evaluation. This monograph surveys the research conducted and explains the methods and measures devised for evaluating retrieval systems, including a detailed look at the use of statistical significance testing in retrieval experimentation. It also reviews more recent examinations of the validity of the test collection approach and of evaluation measures, and outlines trends in current research exploiting query logs and live labs. At its core, the modern-day test collection is little different from the structures that the pioneering researchers of the 1950s and 1960s conceived. This tutorial and review shows that, despite its age, this long-standing evaluation method is still a highly valued tool for retrieval research.

History

Journal

Foundations and Trends in Information Retrieval

Volume

4

Issue

4

Start page

247

End page

375

Total pages

129

Publisher

Now Publishers Inc.

Place published

United States

Language

English

Copyright

© 2010 M. Sanderson

Former Identifier

2006021664

Esploro creation date

2020-06-22

Fedora creation date

2011-01-21
