RMIT University

Differences in effectiveness across sub-collections

conference contribution
posted on 2024-10-31, 16:06 authored by Mark Sanderson, Andrew Turpin, Ying Zhang, Falk Scholer

The relative performance of retrieval systems when evaluated on one part of a test collection may bear little or no similarity to the relative performance measured on a different part of the collection. In this paper we report the results of a detailed study of the impact that different sub-collections have on retrieval effectiveness, analyzing the effect over many collections, and with different approaches to sub-dividing the collections. The effect is shown to be substantial, impacting on comparisons between retrieval runs that are statistically significant. Some possible causes for the effect are investigated, and the implications of this work are examined for test collection design and for the strength of conclusions one can draw from experimental results.

History

Start page

1965

End page

1969

Total pages

5

Outlet

Proceedings of the CIKM ACM Conference on Information and Knowledge Management 2012

Editors

Xue-wen Chen

Name of conference

CIKM ACM Conference on Information and Knowledge Management 2012

Publisher

ACM

Place published

Maui, United States

Start date

2012-10-29

End date

2012-11-02

Language

English

Copyright

© 2013 ACM, Inc.

Former Identifier

2006034618

Esploro creation date

2020-06-22

Fedora creation date

2013-01-13
