RMIT University
Browse

Relatively relevant: Assessor shift in document judgements

conference contribution
posted on 2024-10-31, 10:23 authored by Mark SandersonMark Sanderson, Falk ScholerFalk Scholer, Andrew Turpin
The evaluation of information retrieval systems relies on relevance judgements - human assessments of whether a document is relevant to a specified search request. In the past, it was demonstrated that test collection assessors disagree with each other to some extent on the relevance of documents and can be inconsistent in themselves. This paper describes a series of investigations on assessor consistency, which demonstrate that the inconsistency of an assessor varies over time. We show that when documents are presented to assessors in a relevance independent order, documents judged as relevant appear to cluster. Examining pairs of documents in a sequence ordered by timeof-judgement, we find that relevance assessors judge highly similar document pairs more consistently when the pairs are seen soon after each other; the consistency reduces when the pairs are judged further apart. We contend that our analysis shows that changes are not due to random error, but instead reflect a relevance shift, whereby the assessor's conception of what constitutes a relevant document changes over time. Studying types of relevance judgement we find that the shift in judgements is greatest between highly and partially relevant documents. We also examine the impact of this inconsistency on how retrieval runs are ranked relative to each other and find that there appears to be a noticeable effect on such rankings.

History

Related Materials

  1. 1.
    ISBN - Is published in 9781921426803 (urn:isbn:9781921426803)

Start page

60

End page

67

Total pages

8

Outlet

Proceedings of the Australasian Document Computing Symposium

Editors

Falk Scholer, Andrew Trotman, Andrew Turpin

Name of conference

17th Australasian Document Computing Symposium

Publisher

RMIT Press

Place published

Australia

Start date

2010-12-10

End date

2010-12-10

Language

English

Former Identifier

2006023502

Esploro creation date

2020-06-22

Fedora creation date

2011-11-08

Usage metrics

    Scholarly Works

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC