RMIT University
Browse

Popularity Bias in False-positive Metrics for Recommender Systems Evaluation

journal contribution
posted on 2024-11-02, 20:15 authored by Elisa Mena Maldonado, Rocio Canamares, Pablo Castells, Yongli RenYongli Ren, Mark SandersonMark Sanderson
We investigate the impact of popularity bias in false-positive metrics in the offline evaluation of recommender systems. Unlike their true-positive complements, false-positive metrics reward systems that minimize recommendations disliked by users. Our analysis is, to the best of our knowledge, the first to show that false-positive metrics tend to penalise popular items, the opposite behavior of true-positive metrics - causing a disagreement trend between both types of metrics in the presence of popularity biases. We present a theoretical analysis of the metrics that identifies the reason that the metrics disagree and determines rare situations where the metrics might agree - the key to the situation lies in the relationship between popularity and relevance distributions, in terms of their agreement and steepness - two fundamental concepts we formalize. We then examine three well-known datasets using multiple popular true- and false-positive metrics on 16 recommendation algorithms. Specific datasets are chosen to allow us to estimate both biased and unbiased metric values. The results of the empirical study confirm and illustrate our analytical findings. With the conditions of the disagreement of the two types of metrics established, we then determine under which circumstances true-positive or false-positive metrics should be used by researchers of offline evaluation in recommender systems.

History

Journal

ACM Transactions on Information Systems

Volume

39

Number

36

Issue

3

Start page

1

End page

43

Total pages

43

Publisher

Association for Computing Machinery

Place published

United States

Language

English

Copyright

© 2021 Copyright held by the owner/author(s). Publication rights licensed to ACM.

Former Identifier

2006117149

Esploro creation date

2022-08-26

Usage metrics

    Scholarly Works

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC