Features of Disagreement Between Retrieval Effectiveness Measures

Timothy Jones,Paul Thomas,Falk Scholer,Mark Sanderson

Features of Disagreement Between Retrieval Effectiveness Measures

2015

Timothy Jones
Paul Thomas
Falk Scholer
Mark Sanderson

Many IR effectiveness measures are motivated from intuition, theory, or user studies. In general, most effectiveness measures are well correlated with each other. But, what about where they don't correlate? Which rankings cause measures to disagree? Are these rankings predictable for particular pairs of measures? In this work, we examine how and where metrics disagree, and identify differences that should be considered when selecting metrics for use in evaluating retrieval systems.

Keywords:

Data mining
Information retrieval
IR evaluation
Computer science
Intuition
user studies

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations