Track 3: Knowledge Science, Knowledge Management and Knowledge Representation in IS

Relevance Judgment Convergence Degree – A Measure of Inconsistency among Assessors for Information Retrieval

Dengya Zhu, Curtin UniversityFollow
Shastri L. Nimmagadda, Curtin UniversityFollow
Kok Wai Wong, Murdoch UniversityFollow
Torsten Reiners, Curtin UniversityFollow

Abstract

Relevance judgment of human assessors is inherently subjective and dynamic when evaluation datasets are created for Information Retrieval (IR) systems. However, a small group of experts’ relevance judgment results are usually taken as ground truth to “objectively” evaluate the performance of the IR systems. Recent trends intend to employ a group of judges, such as outsourcing, to alleviate the potentially biased judgment results stemmed from using only a single expert’s judgment. Nevertheless, different judges may have different opinions and may not agree with each other, and the inconsistency in human relevance judgment may affect the IR system evaluation results. In this research, we introduce a Relevance Judgment Convergence Degree (RJCD) to measure the quality of queries in the evaluation datasets. Experimental results reveal a strong correlation coefficient between the proposed RJCD score and the performance differences between the two IR systems.

Recommended Citation

Zhu, D., Nimmagadda, Sh. L., Wong, K. W., & Reiners, T. (2022). Relevance Judgment Convergence Degree – A Measure of Inconsistency among Assessors for Information Retrieval. In R. A. Buchmann, G. C. Silaghi, D. Bufnea, V. Niculescu, G. Czibula, C. Barry, M. Lang, H. Linger, & C. Schneider (Eds.), Information Systems Development: Artificial Intelligence for Information Systems Development and Operations (ISD2022 Proceedings). Cluj-Napoca, Romania: Risoprint. ISBN: 978-973-53-2917-4. https://doi.org/10.62036/ISD.2022.38

Paper Type

Full Paper

DOI

10.62036/ISD.2022.38

References_DOI_ISD.2022.38.pdf (95 kB)

Download

COinS

Relevance Judgment Convergence Degree – A Measure of Inconsistency among Assessors for Information Retrieval

Track 3: Knowledge Science, Knowledge Management and Knowledge Representation in IS

Relevance Judgment Convergence Degree – A Measure of Inconsistency among Assessors for Information Retrieval

Abstract

Recommended Citation

Paper Type

DOI

Search

Browse

Author Corner

Links

Track 3: Knowledge Science, Knowledge Management and Knowledge Representation in IS

Relevance Judgment Convergence Degree – A Measure of Inconsistency among Assessors for Information Retrieval

Presenter Information

Abstract

Recommended Citation

Paper Type

DOI

Share

Search

Browse

Author Corner

Links