Abstract

Existing approaches for evaluating data quality were established for settings where user requirements regarding data use can be explicitly gathered. However, users are often faced with new, unfamiliar, and repurposed datasets, where they have not been involved in the data collection and data creation processes. Furthermore, there is evidence that there is typically a lack of supporting information, such as metadata, for such datasets. Yet, users need to evaluate the quality of such data and determine if the data can be used for intended purposes. In this paper, we aim to gain an empirical understanding of the role of metadata in evaluating the quality of repurposed data. Using an interview approach, we collected rich qualitative data that reveals current practices, key challenges, preferences, and approaches for improvement regarding evaluating the quality of repurposed data.

Share

COinS