Abstract

The internet is currently evolving from the "Web of Documents" into the "Web of Data" where data is available on web-scale in the so called Semantic Web (1) to retrieve information or (2) for data reuse, e.g. within applications for a higher degree of automation. At present, there is already a lot of data available on the Semantic Web, but unfortunately we do not know much about their quality due to missing techniques and methodologies for information quality assessment. In this paper, we provide a framework for information quality assessment of Semantic Web data called SWIQA by solely using Semantic Web technologies. Other than survey-based techniques for information quality assessment SWIQA employs data quality rule templates to express quality requirements which are automatically used to identify deficient data and calculate quality scores. Hence, using our approach minimizes manual effort while providing transparency about the quality of Semantic Web data. SWIQA may, therefore, be used by data consumers to find high quality data sources or by data owners to keep track of the quality of their own data.

Share

COinS