ICIS 2004 Proceedings

Reconciling Attribute Values from Multiple Data Sources

Zhengrui Jiang, University of Texas at Dallas
Sumit Sarkar, University of Texas at Dallas
Prabuddha De, Purdue University
Debabrata Dey, University of Washington

Abstract

Because of the heterogeneous nature of multiple data sources, data integration is often one of the most challenging tasks of today’s information systems. While the existing literature has focused on problems such as schema integration and entity identification, our current study attempts to answer a basic question: When an attribute value for a real-world entity is recorded differently in two databases, how should the “best” value be chosen from the set of possible values? We first show how probabilities for attribute values can be derived, and then propose a framework for deciding the cost-minimizing value based on the total cost of type I, type II, and misrepresentation errors.

Recommended Citation

Jiang, Zhengrui; Sarkar, Sumit; De, Prabuddha; and Dey, Debabrata, "Reconciling Attribute Values from Multiple Data Sources" (2004). ICIS 2004 Proceedings. 59.
https://aisel.aisnet.org/icis2004/59

Download

COinS

ICIS 2004 Proceedings

Reconciling Attribute Values from Multiple Data Sources

Abstract

Recommended Citation

Search

Links

Browse

Author Corner

ICIS 2004 Proceedings

Reconciling Attribute Values from Multiple Data Sources

Authors

Abstract

Recommended Citation

Share

Search

Links

Browse

Author Corner