Skip to main navigation Skip to search Skip to main content

Multi-Source Data Repairing: A Comprehensive Survey

  • Hangzhou Dianzi University
  • Nanjing University of Aeronautics and Astronautics
  • Ltd.
  • Tencent

Research output: Contribution to journalReview articlepeer-review

Abstract

In the era of Big Data, integrating information from multiple sources has proven valuable in various fields. To ensure a high-quality supply of multi-source data, repairing different types of errors in the multi-source data becomes critical. This paper categorizes errors in multi-source data into entity information overlapping, attribute value conflicts, and attribute value inconsistencies. We first summarize existing repairing methods for these errors and then examine and review the study of the detection and repair of compound-type errors in multi-source data. Finally, we indicate further research directions in multi-source data repair.

Original languageEnglish
Article number2314
JournalMathematics
Volume11
Issue number10
DOIs
StatePublished - May 2023

Keywords

  • data dependencies
  • data quality
  • data repairing
  • entity resolution
  • multiple sources
  • truth discovery

Fingerprint

Dive into the research topics of 'Multi-Source Data Repairing: A Comprehensive Survey'. Together they form a unique fingerprint.

Cite this