
CrowdCleaner: A data cleaning system based on crowdsourcing

  • Chen Ye
  • Hongzhi Wang
  • Keli Li
  • Qian Chen
  • Jianhua Chen
  • Jiangduo Song
  • Weidong Yuan
  • School of Computer Science and Technology, Harbin Institute of Technology

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

As real-world data is often dirty, data cleaning is a natural way to improve data quality. However, because they lack human knowledge, existing automatic data cleaning systems cannot find the proper values for dirty data. We therefore propose CrowdCleaner, an online data cleaning system based on crowdsourcing. CrowdCleaner provides a friendly interface for users dealing with different data quality problems. In this demonstration, we show the architecture of CrowdCleaner, highlight a few of its key features, and walk through the process by which it cleans data.

Original language: English
Title of host publication: Web Technologies and Applications - 16th Asia-Pacific Web Conference, APWeb 2014, Proceedings
Publisher: Springer Verlag
Pages: 657-661
Number of pages: 5
ISBN (Print): 9783319111155
State: Published - 2014
Externally published: Yes
Event: 16th Asia-Pacific Web Conference on Web Technologies and Applications, APWeb 2014 - Changsha, China
Duration: 5 Sep 2014 to 7 Sep 2014

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 8709 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 16th Asia-Pacific Web Conference on Web Technologies and Applications, APWeb 2014
Country/Territory: China
City: Changsha
Period: 5/09/14 to 7/09/14

Keywords

  • Data cleaning
  • Crowdsourcing
  • Truth discovery
