Skip to main navigation Skip to search Skip to main content

Cleanits: A data cleaning system for industrial time series

  • Harbin Institute of Technology

Research output: Contribution to journalConference articlepeer-review

Abstract

The great amount of time series generated by machines has enormous value in intelligent industry. Knowledge can be discovered from high-quality time series, and used for production optimization and anomaly detection in industry. However, the original sensors data always contain many errors. This requires a sophisticated cleaning strategy and a well-designed system for industrial data cleaning. Motivated by this, we introduce Cleanits, a system for industrial time series cleaning. It implements an integrated cleaning strategy for detecting and repairing three kinds of errors in industrial time series. We develop reliable data cleaning algorithms, considering features of both industrial time series and domain knowledge. We demonstrate Cleanits with two real datasets from power plants. The system detects and repairs multiple dirty data precisely, and improves the quality of industrial time series effectively. Cleanits has a friendly interface for users, and result visualization along with logs are available during each cleaning process.

Original languageEnglish
Pages (from-to)1786-1789
Number of pages4
JournalProceedings of the VLDB Endowment
Volume12
Issue number12
DOIs
StatePublished - 2018
Event45th International Conference on Very Large Data Bases, VLDB 2019 - Los Angeles, United States
Duration: 26 Aug 201730 Aug 2017

Fingerprint

Dive into the research topics of 'Cleanits: A data cleaning system for industrial time series'. Together they form a unique fingerprint.

Cite this