Skip to main navigation Skip to search Skip to main content

DCUBE: CUBE on dirty databases

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In the real world databases, dirty data such as inconsistent data, duplicate data affect the effectiveness of applications with database. It brings new challenges to efficiently process OLAP on the database with dirty data. CUBE is an important operator for OLAP. This paper proposes the CUBE operation based on overlapping clustering, and an effective and efficient storing and computing method for CUBE on the database with dirty data. Based on CUBE, this paper proposes efficient algorithms for answering aggregation queries, and the processing methods of other major operators for OLAP on the database with dirty data. Experimental results show the efficiency of the algorithms presented in this paper.

Original languageEnglish
Title of host publicationWeb-Age Information Management - 11th International Conference, WAIM 2010, Proceedings
Pages507-512
Number of pages6
DOIs
StatePublished - 2010
Externally publishedYes
Event11th International Conference on Web-Age Information Management, WAIM 2010 - Jiuzhaigou, China
Duration: 15 Jul 201017 Jul 2010

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume6184 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference11th International Conference on Web-Age Information Management, WAIM 2010
Country/TerritoryChina
CityJiuzhaigou
Period15/07/1017/07/10

Keywords

  • CUBE
  • OLAP
  • dirty data

Fingerprint

Dive into the research topics of 'DCUBE: CUBE on dirty databases'. Together they form a unique fingerprint.

Cite this