
Cost-Based Lightweight Storage Automatic Decision for In-Database Machine Learning

  • Shuangshuang Cui
  • Hongzhi Wang*
  • Haiyao Gu
  • Yuntian Xie

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

Storage structure decision for a database aims to automatically determine an effective storage structure according to the data distribution and workload. As machine learning and databases become more tightly integrated, complex machine learning tasks are executed directly in the database and need the support of an efficient storage structure. Existing storage decision methods are mainly oriented to common workloads and rely on the decisions of experienced DBAs, which is inefficient and carries a high risk of error. Thus, an automated storage structure decision method for in-database machine learning is urgently needed. We propose a cost-based lightweight system that automatically decides between row and column storage. To the best of our knowledge, this is the first storage structure selection method for machine learning tasks. Extensive experiments show that the accuracy of the storage structure decision is above 90%, that task execution time is shortened by about 85%, and that the risk of decision error is greatly reduced.

Original language: English
Title of host publication: Web Information Systems Engineering - WISE 2021 - 22nd International Conference on Web Information Systems Engineering, WISE 2021, Proceedings
Editors: Wenjie Zhang, Lei Zou, Zakaria Maamar, Lu Chen
Publisher: Springer Science and Business Media Deutschland GmbH
Pages: 119-126
Number of pages: 8
ISBN (Print): 9783030908874
DOIs
State: Published - 2021
Event: 22nd International Conference on Web Information Systems Engineering, WISE 2021 - Melbourne, Australia
Duration: 26 Oct 2021 to 29 Oct 2021

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 13080 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 22nd International Conference on Web Information Systems Engineering, WISE 2021
Country/Territory: Australia
City: Melbourne
Period: 26/10/21 to 29/10/21

Keywords

  • AI for DB
  • Data partition
  • Row and column storage
