Skip to main navigation Skip to search Skip to main content

AutoMC: Automated Model Compression Based on Domain Knowledge and Progressive Search

  • Chunnan Wang
  • , Hongzhi Wang*
  • , Xiangyu Shi
  • *Corresponding author for this work
  • Harbin Institute of Technology

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Model compression methods can reduce model complexity on the premise of maintaining acceptable performance, and thus promote the application of deep neural networks under resource constrained environments. Despite their great success, the selection of suitable compression methods and design of details of the compression scheme are difficult, requiring lots of domain knowledge as support, which is not friendly to non-expert users. To make more users easily access to the model compression scheme that best meet their needs, in this paper, we propose AutoMC, an effective and efficient automatic tool for model compression. In order to improve the search efficiency and quality, in AutoMC, we build the domain knowledge on model compression to deeply understand the characteristics and advantages of each compression method under different settings. This method can provide AutoMC with the more reasonable guidance and thus reduce useless evaluation. In addition, we present a progressive search strategy to efficiently explore pareto optimal compression scheme according to the learned prior knowledge combined with the historical evaluation information. This strategy can help AutoMC selectively and gradually explore more valuable search space, and thus reduce the search difficulty and improve the search efficiency. Extensive experimental results show that AutoMC can provide users with better compression schemes within short time compared to the existing compression methods and AutoML algorithms, which demonstrates the effectiveness and significance of our proposed algorithm.

Original languageEnglish
Title of host publicationProceedings - 2024 IEEE 40th International Conference on Data Engineering, ICDE 2024
PublisherIEEE Computer Society
Pages1819-1832
Number of pages14
ISBN (Electronic)9798350317152
DOIs
StatePublished - 2024
Event40th IEEE International Conference on Data Engineering, ICDE 2024 - Utrecht, Netherlands
Duration: 13 May 202417 May 2024

Publication series

NameProceedings - International Conference on Data Engineering
ISSN (Print)1084-4627
ISSN (Electronic)2375-0286

Conference

Conference40th IEEE International Conference on Data Engineering, ICDE 2024
Country/TerritoryNetherlands
CityUtrecht
Period13/05/2417/05/24

Keywords

  • Automated machine learning
  • domain knowledge
  • model compression
  • progressive search

Fingerprint

Dive into the research topics of 'AutoMC: Automated Model Compression Based on Domain Knowledge and Progressive Search'. Together they form a unique fingerprint.

Cite this