Skip to main navigation Skip to search Skip to main content

Apic: A Precomputation-Based Integer Compressor for OLTP Databases

  • Yufan Chen*
  • , Xiangyu Zou*
  • , Kaiwen Deng
  • , Hao Hu*
  • , Cai Deng*
  • , Ke Feng*
  • , Wen Xia*
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Current compressors for OLTP databases perform well on text but face challenges with integers, although integers are a critical component of the workload. Most existing integer compressors are ineffective as a complementary solution, since they compress integers together and cannot decompress a certain integer individually, making them incompatible with the data access requirement of OLTP databases. To this end, we propose Apic, a precomputation-based arithmetic coding to efficiently compress each integers (a very tiny unit), though small data are always hard to compress, and ensure compatibility with OLTP datasets. Specifically, Apic presents Bitwidth-aware Precomputed Frequency and Prefixaware Precomputed Decoding to tackle challenges of applying arithmetic coding in this scenario, such as the substantial space costs of symbol frequencies and decompression complexity. Evaluations on real-world and desensitized commercial datasets suggest that Apic improves the compression ratio by up to 80% on integers over VByte, while preserving comparable decompression speed and thus query performance.

Original languageEnglish
Title of host publicationProceedings - DCC 2025
Subtitle of host publication2025 Data Compression Conference
EditorsAli Bilgin, James E. Fowler, Joan Serra-Sagrista, Yan Ye, James A. Storer
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages303-312
Number of pages10
ISBN (Electronic)9798331534714
DOIs
StatePublished - 2025
Externally publishedYes
Event2025 Data Compression Conference, DCC 2025 - Snowbird, United States
Duration: 18 Mar 202521 Mar 2025

Publication series

NameData Compression Conference Proceedings
ISSN (Print)1068-0314

Conference

Conference2025 Data Compression Conference, DCC 2025
Country/TerritoryUnited States
CitySnowbird
Period18/03/2521/03/25

Fingerprint

Dive into the research topics of 'Apic: A Precomputation-Based Integer Compressor for OLTP Databases'. Together they form a unique fingerprint.

Cite this