Skip to main navigation Skip to search Skip to main content

DDRM: An SLO-aware Deep Dynamic Resource Management Framework for Microservices

  • Liangping Tang
  • , Jin Wang*
  • , Wanyou Wang
  • , Gaotao Shi
  • , Zhijun Li*
  • *Corresponding author for this work
  • Soochow University
  • Faculty of Computing, Harbin Institute of Technology
  • Tianjin University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Loosely coupled microservice architectures have been widely adopted in cloud-native applications due to their inherent advantages in modularity, development agility, and scalability. However, the resulting complex and dynamic service topologies introduce intricate inter-service dependencies, which often lead to backpressure effects and queuing delays. These phenomena significantly challenge traditional monolithic and rule-based resource management approaches, which struggle to capture the non-linear performance characteristics and long-term effects of resource allocation decisions in such environments. To address these challenges, we propose DDRM, a two-stage predictor-decider collaborative framework for dynamic resource management in microservice systems. DDRM integrates deep learning to model inter-service interactions and predict the probability of Service Level Objective (SLO) violations, and employs reinforcement learning to optimize resource allocation decisions by maximizing long-term cumulative rewards while meeting SLO targets. Extensive evaluations demonstrate that DDRM outperforms state-of-the-art baselines by up to 29.8%, while exhibiting strong stability and adaptability under highly varying workloads.

Original languageEnglish
Title of host publicationProceedings of the 2025 IEEE International Conference on Cluster Computing, CLUSTER 2025
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798331530198
DOIs
StatePublished - 2025
Externally publishedYes
Event2025 IEEE International Conference on Cluster Computing, CLUSTER 2025 - Edinburgh, United Kingdom
Duration: 3 Sep 20255 Sep 2025

Publication series

NameProceedings - IEEE International Conference on Cluster Computing, ICCC
ISSN (Print)1552-5244

Conference

Conference2025 IEEE International Conference on Cluster Computing, CLUSTER 2025
Country/TerritoryUnited Kingdom
CityEdinburgh
Period3/09/255/09/25

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 8 - Decent Work and Economic Growth
    SDG 8 Decent Work and Economic Growth
  2. SDG 12 - Responsible Consumption and Production
    SDG 12 Responsible Consumption and Production

Keywords

  • cloud computing
  • deep learning for systems
  • microservices
  • resource efficiency
  • resource man-agement

Fingerprint

Dive into the research topics of 'DDRM: An SLO-aware Deep Dynamic Resource Management Framework for Microservices'. Together they form a unique fingerprint.

Cite this