Abstract
Loosely coupled microservice architectures have been widely adopted in cloud-native applications due to their inherent advantages in modularity, development agility, and scalability. However, the resulting complex and dynamic service topologies introduce intricate inter-service dependencies, which often lead to backpressure effects and queuing delays. These phenomena significantly challenge traditional monolithic and rule-based resource management approaches, which struggle to capture the non-linear performance characteristics and long-term effects of resource allocation decisions in such environments. To address these challenges, we propose DDRM, a two-stage predictor-decider collaborative framework for dynamic resource management in microservice systems. DDRM integrates deep learning to model inter-service interactions and predict the probability of Service Level Objective (SLO) violations, and employs reinforcement learning to optimize resource allocation decisions by maximizing long-term cumulative rewards while meeting SLO targets. Extensive evaluations demonstrate that DDRM outperforms state-of-the-art baselines by up to 29.8%, while exhibiting strong stability and adaptability under highly varying workloads.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 2025 IEEE International Conference on Cluster Computing, CLUSTER 2025 |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| ISBN (Electronic) | 9798331530198 |
| DOIs | |
| State | Published - 2025 |
| Externally published | Yes |
| Event | 2025 IEEE International Conference on Cluster Computing, CLUSTER 2025 - Edinburgh, United Kingdom Duration: 3 Sep 2025 → 5 Sep 2025 |
Publication series
| Name | Proceedings - IEEE International Conference on Cluster Computing, ICCC |
|---|---|
| ISSN (Print) | 1552-5244 |
Conference
| Conference | 2025 IEEE International Conference on Cluster Computing, CLUSTER 2025 |
|---|---|
| Country/Territory | United Kingdom |
| City | Edinburgh |
| Period | 3/09/25 → 5/09/25 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 8 Decent Work and Economic Growth
-
SDG 12 Responsible Consumption and Production
Keywords
- cloud computing
- deep learning for systems
- microservices
- resource efficiency
- resource man-agement
Fingerprint
Dive into the research topics of 'DDRM: An SLO-aware Deep Dynamic Resource Management Framework for Microservices'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver