Abstract
With the proliferation of cloud services and the continuous growth in enterprises’ demand for dynamic multidimensional resources, the implementation of effective strategy for time-varying workload scheduling has become increasingly significant. In this paper, we propose a deep reinforcement learning (DRL)-based method for time-varying workload scheduling, aiming to allocate resources efficiently across servers in the cluster. Specifically, we integrate a classifier and queue scorer to construct a priority queue that exploits temporal resource utilization patterns across different workload classes. Then, we design parallel graph attention layers to capture the dimensional features and temporal dynamics of cloud server cluster. Moreover, we propose a DRL algorithm to generate scheduling strategies that can adapt to dynamic environments. Validation on real-world traces from Google cluster demonstrates that our method outperforms existing approaches in key metrics of cloud server cluster management.
| Original language | English |
|---|---|
| Pages (from-to) | 2838-2852 |
| Number of pages | 15 |
| Journal | IEEE Transactions on Network and Service Management |
| Volume | 22 |
| Issue number | 3 |
| DOIs | |
| State | Published - 2025 |
| Externally published | Yes |
Keywords
- Cloud computing
- resource allocation
- server cluster
- workload scheduling
Fingerprint
Dive into the research topics of 'DRL-Based Time-Varying Workload Scheduling With Priority and Resource Awareness'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver