TY - GEN
T1 - Optimizing Tail Latency by Critical Window-based Dynamic Cache Space Allocation
AU - Du, Haiwen
AU - Zhang, Rui
AU - Lu, Yixuan
AU - Zhu, Dongjie
N1 - Publisher Copyright: © 2024 IEEE.
PY - 2024
Y1 - 2024
AB - Data read and write tail latency in distributed storage systems affects the quality of service of applications. In this paper, we focus on requests with latency around the 99.99th-percentile tail latency and design a critical window. By analyzing the distribution of target storage devices for requests in the critical window, we design a simple but effective cache space allocation method to optimize tail latency. Unlike traditional methods, it schedules target cache space allocation instead of requests. Since it does not change the processing of requests and I/Os, it reduces the extra time incurred by the scheduling algorithm. At the same time, it solves the problems of lag, tail latency fluctuation, and high resource consumption that load balancing-based tail latency guarantee algorithms exhibit in request scheduling. Finally, we verify the method's optimization effect on tail-latency metrics.
KW - Cache space management
KW - Quality of service (QoS)
KW - Tail latency management
UR - https://www.scopus.com/pages/publications/85206369800
U2 - 10.1109/IWQoS61813.2024.10682852
DO - 10.1109/IWQoS61813.2024.10682852
M3 - Conference contribution
AN - SCOPUS:85206369800
T3 - IEEE International Workshop on Quality of Service, IWQoS
BT - 2024 IEEE/ACM 32nd International Symposium on Quality of Service, IWQoS 2024
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 32nd IEEE/ACM International Symposium on Quality of Service, IWQoS 2024
Y2 - 19 June 2024 through 21 June 2024
ER -