Skip to main navigation Skip to search Skip to main content

Global repair bandwidth cost optimization of generalized regenerating codes in clustered distributed storage systems

  • Harbin Institute of Technology Shenzhen
  • Peng Cheng Laboratory
  • James Cook University Queensland
  • La Trobe University

Research output: Contribution to journalArticlepeer-review

Abstract

In clustered distributed storage systems (CDSSs), one of the main design goals is minimizing the transmission cost during the failed storage nodes repairing. Generalized regenerating codes (GRCs) are proposed to balance the intra-cluster repair bandwidth and the inter-cluster repair bandwidth for guaranteeing data availability. The trade-off performance of GRCs illustrates that, it can reduce storage overhead and inter-cluster repair bandwidths simultaneously. However, in practical big data storage scenarios, GRCs cannot give an effective solution to handle the heterogeneity of bandwidth costs among different clusters for node failures recovery. This paper proposes an asymmetric bandwidth allocation strategy (ABAS) of GRCs for the inter-cluster repair in heterogeneous CDSSs. Furthermore, an upper bound of the achievable capacity of ABAS is derived based on the information flow graph (IFG), and the constraints of storage capacity and intra-cluster repair bandwidth are also elaborated. Then, a metric termed global repair bandwidth cost (GRBC), which can be minimized regarding of the inter-cluster repair bandwidths by solving a linear programming problem, is defined. The numerical results demonstrate that, maintaining the same data availability and storage overhead, the proposed ABAS of GRCs can effectively reduce the GRBC compared to the traditional symmetric bandwidth allocation schemes.

Original languageEnglish
Pages (from-to)2469-2481
Number of pages13
JournalIET Communications
Volume15
Issue number19
DOIs
StatePublished - Dec 2021
Externally publishedYes

Fingerprint

Dive into the research topics of 'Global repair bandwidth cost optimization of generalized regenerating codes in clustered distributed storage systems'. Together they form a unique fingerprint.

Cite this