Explicit Constructions of High-Rate MDS Array Codes With Optimal Repair Bandwidth

doi:10.1109/TIT.2017.2661313

Citations

PDF

Open Access

More filters

Journal Article•DOI•

Explicit Constructions of Optimal-Access MDS Codes With Nearly Optimal Sub-Packetization

[...]

Min Ye¹, Alexander Barg¹•Institutions (1)

University of Maryland, College Park¹

24 Jul 2017-IEEE Transactions on Information Theory

TL;DR: In this article, an explicit construction of optimal-access MDS codes with sub-packetization is presented, which differs from the optimal value by at most a factor of $r^{2}$.

...read moreread less

Abstract: An $(n,k,l)$ maximum distance separable (MDS) array code of length $n$ , dimension $k=n-r$ , and sub-packetization $l$ is formed of $l\times n$ matrices over a finite field $F$ , with every column of the matrix stored on a separate node in the distributed storage system and viewed as a coordinate of the codeword. Repair of a failed node (recovery of one erased column) can be performed by accessing a set of $d\le n-1$ surviving (helper) nodes. The code is said to have the optimal access property if the amount of data accessed at each of the helper nodes meets a lower bound on this quantity. For optimal-access MDS codes with $d=n-1$ , the sub-packetization $l$ satisfies the bound $l\ge r^{(k-1)/r}$ . In our previous work (IEEE Trans. Inf. Theory, vol. 63, no. 4, 2017), for any $n$ and $r$ , we presented an explicit construction of optimal-access MDS codes with sub-packetization $l=r^{n-1}$ . In this paper, we take up the question of reducing the sub-packetization value $l$ to make it to approach the lower bound. We construct an explicit family of optimal-access codes with $l=r^{\lceil n/r\rceil }$ , which differs from the optimal value by at most a factor of $r^{2}$ . These codes can be constructed over any finite field $F$ as long as $|F|\ge r\lceil n/r\rceil $ , and afford low-complexity encoding and decoding procedures. We also define a version of the repair problem that bridges the context of regenerating codes and codes with locality constraints (LRC codes), which we call group repair with optimal access . In this variation, we assume that the set of $n=sm$ nodes is partitioned into $m$ repair groups of size $s$ , and require that the amount of accessed data for repair is the smallest possible whenever the $d=s+k-1$ helper nodes include all the other $s-1$ nodes from the same group as the failed node. For this problem, we construct a family of codes with the group optimal access property. These codes can be constructed over any field $F$ of size $|F|\ge n$ , and also afford low-complexity encoding and decoding procedures.

...read moreread less

185 citations

Journal Article•DOI•

Minimum Storage Regenerating Codes for All Parameters

[...]

Sreechakra Goparaju¹, Arman Fazeli¹, Alexander Vardy¹•Institutions (1)

University of California, San Diego¹

17 Apr 2017-IEEE Transactions on Information Theory

TL;DR: Regenerating codes for distributed storage have attracted much research interest in the past decade and can be relaxed to requiring the optimal repair bandwidth for systematic nodes only.

...read moreread less

Abstract: Regenerating codes for distributed storage have attracted much research interest in the past decade. Such codes trade the bandwidth needed to repair a failed node with the overall amount of data stored in the network. Minimum storage regenerating (MSR) codes are an important class of optimal regenerating codes that minimize (first) the amount of data stored per node and (then) the repair bandwidth. Specifically, an $[n,k,d]$ - $(\alpha )$ MSR code $ \mathbb {C}$ over $ \smash {\mathbb {F}_{\!q}}$ stores a file $ {\mathcal{ F}}$ consisting of $\alpha k$ symbols over $ \smash {\mathbb {F}_{\!q}}$ among $n$ nodes, each storing $\alpha $ symbols, in such a way that: 1) the file $ {\mathcal{ F}}$ can be recovered by downloading the content of any $k$ of the $n$ nodes and 2) the content of any failed node can be reconstructed by accessing any $d$ of the remaining $n-1$ nodes and downloading $\alpha /(d{-}k{+}1)$ symbols from each of these nodes. In practice, the file $ {\mathcal{ F}}$ is typically available in uncoded form on some $k$ of the $n$ nodes, known as systematic nodes , and the defining node-repair condition above can be relaxed to requiring the optimal repair bandwidth for systematic nodes only . Such codes are called systematic–repair MSR codes . Unfortunately, finite– $\alpha $ constructions of $[n,k,d]$ MSR codes are known only for certain special cases: either low rate, namely $k/n \leqslant 0.5$ , or high repair connectivity, namely $d = n-1$ . Our main result in this paper is a finite– $\alpha $ construction of systematic-repair $[n,k,d]$ MSR codes for all possible values of parameters $n,k,d$ . We also introduce a generalized construction for $[n,k]$ MSR codes to achieve the optimal repair bandwidth for all values of $d$ simultaneously.

...read moreread less

107 citations

Cites background from "Explicit Constructions of High-Rate..."

...We also refer the reader to [22]–[24], where MSR constructions with...
[...]
...Most recently, Ye and Barg [22], [23] show that [n, k, d] MSR codes can be explicitly constructed4 over a small finite field and with a near optimal sub-packetization α. Sasidharan et al. [24] also construct explicit [n, k, d = n − 1] MSR codes with these properties....
[...]
...Most recently, Ye and Barg [22], [23] show that [n, k, d] MSR codes can be explicitly constructed4 over a small finite field and with a near optimal sub-packetization α....
[...]

Journal Article•DOI•

Erasure coding for distributed storage: an overview

[...]

S. B. Balaji¹, M. Nikhil Krishnan¹, Myna Vajha¹, Vinayak Ramkumar¹, Birenjith Sasidharan¹, P. Vijay Kumar², P. Vijay Kumar¹ - Show less +3 more•Institutions (2)

Indian Institute of Science¹, University of Southern California²

06 Sep 2018-Science in China Series F: Information Sciences

TL;DR: This survey provides an overview of the efforts in this direction by introducing two new classes of erasure codes, namely regenerating codes and locally recoverable codes as well as by coming up with novel ways to repair the ubiquitous Reed-Solomon code.

...read moreread less

Abstract: In a distributed storage system, code symbols are dispersed across space in nodes or storage units as opposed to time. In settings such as that of a large data center, an important consideration is the efficient repair of a failed node. Efficient repair calls for erasure codes that in the face of node failure, are efficient in terms of minimizing the amount of repair data transferred over the network, the amount of data accessed at a helper node as well as the number of helper nodes contacted. Coding theory has evolved to handle these challenges by introducing two new classes of erasure codes, namely regenerating codes and locally recoverable codes as well as by coming up with novel ways to repair the ubiquitous Reed-Solomon code. This survey provides an overview of the efforts in this direction that have taken place over the past decade.

...read moreread less

81 citations

Journal Article•DOI•

A Generic Transformation to Enable Optimal Repair in MDS Codes for Distributed Storage Systems

[...]

Jie Li¹, Xiaohu Tang¹, Chao Tian²•Institutions (2)

Southwest Jiaotong University¹, University of Tennessee²

11 Jul 2018-IEEE Transactions on Information Theory

TL;DR: A generic transformation is proposed that can transform any nonbinary MDS code with the optimal repair bandwidth or the optimal rebuilding access for the systematic nodes only, into a new M DS code which possesses the corresponding repair optimality for all nodes.

...read moreread less

Abstract: We propose a generic transformation that can convert any nonbinary $(n=k{+}r,k)$ maximum distance separable (MDS) code into another $(n,k)$ MDS code over the same field such that: 1) some arbitrarily chosen $r$ nodes have the optimal repair bandwidth and the optimal rebuilding access; 2) for the remaining $k$ nodes, the normalized repair bandwidth and the normalized rebuilding access (over the file size) are preserved; and 3) the sub-packetization level is increased only by a factor of $r$ . Two immediate applications of this generic transformation are then presented. The first application is that we can transform any nonbinary MDS code with the optimal repair bandwidth or the optimal rebuilding access for the systematic nodes only, into a new MDS code which possesses the corresponding repair optimality for all nodes. The second application is that by applying the transformation multiple times, any nonbinary $(n,k)$ scalar MDS code can be converted into an $(n,k)$ MDS code with the optimal repair bandwidth and the optimal rebuilding access for all nodes, or only a subset of nodes, whose sub-packetization level is also optimal.

...read moreread less

65 citations

Cites background or methods or result from "Explicit Constructions of High-Rate..."

...As a result, the optimal repair bandwidth and the optimal rebuilding access1 were subsequently established [6], [7]....
[...]
...One key new ingredient in [7] and [22]–[24], in contrast to most previous efforts, is that these constructions are given in terms of parity-check matrix, and as a consequence they do not distinguish between the systematic nodes and the parity nodes at all....
[...]
...Independent and parallel to our work, Ye and Barg [7], [22] proposed several explicit constructions of high-rate MDS codes that can optimally repair all nodes....
[...]
...A comparison between the piggyback codes in [27] and [29] and the resultant MDS codes obtained from the first application in Section IV is provided in Table I, a comparison between the MDS codes proposed by Ye and Barg and the codes obtained from the first application in Section IV is provided in Table II, and a comparison between the MDS codes proposed in [22] and [23] and the codes obtained from the second application in Section IV is provided in Table III....
[...]
...A COMPARISON OF SOME PARAMETERS BETWEEN THE (n, k) MDS CODES IN [7], [22] AND THE EXPLICIT (n, k) MDS CODES OBTAINED...
[...]

Proceedings Article•DOI•

A generic transformation for optimal repair bandwidth and rebuilding access in MDS codes

[...]

Chao Tian¹, Jie Li², Xiaohu Tang²•Institutions (2)

University of Tennessee¹, Southwest Jiaotong University²

01 Jun 2017

TL;DR: It is shown that any non-binary MDS code with optimal repair bandwidth, or optimal rebuilding access, for only systematic nodes can be converted into an M DS code with the corresponding repair optimality for all nodes.

...read moreread less

Abstract: We propose a generic transformation on maximum distance separable (MDS) codes, which can convert any non-binary (k+r, k) MDS code into another (k+r, k) MDS code with the following properties: 1) An arbitrarily chosen r nodes will have the optimal repair bandwidth and the optimal rebuilding access, 2) the repair bandwidth and rebuilding access efficiencies of all other nodes are maintained as in the code before the transformation, 3) it uses the same finite field as the code before the transformation, and 4) the sub-packetization is increased only by a factor of r. As two immediate applications of this powerful transformation, we show that 1) any non-binary MDS code with optimal repair bandwidth, or optimal rebuilding access, for only systematic nodes can be converted into an MDS code with the corresponding repair optimality for all nodes; and 2) any non-binary scalar MDS code can be converted to an MDS code with optimal repair bandwidth and rebuilding access for all nodes, or to an MDS code with optimal rebuilding access for all systematic nodes and moreover with the optimal sub-packatization, by applying the transformation multiple times.

...read moreread less

64 citations

Collapse

Explicit Constructions of High-Rate MDS Array Codes With Optimal Repair Bandwidth

Citations

Cites background from "Explicit Constructions of High-Rate..."

Cites background or methods or result from "Explicit Constructions of High-Rate..."

References

"Explicit Constructions of High-Rate..." refers methods in this paper

"Explicit Constructions of High-Rate..." refers background or methods in this paper

Related Papers (5)