On the Locality of Codeword Symbols

Parikshit Gopalan; Cheng Huang; Huseyin Simitci; Sergey Yekhanin

Electronic Colloquium on Computational Complexity

Under the auspices of the Computational Complexity Foundation (CCF)

REPORTS > DETAIL:

Paper:

TR11-100 | 20th July 2011 19:25

On the Locality of Codeword Symbols

TR11-100 Authors: Parikshit Gopalan, Cheng Huang, Huseyin Simitci, Sergey Yekhanin
Publication: 26th July 2011 20:32
Downloads: 3908

Keywords:

Abstract:

Consider a linear $[n,k,d]_q$ code $\mc{C}.$ We say that that $i$-th coordinate of $\mc{C}$ has locality $r,$ if the value at this coordinate can be recovered from accessing some other $r$ coordinates of $\mc{C}.$ Data storage applications require codes with small
redundancy, low locality for information coordinates, large distance, and low locality for parity coordinates. In this paper we carry out an in-depth study of the relations between these parameters.

We establish a tight bound for the redundancy $n-k$ in terms of the message length, the distance, and the locality of information
coordinates. We refer to codes attaining the bound as optimal. We prove some structure theorems about optimal codes, which are
particularly strong for small distances. This gives a fairly complete picture of the tradeoffs between codewords length, worst-case distance and locality of information symbols.

We then consider the locality of parity check symbols and erasure correction beyond worst case distance for optimal codes. Using our structure theorem, we obtain a tight bound for the locality of parity symbols possible in such codes for a broad class of parameter settings. We prove that there is a tradeoff between having good locality for parity checks and the ability to correct erasures beyond
the minimum distance.

ISSN 1433-8092 | Imprint