Towards server-side repair for erasure coding-based distributed storage systems

Bo Chen, Anil Kumar Ammula, Reza Curtmola

Research output: Chapter in Book/Report/Conference proceedingConference contribution

13 Scopus citations

Abstract

Erasure coding is one of the main mechanisms to add redundancy in a distributed storage system, by which a file with k data segments is encoded into a file with n coded segments such that any k coded segments can be used to recover the original k data segments. Each coded segment is stored at a storage server. Under an adversarial setting in which the storage servers can exhibit Byzantine behavior, remote data checking (RDC) can be used to ensure that the stored data remains retrievable over time. The main previous RDC scheme to offer such strong security guarantees, HAIL, has an inefficient repair procedure, which puts a high load on the data owner when repairing even one corrupt data segment. In this work, we propose RDC-EC, a novel RDC scheme for erasure code-based distributed storage systems that can function under an adversarial setting. With RDC-EC we offer a solution to an open problem posed in previous work and build the first such system that has an efficient repair phase. The main insight is that RDC-EC is able to reduce the load on the data owner during the repair phase (i.e., lower bandwidth and computation) by shifting most of the burden from the data owner to the storage servers during repair. RDC-EC is able to maintain the advantages of systematic erasure coding: optimal storage for a certain reliability level and sub-file access. We build a prototype for RDC-EC and show experimentally that RDC-EC can handle efficiently large amounts of data.

Original languageEnglish (US)
Title of host publicationCODASPY 2015 - Proceedings of the 5th ACM Conference on Data and Application Security and Privacy
PublisherAssociation for Computing Machinery
Pages281-288
Number of pages8
ISBN (Electronic)9781450331913
DOIs
StatePublished - Mar 2 2015
Externally publishedYes
Event5th ACM Conference on Data and Application Security and Privacy, CODASPY 2015 - San Antonio, United States
Duration: Mar 2 2015Mar 4 2015

Publication series

NameCODASPY 2015 - Proceedings of the 5th ACM Conference on Data and Application Security and Privacy

Other

Other5th ACM Conference on Data and Application Security and Privacy, CODASPY 2015
Country/TerritoryUnited States
CitySan Antonio
Period3/2/153/4/15

All Science Journal Classification (ASJC) codes

  • Information Systems
  • Software
  • Computer Science Applications

Keywords

  • Cloud storage
  • Erasure coding
  • Remote data integrity checking
  • Server-side repair

Fingerprint

Dive into the research topics of 'Towards server-side repair for erasure coding-based distributed storage systems'. Together they form a unique fingerprint.

Cite this