A Discrete Data Dividing Approach for Erasure-Code-Based Storage Applications

Weidong Sun,Yijie Wang,Yongquan Fu,Xiaoqiang Pei
DOI: https://doi.org/10.1109/SOSE.2014.44
2014-01-01
Abstract:Erasure codes are promising for improving the reliability of the storage system due to its space efficiency compared to the replication methods. Traditional erasure codes split data into equalsized data blocks and encode strips in different data blocks. This brings heavy repairing traffic when clients read parts of the data, since most strips read for repairing are not in the expected blocks. This paper proposes a novel discrete data dividing method to completely avoid this problem. The key idea is to encode strips from the same data block. We could see that for repairing failed blocks, the strips to be read are either in the same data block with corrupted strips or from the encoded strips. Therefore, no data is wasted. We design and implement this data layout into a HDFS-like storage system. Experiments over a small-scale testbed shows that the proposed discrete data divided method avoids downloading data blocks that are not needed for clients during the repairing operations.
What problem does this paper attempt to address?