FAGR: an Efficient File-aware Graph Recovery Scheme for Erasure Coded Cloud Storage Systems.

Heming Zeng, Chi Zhang, Chentao Wu, Gen Yang, Jie Li, Guangtao Xue, Minyi Guo
DOI: https://doi.org/10.1109/iccd50377.2020.00033
2020-01-01
Abstract: With the explosive growth of data in cloud storage systems, Erasure Codes (ECs) have become a typical data redundancy technology because of their low storage cost and high reliability. However, because recovery involves a large amount of complex computation and transmission among massive data and parity blocks, recovering lost data in erasure coded storage systems incurs high I/O latency. Although several fast recovery approaches aim to reduce recovery time at the application level or device level, the performance of file level recovery is still restricted. This is because part of the complicated relationships among data, parity, and files is ignored in the design of the recovery process. To address these problems, we propose a novel File-aware Graph Recovery (FAGR) scheme to improve file level recovery performance during the reconstruction process. The key idea of FAGR is to build a graph of the mappings among files, blocks, stripes, parities, and nodes, together with the access frequencies of files, and to use it to guide the recovery process from the file point of view. A corresponding model is established to analyze the cost efficiency of the recovery process, which guarantees that FAGR reconstructs popular files first to accelerate recovery. To demonstrate the effectiveness of FAGR, we conduct several numerical analyses and experiments in clusters. The results show that, compared to typical fast recovery methods, FAGR reduces the average response time of files by up to 81.63% and improves throughput by up to 4.44×.
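The file-aware ordering idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm or cost model: the function name `plan_recovery`, the dictionary layout, and the tie-breaking rule are all assumptions made for illustration. It only shows how a file-to-stripe mapping plus access frequencies could yield a schedule that repairs popular files first.

```python
from collections import defaultdict


def plan_recovery(file_blocks, block_location, access_freq, failed_node):
    """Hypothetical sketch: order stripe repairs so popular files come first.

    file_blocks:    file name -> list of block ids
    block_location: block id -> (stripe id, node id)
    access_freq:    file name -> access count (higher means more popular)
    """
    # Edges of the graph restricted to the failure:
    # file -> stripes that lost one of the file's blocks.
    lost_stripes = defaultdict(list)
    for name, blocks in file_blocks.items():
        for block in blocks:
            stripe, node = block_location[block]
            if node == failed_node and stripe not in lost_stripes[name]:
                lost_stripes[name].append(stripe)

    # Assumed policy: most accessed files first; ties go to the file
    # with fewer stripes to repair, then to the file name.
    file_order = sorted(
        lost_stripes,
        key=lambda name: (-access_freq.get(name, 0), len(lost_stripes[name]), name),
    )

    # Flatten into a stripe schedule, repairing each stripe only once.
    stripe_order, seen = [], set()
    for name in file_order:
        for stripe in lost_stripes[name]:
            if stripe not in seen:
                seen.add(stripe)
                stripe_order.append(stripe)
    return file_order, stripe_order


# Toy cluster (made-up data): node "n1" fails.
file_blocks = {"a.log": ["b1", "b2"], "b.db": ["b3"], "c.img": ["b4", "b5"]}
block_location = {
    "b1": ("s1", "n1"), "b2": ("s2", "n2"), "b3": ("s2", "n1"),
    "b4": ("s3", "n1"), "b5": ("s1", "n3"),
}
access_freq = {"a.log": 5, "b.db": 40, "c.img": 12}
files, stripes = plan_recovery(file_blocks, block_location, access_freq, "n1")
print(files, stripes)
```

In this toy input the most frequently accessed file, `b.db`, is scheduled first even though its stripe would come later in a plain stripe-by-stripe scan. A real scheme would also weigh the repair cost of each stripe, which this sketch reduces to a simple tie-break.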