OPTIMISING DISK READ FOR NODE FAILURE RECOVERY OF RDP STORAGE SYSTEMS

Yan Zhang,Yunfeng Zhu,Fang Niu,Yinlong Xu
DOI: https://doi.org/10.3969/j.issn.1000-386x.2015.06.025
2015-01-01
Abstract:Real distributed storage systems face frequent disk failures.In order to ensure data reliability,erasure codes are widely deployed to large-scale storage systems.In an erasure coded-based storage system,to quickly and efficiently recover the lost data in failed disk has great significance to maintaining the data reliability.In this paper,we study the disk failure recovery problem for storage systems with RDP,one of the most important double-fault tolerant erasure codes,and seek to optimise the disk access sequentiality during the failure recovery.The recovery scheme for single disk failure we proposed avoids to the greatest extent the random reads of disk data on the premise of ensuring the minimum amount of data being read,and also preserve the sequentiality of data reads.Through the experiments in a real distributed storage system we validate the practical performance of the proposed recovery scheme,it approves that our recovery algorithm can well meliorate the problem of recovery speed slowing down in hybrid recovery scheme caused by random disk accesses and eventually improves recovery efficiency.
What problem does this paper attempt to address?