Reliability Provision Mechanism for Large-Scale De-Duplication Storage Systems

GU Yu, LIU Chuanyi, SUN Linchun, YAN Bin, WANG Dongsheng, JU Dapeng
DOI: https://doi.org/10.16511/j.cnki.qhdxxb.2010.05.027
2010-01-01
Abstract: Data de-duplication improves storage utilization by storing duplicate data chunks only once, but at the cost of error resilience and reliability. This paper presents a high-reliability provision mechanism for large-scale de-duplication storage systems (RLDS for short). RLDS packages variable-size data chunks into fixed-size objects, encodes the objects with an error-correcting code (ECC), and distributes them among the storage nodes of a redundancy group, which is generated dynamically according to the actual failure domains. Upon failures, RLDS uses a distributed recovery process. Experimental results show that RLDS provides the same storage utilization as RAID schemes, as well as reliability comparable to replication-based schemes that use more redundancy. The results also show that the average recovery time of RLDS-based configurations is much shorter than that of RAID schemes, and that RLDS can provide run-time dynamic load balancing by recovering the data of overloaded nodes onto lightly loaded nodes.
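The packaging and encoding steps described in the abstract can be illustrated with a minimal sketch. It is not the RLDS implementation: the abstract does not specify the object size, the code used, or the group layout. The 16-byte `OBJECT_SIZE`, the function names `pack_chunks`, `xor_parity`, and `recover`, and the use of single XOR parity as a stand-in for the ECC are all assumptions made for illustration.

```python
from functools import reduce

OBJECT_SIZE = 16  # bytes; hypothetical value, kept small for illustration


def pack_chunks(chunks, object_size=OBJECT_SIZE):
    """Concatenate variable-size chunks and cut the stream into fixed-size objects.

    The last object is zero-padded. Returns (objects, index), where index[i]
    is the (offset, length) of chunk i in the concatenated stream.
    """
    index = []
    offset = 0
    for chunk in chunks:
        index.append((offset, len(chunk)))
        offset += len(chunk)
    stream = b"".join(chunks)
    # Round the stream length up to a multiple of object_size.
    padded_len = -(-len(stream) // object_size) * object_size
    stream = stream.ljust(padded_len, b"\0")
    objects = [stream[i:i + object_size] for i in range(0, padded_len, object_size)]
    return objects, index


def xor_parity(objects):
    """XOR equal-length objects byte by byte (assumed stand-in for the ECC)."""
    return bytes(reduce(lambda a, b: a ^ b, column) for column in zip(*objects))


def recover(surviving_objects, parity):
    """Rebuild the single missing object from the survivors and the parity object."""
    return xor_parity(surviving_objects + [parity])


# Usage: pack four chunks, compute parity, then rebuild a lost object.
chunks = [b"alpha", b"be", b"gamma-delta", b"epsilon!"]
objects, index = pack_chunks(chunks)
parity = xor_parity(objects)
rebuilt = recover(objects[1:], parity)  # pretend objects[0] was lost
assert rebuilt == objects[0]
```

In this sketch each data object and the parity object would be placed on a different node of the redundancy group, so that losing any one node leaves enough information to rebuild its object. A real deployment would likely use a code that tolerates more than one failure.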