Design and Realization of the Cloud Data Backup System Based on HDFS.

Dong Guo,Yong Du,Qiang Li,Liang Hu
DOI: https://doi.org/10.1007/978-3-642-24273-1_54
2011-01-01
Abstract:Based on cloud storage software HDFS, this paper has designed a cloud data backup system. Clients are divided into several groups and served by different servers so as to build backup/restore load balance. When backup server upload data to HDFS cluster or download data from HDFS cluster, it takes restore priority, conflict detection upload's strategies to reduce the network transmission pressure. To meet the feature that HDFS is suitable for large file's storage, backup server has combined small files to upload by setting a threshold, thus enhancing system performance. The HDFS-based cloud backup system designed by this paper has certain advantages on the aspects of safety, extendibility, economic efficiency and reliability.
What problem does this paper attempt to address?