Cloud Storage Management Technology for Small File Based on Two-Dimensional Packing Algorithm

Zhiyun Zheng, Shaofeng Zhao, Xingjin Zhang, Zhenfei Wang, Liping Lu
DOI: https://doi.org/10.1007/978-3-319-01766-2_97
2013-01-01
Abstract: To improve the storage efficiency of small files in cloud storage systems based on HDFS (Hadoop Distributed File System), this paper proposes a merging approach based on a two-dimensional packing algorithm, called TDPHDFS (two-dimensional packing for HDFS). The approach considers file size and arrival time together when merging small files into large ones. Simulation results show that the storage efficiency of small files improves while stability remains unchanged and fewer resources are consumed. TDPHDFS effectively reduces the storage-space and memory overhead of managing massive numbers of small files.
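The abstract does not spell out the packing procedure, so the following is only a minimal sketch of the general idea of merging small files along two dimensions (size and arrival time), not the authors' TDPHDFS algorithm. It assumes a first-fit-decreasing heuristic, a fixed block capacity, and a maximum arrival-time spread per merged block; the names `SmallFile`, `MergedBlock`, `pack`, `capacity`, and `window` are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class SmallFile:
    name: str
    size: int        # file size in bytes (first packing dimension)
    arrival: float   # arrival time in seconds (second packing dimension)


@dataclass
class MergedBlock:
    capacity: int    # assumed size limit of one merged file
    window: float    # assumed maximum arrival-time spread within a block
    files: List[SmallFile] = field(default_factory=list)
    used: int = 0

    def fits(self, f: SmallFile) -> bool:
        # Size constraint: the merged file must not exceed the capacity.
        if self.used + f.size > self.capacity:
            return False
        if not self.files:
            return True
        # Time constraint: all files in a block must arrive within `window`.
        earliest = min(min(x.arrival for x in self.files), f.arrival)
        latest = max(max(x.arrival for x in self.files), f.arrival)
        return latest - earliest <= self.window

    def add(self, f: SmallFile) -> None:
        self.files.append(f)
        self.used += f.size


def pack(files: List[SmallFile], capacity: int, window: float) -> List[MergedBlock]:
    """First-fit-decreasing packing constrained on both size and arrival time."""
    blocks: List[MergedBlock] = []
    # Largest files first; ties broken by earlier arrival.
    for f in sorted(files, key=lambda x: (-x.size, x.arrival)):
        if f.size > capacity:
            raise ValueError(f"{f.name} is larger than the block capacity")
        for block in blocks:
            if block.fits(f):
                block.add(f)
                break
        else:
            # No existing block satisfies both constraints: open a new one.
            block = MergedBlock(capacity, window)
            block.add(f)
            blocks.append(block)
    return blocks
```

Calling `pack(files, capacity, window)` with a list of `SmallFile` records returns the merged blocks; each block would then be written to HDFS as one large file, so the NameNode tracks one entry per block rather than one per small file.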