DESIGN AND DEMONSTRATION OF MULTI-CODES FRAMEWORK FOR COLD/HOT DATA STORAGE

Xuecai Wei,Qingyuan Gong,Jiajie Shen,Yangfan Zhou,Xin Wang
DOI: https://doi.org/10.3969/j.issn.1000-386x.2017.02.006
2017-01-01
Abstract:With the rapid development of the Internet and the explosive growth of data,large-scale distributed storage systems are widely used in Internet application.Recent Internet applications usually involve different types of data,and data can be considered as hot data or cold data based on their access frequency.However,a storage system with erasure codes is generally implemented with a fixed coding mechanism,which cannot adapt well to the diverse types of data coexisting in the same system.As a result,the system performance may greatly degrade.Thus,a new storage system framework is suggested to improve the system performance based on multiple codes,considering the difference between hot and cold data.For cold data,it can adopt a low-redundancy coding mechanism to improve space efficiency.For hot data,in contrast,it can reduce the data access time by taking a code that can be rapidly decoded.Then,real-world implementations of such a framework based on HDFS-RAID are designed,which is deployed in a Hadoop tested cluster.Besides,based on a real-world data access trace,the effectiveness of our system in improving the system performance is verified.The results show that the system can adapt well to the diverse types of data.
What problem does this paper attempt to address?