HS6: an Efficient H-Code RAID-6 Scaling by Optimizing Data Migrating and Parity Updating
Beijing Information Science and Technology University,You Xindong,Lv Xueqiang,Academy of Plateau Science and Sustainability
DOI: https://doi.org/10.1007/s11227-021-03739-y
IF: 3.3
2021-01-01
The Journal of Supercomputing
Abstract:Caused by massive data, large-scale data centers suffer from insufficient storage capacity. This increases the risk of component damage and disk failure. To enhance storage capacity of data centers, scaling has turned out to be one of the most popular choices. RAID-6 is deployed extensively that has superior reliability, availability, and scalability compared to other RAID levels. However, higher reliability also means expensive parity data update cost. To address this issue, this paper proposes an efficient RAID-6 scaling scheme, HS6, for H-Code. The basic idea of HS6 is reducing the total parity data update cost and saving the total scaling time by optimizing data migration and parity data update. The properties of HS6 can be summarized as follows: (1) it utilizes horizontal data migration to eliminate the horizontal parity data update, and (2) it makes full use of original parity data to cut down the total parity data update cost. Numerical results and experimental data analysis indicate that: (1) HS6 decreases the data migration rate by 76.92% $$\sim$$ 95.31%, 55.56% $$\sim$$ 84.46%, and 14.29% $$\sim$$ 16.67% compared to Round-Robin, Semi-RR, and HCS, (2) HS6 saves the total scaling time by 62.05% $$\sim$$ 76.75%, 53.24% $$\sim$$ 73.94%, and 3.68% $$\sim$$ 6.79% against Round-Robin, Semi-RR, and HCS, respectively, under offline and outperforms Round-Robin by 59.99% $$\sim$$ 79.02%, Semi-RR by 53.04% $$\sim$$ 76.41%, and HCS by 0.56% $$\sim$$ 23.78% in total scaling time under four workloads, and (3) HS6 maintains almost the same user average response time with Round-Robin, Semi-RR, and HCS during scaling and after scaling.