Skewed Data Distribution for Active Storage Systems on Hybrid Servers

Xiangyu Li, Shuibing He, Xianbin Xu
DOI: https://doi.org/10.14257/ijgdc.2016.9.5.10
2016-01-01
International Journal of Grid and Distributed Computing
Abstract: With the popularity of new storage technologies, hybrid active storage systems provide an efficient way to improve the performance of high-performance computing applications. However, current active storage efforts have neglected the storage performance gap between heterogeneous servers, which significantly affects overall system performance. In this paper, we propose SDD, a Skewed Data Distribution scheme for hybrid active storage systems. In contrast to traditional even data distribution schemes, SDD places a skewed amount of data on each server according to its performance. We have implemented a prototype of our proposed data layout scheme in a parallel I/O system and demonstrated its benefits with a typical data processing application. Experimental results show that our proposed data placement scheme can significantly improve overall active storage system performance.
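The abstract does not give SDD's actual placement formula. As a rough illustration only, the sketch below assumes the simplest possible rule: each server receives a share of the data proportional to its measured throughput. The function name `skewed_split`, the integer throughput values, and the proportional rule itself are all hypothetical and are not taken from the paper.

```python
def skewed_split(total_bytes, throughputs):
    """Split total_bytes across servers in proportion to throughput.

    Assumption (not from the paper): throughputs are positive integers,
    e.g. MB/s measured per server, and share is linear in throughput.
    """
    if not throughputs or any(t <= 0 for t in throughputs):
        raise ValueError("need at least one server with positive throughput")

    total = sum(throughputs)
    # Integer floor of each server's proportional share.
    shares = [total_bytes * t // total for t in throughputs]

    # Flooring can leave a few bytes unassigned; hand them out one
    # at a time, starting with the fastest servers.
    leftover = total_bytes - sum(shares)
    fastest_first = sorted(
        range(len(throughputs)), key=lambda i: throughputs[i], reverse=True
    )
    for i in fastest_first[:leftover]:
        shares[i] += 1
    return shares


if __name__ == "__main__":
    # Two slower servers (100 units/s) and one faster server (300 units/s).
    print(skewed_split(1000, [100, 100, 300]))
```

Under this assumed rule, and if per-server processing time is taken to be simply data size divided by throughput, the three servers in the example each finish in 2 time units. An even split would leave the slower servers with about 333 units each and a finish time of about 3.3 time units, which is the kind of imbalance a skewed layout is meant to avoid.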