HDStore: An SSD/HDD Hybrid Distributed Storage Scheme for Large-Scale Data

Zhijie Feng,Zhiyong Feng,Xin Wang,Guozheng Rao,Yazhou Wei,Zhiyuan Li
DOI: https://doi.org/10.1007/978-3-319-11538-2_20
2014-01-01
Abstract:Traditional data storage schemes are primarily based upon Hard Disk Drives (HDD). However, with the appearance of large amount of data on the Web, the read/write performance based on HDD has reached a bottleneck. Thus the emerging of Solid State Drives (SSD) has provided an opportunity for the storage of the Web of data. In this paper, we propose an SSD/HDD hybrid distributed storage scheme, called HDStore, for large-scale data, in which the single fix-sized journal file using the append-only mode is stored on SSD to support efficient read and write, while several segment files focusing on read are stored on HDD. Through a series of operations build, split, move, and merge between the journal and segment files, we constructed HDStore storage scheme based on JS-model. The experimental results show that HDStore obtains an efficient optimization of data read/write, especially the write performance has increased by 15 % compared to the traditional HDD-based scheme.
What problem does this paper attempt to address?