A RAMCloud Storage System based on HDFS: Architecture, implementation and evaluation
Yifeng Luo,Siqiang Luo,Jihong Guan,Shuigeng Zhou
DOI: https://doi.org/10.1016/j.jss.2012.11.025
IF: 3.5
2013-03-01
Journal of Systems and Software
Abstract:Few cloud storage systems can handle random read accesses efficiently. In this paper, we present a RAMCloud Storage System, RCSS, to enable efficient random read accesses in cloud environments. Based on the Hadoop Distributed File System (HDFS), RCSS integrates the available memory resources in an HDFS cluster to form a cloud storage system, which backs up all data on HDFS-managed disks, and fetches data from disks into memory for handy accesses when files are opened for read or specified by users for memory storage. We extend the storage capacity of RCSS to that of the substrate disk-based HDFS by multiplexing all the available memory resources. Furthermore, RCSS supports MapReduce, which is a popular cloud computing paradigm. By serving data from memory instead of disks, RCSS can yield high random I/O performance with low latency and high throughput, and can achieve good availability and scalability as HDFS.
computer science, theory & methods, software engineering