DCache: A Distributed Cache Mechanism for HDFS based on RDMA

Xiao Zhang,Binbin Liu,Zian Gou,Jia Shi,Xiaonan Zhao
DOI: https://doi.org/10.1109/HPCC-SmartCity-DSS50907.2020.00035
2020-01-01
Abstract:Hadoop Distributed File System (HDFS) is an important component of Hadoop, which provides data storage service. The performance of the IO subsystem has a great influence on data processing efficiency. In the Hadoop system, computing jobs are scheduled to the nodes where the data is located to reduce IO time. The job scheduler and data distribution have a great impact on the efficiency of data proc...
What problem does this paper attempt to address?