Hdump: A Data Recovery Tool For Hadoop

Zhongsheng Li, Qiuhong Li, Wei Wang, Qitong Wang, Fengbin Qi, Yimin Liu, Peng Wang
DOI: https://doi.org/10.1007/978-3-319-91458-9_56
2018-01-01
Abstract: Hadoop is a popular distributed framework for massive data processing, and HDFS is its underlying file system. More and more companies use Hadoop as their data processing platform. Once Hadoop crashes, the data stored in HDFS cannot be accessed directly. We present HDUMP, a lightweight bypassing file system that aims to recover the data stored in HDFS when Hadoop crashes.