Design and Implementation of Data Encryption in Cloud Based on HDFS

Zhonghan Cheng,Hao Huang,Diming Zhang,Zhenjiang Qian
DOI: https://doi.org/10.2991/ccis-13.2013.64
2013-01-01
Abstract:As an open-source distributed programming framework, Hadoop has gradually become popular in industry recently. Its distributed file system (HDFS) enables storing large data with advantages of high fault tolerance and throughput. However, the fact that the current HDFS does not support intra-cloud data encryption yet makes data privacy becomes a key security issue. This paper presents ahybrid encryption method based on HDFS. We adopt symmetric encryption to encrypt and decrypt file blocks at datanodes and use asymmetric encryption scheme to protect the symmetric keys. By this method, we can prevent datanode intruders from stealing user data, while ensuring that clients are lightweight. The experiments show that with and without block encryption algorithm, our solution brings43% and 2% performance degradation compared to the generic HDFS.
What problem does this paper attempt to address?