A security framework in G-Hadoop for big data computing across distributed Cloud data centres

Jiaqi Zhao, Lizhe Wang, Jie Tao, Jinjun Chen, Weiye Sun, Rajiv Ranjan, Joanna Kołodziej, Achim Streit, Dimitrios Georgakopoulos
DOI: https://doi.org/10.1016/j.jcss.2014.02.006
IF: 1.043
2014-08-01
Journal of Computer and System Sciences
Abstract: MapReduce is regarded as an adequate programming model for large-scale data-intensive applications. The Hadoop framework is a well-known MapReduce implementation that runs MapReduce tasks on a cluster system. G-Hadoop is an extension of the Hadoop MapReduce framework that allows MapReduce tasks to run on multiple clusters. However, G-Hadoop simply reuses the user authentication and job submission mechanism of Hadoop, which is designed for a single cluster. This work proposes a new security model for G-Hadoop. The security model is based on several security solutions, such as public key cryptography and the SSL protocol, and is designed specifically for distributed environments. The security framework simplifies the user authentication and job submission process of the current G-Hadoop implementation with a single-sign-on approach. In addition, the framework provides a number of security mechanisms to protect the G-Hadoop system from traditional attacks.
computer science, theory & methods, hardware & architecture