High-integrity MapReduce computation in cloud with speculative execution

Jing Xiao, Zhiwei Xiao
DOI: https://doi.org/10.1007/978-3-642-24999-0_55
2011-01-01
Abstract: Cloud computing involves processing huge amounts of data using massively distributed computing resources. However, the massive and distributed nature of cloud computing also makes the integrity of computation easy to break, either through deliberate attacks or through unintentional machine failures. In this paper, we propose to provide a high-integrity feature for MapReduce computation using speculative execution. The key idea of our approach is to selectively replicate MapReduce tasks on a random computation node and compare the hashes of the execution results to determine whether the integrity of a task has been compromised. A preliminary prototype, called Nessaj, has been implemented on the Hadoop MapReduce framework. Experimental results show that Nessaj can detect and recover from our randomly injected attacks with high probability. The performance overhead is also moderate.
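The key idea stated in the abstract (replicate a selected task on a randomly chosen node, then compare result hashes) can be illustrated with a minimal sketch. This is not Nessaj's implementation and does not use any Hadoop API. The function names, the `replication_prob` parameter, the use of SHA-256, and the modeling of nodes as plain Python callables are all assumptions made for illustration. The sketch covers detection only, not recovery.

```python
import hashlib
import random


def result_digest(records):
    """Hash a task's output; records are sorted so ordering does not matter."""
    h = hashlib.sha256()  # hash function is an assumption, not from the paper
    for key, value in sorted(records):
        h.update(repr((key, value)).encode("utf-8"))
    return h.hexdigest()


def word_count_map(text):
    """Toy map task: count the words in one input split."""
    counts = {}
    for word in text.split():
        counts[word] = counts.get(word, 0) + 1
    return list(counts.items())


def honest_node(task, data):
    """Hypothetical worker that runs the task faithfully."""
    return task(data)


def faulty_node(task, data):
    """Hypothetical worker that tampers with every count."""
    return [(key, value + 1) for key, value in task(data)]


def run_with_verification(task, data, nodes, replication_prob, rng):
    """Run a task on a random node and, with probability replication_prob,
    re-run it on a different random node and compare the output hashes.

    Returns (output, verdict), where verdict is True if the hashes match,
    False if they differ, and None if the task was not selected for
    replication.
    """
    primary_idx = rng.randrange(len(nodes))
    output = nodes[primary_idx](task, data)

    # Selective replication: only a fraction of tasks is re-executed.
    if rng.random() >= replication_prob:
        return output, None

    other_indices = [i for i in range(len(nodes)) if i != primary_idx]
    replica_idx = rng.choice(other_indices)
    replica_output = nodes[replica_idx](task, data)

    return output, result_digest(output) == result_digest(replica_output)


if __name__ == "__main__":
    rng = random.Random(0)
    nodes = [honest_node, faulty_node]
    _, verdict = run_with_verification(word_count_map, "a b a", nodes, 1.0, rng)
    print("hashes match:", verdict)
```

A mismatch only shows that one of the two executions is wrong, not which one. A real system would need a further step, such as re-running the task on additional nodes, to decide which result to keep.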