Invader Job: A Kind of Malicious Failure Job on Hadoop YARN

Lijing Cheng, Qingni Shen, Chuntao Dong
DOI: https://doi.org/10.1109/ICC.2018.8422331
2018-01-01
Abstract: Unexpected job failures can occur on any distributed computing platform, and normally they have no obvious effect on system performance. In Hadoop YARN, however, we find that an Invader Job, a malicious failure job caused by inappropriately set user-definable parameters, can greatly reduce system performance. We identify two vulnerabilities in Hadoop YARN that can be used to construct an invader job. First, it is easy to make a job fail by setting user-definable parameters inappropriately. Second, YARN neither checks a job before execution nor checks the failure reason before re-attempting it. As a result, an invader job can fail as many times as possible and occupy scheduling resources over and over again. We therefore propose a detection framework called InTect, which employs an SVM to predict invader jobs. Finally, we verify our findings on our lab cluster and on Amazon EMR. The results show that cluster performance degrades by a factor of 4 compared with the normal case. Moreover, the recall of our detection framework exceeds 90%, which indicates that the SVM model discriminates invader jobs well.
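The second vulnerability, re-attempting without checking the failure reason, can be illustrated with a toy model. The sketch below is not YARN's scheduler and is not taken from the paper. The `Task` class, the `occupied_seconds` function, the retry limit of 3, and the numbers for task count and duration are all hypothetical, chosen only to show how blind retries multiply the container time that a job that always fails can hold.

```python
from dataclasses import dataclass


@dataclass
class Task:
    # Hypothetical: how long a container is held before the attempt fails.
    seconds_until_failure: float
    # Hypothetical: True if the failure comes from a bad parameter and will recur.
    deterministic_failure: bool


def occupied_seconds(tasks, max_attempts, check_failure_reason):
    """Total container time consumed by tasks whose every attempt fails."""
    total = 0.0
    for task in tasks:
        for _ in range(max_attempts):
            total += task.seconds_until_failure
            if check_failure_reason and task.deterministic_failure:
                # A retry cannot succeed, so stop after the first attempt.
                break
    return total


# Assumed workload: 100 tasks, each failing after 30 seconds, retry limit 3.
tasks = [Task(30.0, True) for _ in range(100)]
blind = occupied_seconds(tasks, max_attempts=3, check_failure_reason=False)
checked = occupied_seconds(tasks, max_attempts=3, check_failure_reason=True)
print(blind, checked)
```

In this model the wasted time grows linearly with the retry limit when the failure reason is ignored, and stays at one attempt per task when deterministic failures are recognized. Real clusters add queueing and scheduling effects that this sketch leaves out.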