Research on the Framework and Resource Scheduling Mechanisms of Hadoop YARN

Chuntao Dong,Wenting Li,Qingni Shen,Zhonghai Wu
2015-01-01
Abstract:Hadoop 2.0 proposes a new resource management system, named as YARN, which can support a variety of big data computing frameworks(such as MapReduce, Storm, Spark, etc.). YARN has become the current mainstream computing framework to deploy large data platform for many internet corporations(e.g. Alibaba Group, Tencent, etc.). Therefore, this paper analyzes the basic structure and the working lfow of the YARN framework, then focuses on its resource scheduling mechanisms used in the ResourceManager, including its resource management model, management mechanisms and vulnerabilities, and the implemented resource schedulers, the Capacity Scheduler and Fair Scheduler. Final y it also discusses the next generation of Omega scheduler.
What problem does this paper attempt to address?