Employing Checkpoint to Improve Job Scheduling in Large-Scale Systems

Shuangcheng Niu,Jidong Zhai,Xiaosong Ma,Mingliang Liu,Yan Zhai,Wenguang Chen,Weimin Zheng
DOI: https://doi.org/10.1007/978-3-642-35867-8_3
2013-01-01
Abstract:The FCFS-based backfill algorithm is widely used in scheduling high-performance computer systems. The algorithm relies on runtime estimate of jobs which is provided by users. However, statistics show the accuracy of user-provided estimate is poor. Users are very likely to provide a much longer runtime estimate than its real execution time.
What problem does this paper attempt to address?