A Wide-Area Collaborative Scheduling System Oriented to Big Data Processing Applications

ZHANG Chenhao,XIAO Limin,QIN Guangjun,SONG Yao,JIANG Shixuan,WANG Jiye
DOI: https://doi.org/10.11959/j.issn.2096-0271.2021050
IF: 3.3
2021-01-01
Big Data Research
Abstract:Based on the high-performance computing global virtual data space system, a wide-area collaborative scheduling system for big data processing applications was designed and implemented.This system can address the issue of how big data processing applications unified use wide-area storage and computing resources.And it can collaborative schedule of application data and computing tasks based on the computing characteristics of the application and data layout through collaborative scheduling, load balancing scheduling, data locality scheduling strategies.By unified scheduling of application data and computing tasks in the wide-area environment, it can coordinate the utilization of wide-area computing and storage resources, and effectively improve the running performance of big data processing applications.The actual test results in the national high-performance computing environment show that the scheduling method proposed can support big data processing applications effectively, and the running efficiency of typical applications such as wide-area target collaborative recognition and molecular docking can be increased by 3~4 times.
What problem does this paper attempt to address?