Online System for Grid Resource Monitoring and Machine Learning-Based Prediction

Liang Hu, Xi-Long Che, Si-Qing Zheng
DOI: https://doi.org/10.1109/tpds.2011.108
IF: 5.3
2012-01-01
IEEE Transactions on Parallel and Distributed Systems
Abstract: Resource allocation and job scheduling are the core functions of grid computing. Both depend on adequate information about available resources, so acquiring resource status information in a timely manner is of great importance to the overall performance of a grid. In this paper, we present the design and evaluation of a distributed system architecture for grid resource monitoring and prediction. We discuss the key issues in system implementation, including machine learning-based methodologies for modeling and optimizing resource prediction models. Evaluations are performed on a prototype system. Our experimental results indicate that the efficiency and accuracy of the system meet the demands of an online system for grid resource monitoring and prediction.