Service Reliability and Performance in Grid System with Star Topology.
Gregory Levitin,Yuan-Shun Dai
DOI: https://doi.org/10.1016/j.ress.2005.11.005
2007-01-01
IIE Transactions
Abstract:The paper considers grid computing systems in which the Resource Management System (RMS) divides a service task into subtasks and then sends the subtasks to different specialized resources for execution. In order to provide the desired level of service reliability the RMS can assign the same subtasks to several independent resources for parallel execution. Because of data dependency, some subtasks cannot be executed until they have received input data that is the result of other subtasks. This imposes precedence constraints on the order of subtask execution. Usually, there are two types of failure in resources and communication channels: permanent failures and transient failures. Permanent failures render the equipment unavailable and whilst transient failures do not make the equipment unavailable they do prevent the completion of the performed actions. Service reliability and performance indices are introduced and a fast numerical algorithm for their evaluation for any given subtask distribution in a grid with a star architecture is presented. This algorithm is based on the universal generating function technique. Illustrative examples are presented.
What problem does this paper attempt to address?