A Study of Service Reliability and Availability for Distributed Systems

Ys Dai,M Xie,Kl Poh,Gq Liu
DOI: https://doi.org/10.1016/s0951-8320(02)00200-4
IF: 7.247
2003-01-01
Reliability Engineering & System Safety
Abstract:Distributed systems are usually designed and developed to provide certain important services such as in computing and communication systems. In this paper, a general model is presented for a centralized heterogeneous distributed system, which is widely used in distributed system design. Based on this model, the distributed service reliability which is defined as the probability of successfully providing the service in a distributed environment, an important performance measure for this type of systems, is investigated. An application example is used to illustrate the procedure. Furthermore, with the help of the model, various issues such as the release time to achieve a service reliability requirement, and the sensitivity of model parameters are studied. This type of analysis is important in the application of this type of models.
What problem does this paper attempt to address?