Risk-Driven Proactive Fault-Tolerant Operation of IaaS Providers

Jordi Guitart, Mario Macias, Karim Djemame, Tom Kirkham, Ming Jiang, Django Armstrong
DOI: https://doi.org/10.1109/cloudcom.2013.62
2013-01-01
Abstract: To improve service execution in Clouds, Cloud Infrastructure management has to take measures to adhere to Service Level Agreements and Business Level Objectives, from the application layer through to how services are supported at the lowest hardware levels. In this paper, a risk model methodology and a holistic management approach are developed specifically for the operation of the Cloud Infrastructure Provider and are applied through improvements to SLA fault tolerance in Cloud Infrastructure. Risk assessments are used to analyse execution-specific data from the Cloud Infrastructure and are linked to a business-driven holistic management component that is part of a Cloud Manager. Initial results show that applying our combined risk-based fault tolerance approach improves eco-efficiency and virtual machine availability and reduces SLA failures across the whole Cloud infrastructure.