Proactive self‐healing techniques for cloud computing: A systematic review

Seyed Reza Rouholamini,Meghdad Mirabi,Razieh Farazkish,Amir Sahafi
DOI: https://doi.org/10.1002/cpe.8246
2024-08-21
Concurrency and Computation Practice and Experience
Abstract:Summary Ensuring the seamless operation of cloud computing services is paramount for meeting user demands and ensuring business continuity. Fault‐tolerant self‐healing techniques play a crucial role in enhancing the reliability and availability of cloud platforms, minimizing downtime and ensuring uninterrupted service delivery. This article systematically categorizes and analyzes existing research on fault‐tolerant self‐healing techniques published between 2005 and 2024. We provide a comprehensive technical taxonomy organizing self‐healing techniques based on fault tolerance processes, encompassing considerations for both reliability and availability. Additionally, we evaluate applications of proactive self‐healing techniques, highlighting their achievements, and limitations in enhancing service continuity. Strategies to address identified weaknesses are discussed, alongside future research challenges and open issues in the domain of cloud resilience. Through this analysis, the article contributes to understanding self‐healing techniques in cloud computing, offering insights into their effectiveness in ensuring service continuity. The findings aim to guide future research efforts in developing more robust and resilient cloud infrastructures, ultimately enhancing overall service reliability and availability. By emphasizing the importance of fault tolerance and self‐healing techniques, this article lays the foundation for advancing the state‐of‐the‐art in cloud computing.
computer science, theory & methods, software engineering
What problem does this paper attempt to address?