The Vision of Autonomic Computing: Can LLMs Make It a Reality?

Zhiyang Zhang,Fangkai Yang,Xiaoting Qin,Jue Zhang,Qingwei Lin,Gong Cheng,Dongmei Zhang,Saravan Rajmohan,Qi Zhang
2024-07-19
Abstract:The Vision of Autonomic Computing (ACV), proposed over two decades ago, envisions computing systems that self-manage akin to biological organisms, adapting seamlessly to changing environments. Despite decades of research, achieving ACV remains challenging due to the dynamic and complex nature of modern computing systems. Recent advancements in Large Language Models (LLMs) offer promising solutions to these challenges by leveraging their extensive knowledge, language understanding, and task automation capabilities. This paper explores the feasibility of realizing ACV through an LLM-based multi-agent framework for microservice management. We introduce a five-level taxonomy for autonomous service maintenance and present an online evaluation benchmark based on the Sock Shop microservice demo project to assess our framework's performance. Our findings demonstrate significant progress towards achieving Level 3 autonomy, highlighting the effectiveness of LLMs in detecting and resolving issues within microservice architectures. This study contributes to advancing autonomic computing by pioneering the integration of LLMs into microservice management frameworks, paving the way for more adaptive and self-managing computing systems. The code will be made available at <a class="link-external link-https" href="https://aka.ms/ACV-LLM" rel="external noopener nofollow">this https URL</a>.
Artificial Intelligence,Computation and Language,Distributed, Parallel, and Cluster Computing,Multiagent Systems,Software Engineering
What problem does this paper attempt to address?
The paper aims to explore the potential and feasibility of Large Language Models (LLMs) in achieving the vision of Autonomic Computing (ACV), especially in the application of microservices management. The core objective of the paper is to address the issue of self-management in modern computing systems by constructing a multi-agent framework based on LLMs. Specifically, the authors propose the following points: 1. **Identifying Challenges**: The paper first outlines the challenges faced in realizing the vision of autonomic computing, which mainly stem from the dynamism and complexity of modern computing systems. 2. **Introducing Solutions**: Given the capabilities of LLMs in knowledge, language understanding, and task automation, the authors explore how to leverage these abilities to overcome the challenges in autonomic computing. 3. **Designing Framework**: The paper proposes a multi-tiered agent architecture based on LLMs for microservices management. This architecture includes high-level group managers and low-level autonomic agents, which are responsible for handling tasks across service components and specific service component tasks, respectively. 4. **Evaluation System**: To assess the effectiveness of this framework, the paper establishes an online evaluation benchmark based on the Sock Shop microservices project and defines a classification system for autonomous service maintenance with five levels. The evaluation tasks cover aspects ranging from simple command execution to complex fault detection, root cause analysis, and automatic repair. 5. **Experimental Results**: Experiments on the Sock Shop project revealed that the framework could achieve the third level of autonomy, meaning it can automatically detect issues and execute specific tasks to some extent, but there is room for improvement in root cause analysis and problem mitigation. In summary, this paper attempts to advance the development of autonomic computing by harnessing the power of LLMs, particularly for self-management and self-adaptation capabilities within microservice architectures. A series of experiments validated the effectiveness of this approach and provided new directions for future research in autonomic computing.