Enabling Large Language Models to Perform Power System Simulations with Previously Unseen Tools: A Case of Daline

Mengshuo Jia,Zeyu Cui,Gabriela Hug
2024-06-26
Abstract:The integration of experiment technologies with large language models (LLMs) is transforming scientific research, offering AI capabilities beyond specialized problem-solving to becoming research assistants for human scientists. In power systems, simulations are essential for research. However, LLMs face significant challenges in power system simulations due to limited pre-existing knowledge and the complexity of power grids. To address this issue, this work proposes a modular framework that integrates expertise from both the power system and LLM domains. This framework enhances LLMs' ability to perform power system simulations on previously unseen tools. Validated using 34 simulation tasks in Daline, a (optimal) power flow simulation and linearization toolbox not yet exposed to LLMs, the proposed framework improved GPT-4o's simulation coding accuracy from 0% to 96.07%, also outperforming the ChatGPT-4o web interface's 33.8% accuracy (with the entire knowledge base uploaded). These results highlight the potential of LLMs as research assistants in power systems.
Systems and Control,Artificial Intelligence
What problem does this paper attempt to address?
The paper aims to address the issues encountered by large language models (LLMs) in power system simulations. Specifically, although existing large language models possess strong natural language processing capabilities, they face numerous challenges when performing complex power system simulations. The main reasons are the lack of relevant prior knowledge and insufficient understanding of the complexity of power networks. To solve this problem, the authors propose a modular framework that combines domain expertise in power systems with the capabilities of large language models, enabling LLMs to perform power system simulations on tools they have never encountered before. Through validation on the DALINE simulation toolbox, this framework significantly improved the accuracy of LLMs in simulation coding, from 0% to 96.07%, far surpassing ChatGPT-4o using only the standard Retrieval-Augmented Generation (RAG) method (33.82%). This demonstrates the great potential of large language models as research assistants in the field of power systems.