A General Framework for Load Forecasting based on Pre-trained Large Language Model

Mingyang Gao, Suyang Zhou, Wei Gu, Zhi Wu, Haiquan Liu, Aihua Zhou
2024-09-03
Abstract: Accurate load forecasting is crucial for maintaining the power balance between generators and consumers, particularly with the increasing integration of renewable energy sources, which introduce significant intermittent volatility. With the advancement of data-driven methods, machine learning and deep learning models have become the predominant approaches for load forecasting tasks. In recent years, pre-trained large language models (LLMs) have achieved significant progress, demonstrating superior performance across various fields. This paper proposes a load forecasting method based on LLMs, offering not only precise predictive capabilities but also broad and flexible applicability. Additionally, a data modeling method is introduced to effectively transform load sequence data into natural language suitable for LLM training. Furthermore, a data enhancement strategy is designed to mitigate the impact of LLM hallucinations on forecasting results. The effectiveness of the proposed method is validated using two real-world datasets. Compared to existing methods, our approach demonstrates state-of-the-art performance across all validation metrics.
Systems and Control
### What problems does this paper attempt to solve?

This paper addresses key challenges in power load forecasting, especially as more renewable energy is integrated into the grid. Specifically, it focuses on the following problems:

1. **Improving prediction accuracy**: Accurate load forecasting is crucial for maintaining the power balance between the generation side and the consumption side. With the introduction of renewable energy, the power load shows significant intermittency and volatility, which makes prediction harder. Traditional machine-learning and deep-learning methods have improved prediction accuracy to some extent, but there is still room for improvement.
2. **Taking advantage of pre-trained large language models (LLMs)**: In recent years, pre-trained large language models (such as BERT and GPT) have made remarkable progress in natural-language-processing tasks and demonstrated the ability to capture long-range dependencies. However, these models have not yet been widely applied to power load forecasting. The paper proposes a load-forecasting framework based on pre-trained LLMs to make use of these capabilities.
3. **Data modeling and enhancement**: To enable LLMs to process power-load sequence data effectively, the paper introduces a data-modeling method that converts numerical sequences into natural-language expressions suitable for LLM training. In addition, a data-enhancement strategy is designed to alleviate the "hallucination" problem (i.e., generating inaccurate or irrelevant data) that may occur in LLMs, thereby improving the reliability of the prediction results.
4. **Applicability to multiple time scales and multiple scenarios**: The proposed method applies to load-forecasting tasks at different time scales and can also handle multiple application scenarios, which makes it flexible and adaptable.

With these methods, the paper aims to outperform existing methods across its validation metrics and to provide a more accurate and reliable load-forecasting solution.

### Formula presentation

- **Data-conversion formula**:
  \[ X_{\text{text}} = S(X) = \{S(x_1), S(x_2), \dots, S(x_n)\} \]
  where \(x_i\) (\(1 \leq i \leq n\)) is the \(i\)-th data point in the input sequence, and \(S\) is the conversion function from real numbers to text.
- **Statistical-information-enhancement formula**:
  \[ X_{\text{ts}} = \{X_{\text{text}}, X_{\text{stat}}\} \]
  \[ X_{\text{stat}} = \{\max(X_{\text{obs}}), \min(X_{\text{obs}}), \text{average}(X)\} \]
  where \(X_{\text{obs}}\) is the historical load data over the \(N_{\text{obs}}\) steps preceding the prediction time.
- **Separation of numerical sequence and language description**:
  \[ X_{\text{ets}} = \{X^*_{\text{text}}, X_{\text{stat}}\} \]
  \[ X^*_{\text{text}} = \{(t_1, x_1), (t_2, x_2), \dots, (t_n, x_n)\} \]
  where \(t_i\) (\(1 \leq i \leq n\)) is the time step corresponding to the \(i\)-th data point.

### Summary

By combining pre-trained large language models with data-modeling and data-enhancement methods for power load forecasting, the paper proposes a general and flexible load-forecasting framework that aims to improve prediction accuracy and address the limitations of existing methods.
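The three formulas in the formula section (text conversion, statistical enhancement, and time-step pairing) can be illustrated with a minimal Python sketch. It is not the authors' implementation. The function names, the one-decimal formatting, the integer time index starting at 1, and the prompt wording are all hypothetical choices made for illustration. The sketch also assumes that all three statistics are computed over the same observed window.

```python
from statistics import mean


def to_text(x, decimals=1):
    """S(x): render one load reading as a text token (formatting is an assumption)."""
    return f"{x:.{decimals}f}"


def serialize(series, decimals=1):
    """X_text = {S(x_1), ..., S(x_n)}."""
    return [to_text(x, decimals) for x in series]


def window_stats(observed):
    """X_stat = {max, min, average}, all taken over the observed window here."""
    return {
        "max": max(observed),
        "min": min(observed),
        "average": mean(observed),
    }


def timestamped(series, decimals=1):
    """X*_text = {(t_1, x_1), ..., (t_n, x_n)} with t_i = i (hypothetical time index)."""
    return [(t, to_text(x, decimals)) for t, x in enumerate(series, start=1)]


def build_prompt(series, decimals=1):
    """X_ets = {X*_text, X_stat} rendered as one string (wording is hypothetical)."""
    history = ", ".join(f"t{t}: {v}" for t, v in timestamped(series, decimals))
    s = window_stats(series)
    return (
        f"History: {history}. "
        f"Max: {to_text(s['max'], decimals)}, "
        f"min: {to_text(s['min'], decimals)}, "
        f"average: {to_text(s['average'], decimals)}."
    )


if __name__ == "__main__":
    load = [310.5, 298.0, 325.2]  # made-up readings, for illustration only
    print(build_prompt(load))
```

Keeping the statistics in a separate field, as the formulas do, means the model does not have to infer the maximum, minimum, and average from the serialized digits. The summary presents this as part of the strategy for reducing hallucinated outputs.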