Abstract:Large language models (LLMs) have exhibited great potential in fault diagnosis of heating, ventilation, and air conditioning systems. However, the fault diagnosis accuracy of LLMs is still unsatisfactory, due to the lack of effective diagnosis accuracy enhancement methods for LLMs. To fill this gap, this study proposes a LLM finetuning method supervised by data with fault and fault-free labels to enhance the fault diagnosis accuracy of LLMs. This method designs a LLM self-correction strategy to automatically generate a fine-tuning dataset based on the labeled data. The generated fine-tuning dataset is applied to fine-tune a LLM. Moreover, a data augmentation-based approach is put forward to adaptively update the fine-tuning dataset for iteratively developing a high-performance fine-tuned LLM. The proposed method is utilized to fine-tune the GPT-3.5 model using the air handling unit (AHU) fault dataset from the RP-1312 project. The results show that the diagnosis accuracy of the GPT-3.5 model is increased from 29.5 % to 100.0 % after model fine-tuning. Compared with the GPT-4 model, the fine-tuned GPT-3.5 model achieves a 31.1 % higher average diagnosis accuracy. The fine-tuned GPT-3.5 model is also applied to diagnose faults in two AHUs from another open-source dataset to verify the generalization ability of this model. The two AHUs have different system structures and sensor configurations compared to the AHU in the RP-1312 dataset, and this dataset is not utilized to fine-tune the GPT-3.5 model. The average diagnosis accuracy of the GPT-3.5 model is increased from 46.0 % to 99.1 % and from 38.8 % to 98.9 % for the faults in the two AHUs, respectively, after model fine-tuning. Furthermore, the proposed method is verified using two fault datasets from a variable air volume box and a chiller plant system. After fine-tuning the GPT-3.5 model using the two datasets, the average diagnosis accuracy of this model is increased from 33.0 % to 98.3 % for variable air volume box faults and from 36.0 % to 99.1 % for chiller plant system faults. This study provides an effective solution to the development of domain-specific LLMs for this domain.

Domain-specific Large Language Models for Fault Diagnosis of Heating, Ventilation, and Air Conditioning Systems by Labeled-Data-supervised Fine-Tuning

Model-based Fault Detection and Diagnosis for HVAC.

Evaluation of Large Language Models (llms) on the Mastery of Knowledge and Skills in the Heating, Ventilation and Air Conditioning (HVAC) Industry

Integrating Active Learning and Semi-Supervised Learning for Improved Data-Driven HVAC Fault Diagnosis Performance

Evaluation and Improvement of Fault Detection for Large Language Models

A Fine-Tuned Large Language Model for Domain-Specific with Reinforcement Learning

A Fault Detection Model for Air Handling Units Based on the Machine Learning Algorithms

A machine learning classifier for automated fault detection and diagnosis (AFDD) of rooftop units, addressing practical challenges of application

How to improve the application potential of deep learning model in HVAC fault diagnosis: Based on pruning and interpretable deep learning method

Experimental study on performance assessments of HVAC cross-domain fault diagnosis methods oriented to incomplete data problems

Leveraging error-assisted fine-tuning large language models for manufacturing excellence

A fault diagnosis framework based on heterogeneous ensemble learning for air conditioning chiller with unbalanced samples

Domain fuzzy generalization networks for semi-supervised intelligent fault diagnosis under unseen working conditions

LLM-based Framework for Bearing Fault Diagnosis

Fault diagnosis of HVAC system with imbalanced data using multi-scale convolution composite neural network

A Semi-Supervised Approach To Fault Detection And Diagnosis For Building Hvac Systems Based On The Modified Generative Adversarial Network

An Intelligent Machinery Fault Diagnosis Method Based on GAN and Transfer Learning under Variable Working Conditions

Enhancing Large Language Model Performance To Answer Questions and Extract Information More Accurately

Label Supervised LLaMA Finetuning

Robust Mechanical Fault Diagnosis with Noisy Label Based on Multistage True Label Distribution Learning.

Research on Fault Diagnosis Strategy of Air-Conditioning Systems Based on DPCA and Machine Learning