A novel transformer-based DL model enhanced by position-sensitive attention and gated hierarchical LSTM for aero-engine RUL prediction

Xinping Chen
DOI: https://doi.org/10.1038/s41598-024-59095-3
IF: 4.6
2024-05-04
Scientific Reports
Abstract:Accurate prediction of remaining useful life (RUL) for aircraft engines is essential for proactive maintenance and safety assurance. However, existing methods such as physics-based models, classical recurrent neural networks, and convolutional neural networks face limitations in capturing long-term dependencies and modeling complex degradation patterns. In this study, we propose a novel deep-learning model based on the Transformer architecture to address these limitations. Specifically, to address the issue of insensitivity to local context in the attention mechanism employed by the Transformer encoder, we introduce a position-sensitive self-attention (PSA) unit to enhance the model's ability to incorporate local context by attending to the positional relationships of the input data at each time step. Additionally, a gated hierarchical long short-term memory network (GHLSTM) is designed to perform regression prediction at different time scales on the latent features, thereby improving the accuracy of RUL estimation for mechanical equipment. Experiments on the C-MAPSS dataset demonstrate that the proposed model outperforms existing methods in RUL prediction, showcasing its effectiveness in modeling complex degradation patterns and long-term dependencies.
multidisciplinary sciences
What problem does this paper attempt to address?
This paper attempts to solve several key problems in the prediction of remaining useful life (RUL) of aero - engines. Specifically: 1. **Limitations of existing methods**: Existing methods based on physical models, classical recurrent neural networks (RNN) and convolutional neural networks (CNN) have limitations in capturing long - term dependencies and modeling complex degradation patterns. 2. **Improvement of Transformer model**: Although the Transformer architecture performs well in processing long - sequence data, its self - attention mechanism is insensitive to local context, which limits its performance in RUL prediction. For this reason, this paper introduces a position - sensitive self - attention mechanism (PSA) to enhance the model's attention to the positional relationship of input data, so as to better incorporate local context information. 3. **Multi - time - scale regression prediction**: In order to further improve the ability to model long - term dependencies, this paper designs a gated hierarchical long - short - term memory network (GHLSTM), which can perform regression prediction at different time scales, so as to learn features more comprehensively and improve the accuracy of RUL prediction. ### Main contributions 1. **Introduction of position - sensitive self - attention mechanism (PSA)**: By considering the positional relationship of input data, the PSA mechanism enhances the model's sensitivity to local context, generates more effective hidden features, and thus improves the accuracy of RUL prediction. 2. **Proposal of gated hierarchical long - short - term memory network (GHLSTM)**: GHLSTM introduces multi - level gating structures in the network, which can gradually model dependencies at different time scales, so as to better handle large - scale sequential data and improve the accuracy of RUL prediction. 3. **Experimental verification**: The experimental results on the widely - used aerospace dataset C - MAPSS show that the proposed model is superior to existing methods in quantitative evaluation metrics, demonstrating its effectiveness in modeling complex degradation patterns and long - term dependencies. ### Summary This research proposes a new deep - learning model by combining the position - sensitive self - attention mechanism and the gated hierarchical long - short - term memory network, aiming to overcome the limitations of existing RUL prediction methods and achieve better performance in practical applications.