A lightweight diagnosis method for gear fault based on multi-path convolutional neural networks with attention mechanism

Tianming Chen,Manyi Wang,Yilin Jiang,Jiachen Yao,Ming Li
DOI: https://doi.org/10.1007/s10489-024-06094-6
IF: 5.3
2024-12-12
Applied Intelligence
Abstract:The fault diagnosis of gear is indeed a crucial aspect of maintaining rotating machinery, as it helps in ensuring the safe and efficient operation of industrial equipment. Deep learning models have gained significant attention for gear fault diagnosis due to their ability to automatically extract features from raw data, but they also come with their own set of challenges. One major limitation of existing methods is the insufficient consideration given to the impact of environmental noise at industrial field on the diagnostic effectiveness of the models. Additionally, there is a contradiction between the week computational resources of current embedded platforms for industrial field device applications and the large number of parameters and computations required for deep learning models. This may hinder the deployment of complex models in industrial field devices. To address these issues, a novel approach to multi-path convolutional neural network with dual branch attention (AMPCNN) has been proposed. This approach aims to enhance the recognition of different fault types and maintain high accuracy in noisy environments by extracting multi-scale features of the original vibration signal using multi-path convolution and dual branch attention mechanisms. Furthermore, a multi-knowledge distillation (MKD) method has been introduced to construct lightweight multi-sensor gear fault diagnosis models. This approach facilitates the transfer of multiple knowledge from a complex teacher network to a simpler student network, resulting in a lightweight model that exhibits excellent robustness in various noise environments. The experimental results show that the lightweight model achieves high accuracy while requiring significantly fewer floating-point operations and parameter quantities compared to the original teacher network.
computer science, artificial intelligence
What problem does this paper attempt to address?