A Global and Joint Knowledge Distillation Method with Gradient-Modulated Dynamic Parameter Adaption for EMU Bogie Bearing Fault Diagnosis

Tongyang Pan,Tiantian Wang,Jinglong Chen,Jingsong Xie,Sha Cao
DOI: https://doi.org/10.1016/j.measurement.2024.114927
IF: 5.6
2024-01-01
Measurement
Abstract:Deep learning has exhibited remarkable performance and achieved significant breakthroughs in railway transportation equipment fault diagnosis. However, in engineering practice, the escalating complexity of model computations poses a challenge, necessitating the compression and acceleration of models as a crucial technology for deploying intelligent algorithms on edge devices that have limited power and memory capabilities. Additionally, the disparity in the distributions of design and operational data poses another obstacle to accurate diagnosis. Despite various training strategies proposed, they often prioritize either network acceleration and quantification or domain adaptation, which may compromise accuracy and efficiency in real-world applications involving both domain-shifted data and resource-constrained environments.To address these challenges, we introduce a global and joint knowledge distillation approach that incorporates gradient-modulated dynamic parameter adaption specifically for bogie bearing fault diagnosis. By distilling the global and joint knowledge from sophisticated neural networks, the resulting compact models not only maintain diagnosis accuracy but also require fewer computational resources for inference. Furthermore, the gradient-modulated dynamic parameter updating strategy ensures stability during the iterative training of both the teacher and student networks. Extensive experiments conducted on two benchmark datasets demonstrate that the proposed algorithm surpasses pure distillation methods, highlighting its effectiveness and efficiency in railway transportation equipment fault diagnosis.
What problem does this paper attempt to address?