Semi-supervised meta-learning networks with squeeze-and-excitation attention for few-shot fault diagnosis

Yong Feng,Jinglong Chen,Tianci Zhang,Shuilong He,Enyong Xu,Zitong Zhou
DOI: https://doi.org/10.1016/j.isatra.2021.03.013
IF: 7.3
2022-01-01
ISA Transactions
Abstract:Propose semi-supervised meta-learning networks (SSMN) with squeeze-and-excitation attention for few-shot fault diagnosis.Generate discriminative features from raw inputs via attention mechanism.Refine prototypes for better fault identification by semi-supervised meta-learning.Combine SGD and Adam to converge the training loss efficiently for SSMN.Conduct extensive experiments to evaluate the effectiveness by bearing vibration datasets.In the engineering practice, lacking of data especially labeled data typically hinders the wide application of deep learning in mechanical fault diagnosis. However, collecting and labeling data is often expensive and time-consuming. To address this problem, a kind of semi-supervised meta-learning networks (SSMN) with squeeze-and-excitation attention is proposed for few-shot fault diagnosis in this paper. SSMN consists of a parameterized encoder, a non-parameterized prototype refinement process and a distance function. Based on attention mechanism, the encoder is able to extract distinct features to generate prototypes and enhance the identification accuracy. With semi-supervised few-shot learning, SSMN utilizes unlabeled data to refine original prototypes for better fault recognition. A combinatorial learning optimizer is designed to optimize SSMN efficiently. The effectiveness of the proposed method is demonstrated through three bearing vibration datasets and the results indicate the outstanding adaptability in different situations. Comparison with other approaches is also made under the same setup and the experimental results prove the superiority of the proposed method for few-shot fault diagnosis.
automation & control systems,instruments & instrumentation,engineering, multidisciplinary
What problem does this paper attempt to address?