Improved Generative Adversarial Network for Bearing Fault Diagnosis with a Small Number of Data and Unbalanced Data

Zhaohui Qin,Faguo Huang,Jiafang Pan,Junlin Niu,Haihua Qin
DOI: https://doi.org/10.3390/sym16030358
2024-03-16
Symmetry
Abstract:Fault data under real operating conditions are often difficult to collect, making the number of trained fault data small and out of proportion to normal data. Thus, fault diagnosis symmetry (balance) is compromised. This will result in less effective fault diagnosis methods for cases with a small number of data and data imbalances (S&I). We present an innovative solution to overcome this problem, which is composed of two components: data augmentation and fault diagnosis. In the data augmentation section, the S&I dataset is supplemented with a deep convolutional generative adversarial network based on a gradient penalty and Wasserstein distance (WDCGAN-GP), which solve the problems of the generative adversarial network (GAN) being prone to model collapse and the gradient vanishing during the training time. The addition of self-attention allows for a better identification and generation of sample features. Finally, the addition of spectral normalization can stabilize the training of the model. In the fault diagnosis section, fault diagnosis is performed through a convolutional neural network with coordinate attention (CNN-CA). Our experiments conducted on two bearing fault datasets for comparison demonstrate that the proposed method surpasses other comparative approaches in terms of the quality of data augmentation and the accuracy of fault diagnosis. It effectively addresses S&I fault diagnosis challenges.
multidisciplinary sciences
What problem does this paper attempt to address?
The paper you've provided discusses a novel approach to address the challenges of small and imbalanced datasets in bearing fault diagnosis using machine learning. Here's a summary of the problem the paper tries to solve: The paper aims to improve fault diagnosis accuracy in scenarios where the available data is limited and imbalanced, specifically focusing on bearing fault diagnosis. In real-world conditions, collecting sufficient fault data can be difficult since machinery typically operates normally most of the time. This leads to datasets with a disproportionately small number of fault cases compared to normal operation cases, which can compromise the effectiveness of fault diagnosis methods. To tackle this issue, the authors propose an innovative solution that combines data augmentation and fault diagnosis. The data augmentation component uses a Deep Convolutional Generative Adversarial Network (DCGAN) with modifications such as Wasserstein distance, gradient penalty, and self-attention mechanisms to generate synthetic fault data. This addresses problems like model collapse and gradient vanishing during training, improving the quality and diversity of the generated samples. Additionally, spectral normalization is applied to stabilize model training. The fault diagnosis component employs a Convolutional Neural Network (CNN) with a coordinate attention mechanism (CNN-CA) to perform the diagnosis on the augmented dataset. The coordinate attention mechanism helps the model