Abstract:The problem of fault classification in industry has been studied extensively. Most classification algorithms are modeled on the premise of data balance. However, the difficulty of collecting industrial data in different modes is quite different. This inevitably leads to data imbalance, which will adversely affect the fault classification performance. This article proposes a novel data augmentation classifier (DAC) for imbalanced fault classification. Data augmentation based on generative adversarial networks (GANs) is an effective way to solve the problem of unbalanced classification. However, the randomness of the GAN generation process restricts the effect of data enhancement. DAC proposes a data selection strategy based on data filtering and data purification in model training to solve this problem. In addition, DAC combines supervised learning and data generation processes to obtain an end-to-end model. Meanwhile, multigenerator structure of DAC (MDAC) is proposed to solve the problem of incomplete learning of a single generator when data imbalances get complicated. The proposed DAC and MDAC are applied in two fault classification cases of the Tennessee Eastman (TE) benchmark process, results of which show superiority of DAC and MDAC compared to existing methods. Note to Practitioners-Data imbalances are common in fault classification and affect the effectiveness of modeling in industry. As a generative model, generative adversarial networks (GANs) provide new ideas for small-class data augmentation. However, the instability of its training process and the randomness of data generation affect the results of data augmentation. In this article, the GAN generation process is analyzed in detail. The results of the visualization indicate that no data generation was perfect at any one time. Based on the rules of GAN data generation, we propose a data selection strategy during training. High-quality data are selected for data augmentation through data filtering and data purification. Apart from this, we combine the training process of GAN and classification model for imbalanced data to reduce modeling time. Through industrial examples, we have evaluated the effectiveness of this method.

Self-Paced Video Data Augmentation by Generative Adversarial Networks with Insufficient Samples.

Data Augmentation Classifier for Imbalanced Fault Classification

A tutorial on generative adversarial networks with application to classification of imbalanced data

Data Augmentation in Emotion Classification Using Generative Adversarial Networks

Data Augmentation Based on Generative Adversarial Network with Mixed Attention Mechanism

Collaborative Discrimination-Enabled Generative Adversarial Network (CoD-GAN) for the Data Augmentation in Imbalanced Classification

IDA-GAN: A Novel Imbalanced Data Augmentation GAN

EID-GAN: Generative Adversarial Nets for Extremely Imbalanced Data Augmentation

Differentiable Augmentation for Data-Efficient GAN Training

SSGAN: A Semantic Similarity-Based GAN for Small-Sample Image Augmentation

The Imaginative Generative Adversarial Network: Automatic Data Augmentation for Dynamic Skeleton-Based Hand Gesture and Human Action Recognition

Video augmentation technique for human action recognition using genetic algorithm

Time series data augmentation method of small sample based on optimized generative adversarial network

Data Augmentation Using Adversarial Training for Construction-Equipment Classification

Performance Study of Image Data Augmentation by Generative Adversarial Networks

Data Augmentation for Image Classification using Generative AI

GAN based Data Augmentation to Resolve Class Imbalance

Augmenting data with generative adversarial networks: An overview

Data Augmentation Generated by Generative Adversarial Network for Small Sample Datasets Clustering

Data Augmentation for Audio-Visual Emotion Recognition with an Efficient Multimodal Conditional GAN

Feature Learning-Based Generative Adversarial Network Data Augmentation for Class-Based Few-Shot Learning