Deep Fuzzy Envelope Sample Generation Mechanism for Imbalanced Ensemble Classification

Fan Li,Yongming Li,Yinghua Shen,Witold Pedrycz,Xiaoheng Zhang,Pin Wang,Pufei Li,Chuanyan Zhou,Huan Cheng
DOI: https://doi.org/10.1109/tfuzz.2023.3321768
IF: 12.253
2024-01-01
IEEE Transactions on Fuzzy Systems
Abstract:Ensemble methods are widely used to tackle class imbalance problem. However, for existing imbalanced ensemble (IE) methods, the samples in each subset are resampled from the same dataset, and are directly input to the classifier for training, so the quality (diversity and separability) of the subsets is unsatisfactory usually. To solve the problem, a deep fuzzy envelope sample generation mechanism is proposed. First, the Fuzzy C-Means clustering based deep sample envelope pre-network (DSEN) is designed to mine correlation information among samples, thereby increasing the quality of the subsets. Second, the local manifold structure metric (LMSM) and global structure distribution metric (GSDM) are designed to construct local-global structure consistency mechanism (LGSCM) to enhance distribution consistency of interlayer samples of DSEN. Third, the DSEN and LGSCM are combined to form the final deep sample envelope network–DSENLG to refresh the existing subsets. Finally, base classifiers are applied on the new subsets generated by the DSENLG and then fused, thereby realizing a new IE algorithm. The experimental results show that the proposed algorithm is significantly better than existing representative IE algorithms and it achieves the highest improvement of 10.64%, 19.5%, 18.67% and 22.33% on four criteria over the state-of-the-art methods. The originality of the paper is threefold: (a) proposing the concept of “deep fuzzy samples” or “envelope samples”, which comprehensively considers the correlation information among original samples; (b) proposing the LGSCM to resolve the distribution inconsistency of interlayer samples; and (c) forming an fuzzy envelope sample based IE algorithm.
What problem does this paper attempt to address?