Boundary-Eliminated Pseudoinverse Linear Discriminant for Imbalanced Problems

Yujin Zhu, Zhe Wang, Hongyuan Zha, Daqi Gao
DOI: https://doi.org/10.1109/tnnls.2017.2676239
IF: 14.255
2017-01-01
IEEE Transactions on Neural Networks and Learning Systems
Abstract: Existing learning models for classification of imbalanced data sets can be grouped as either boundary-based or nonboundary-based, depending on whether a decision hyperplane is used in the learning process. The focus of this paper is a new approach that leverages the advantages of both. Specifically, our new model partitions the input space into three parts by creating two additional boundaries in the training process, and then makes the final decision based on a heuristic measurement between the test sample and a subset of selected training samples. Since the hyperplane used by the underlying classifier is eliminated, the proposed model is named the boundary-eliminated (BE) model. Additionally, the pseudoinverse linear discriminant (PILD) is adopted for the BE model to obtain a novel classifier abbreviated as BEPILD. Experiments on 31 imbalanced and 7 standard data sets validate both the effectiveness and the efficiency of BEPILD compared with 13 state-of-the-art classification methods.
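For orientation, the underlying PILD classifier named in the abstract can be sketched as a least-squares linear discriminant solved with the Moore-Penrose pseudoinverse. This is a minimal illustration of that general technique, not the paper's implementation. The function names, the {-1, +1} label encoding, the zero decision threshold, and the toy data are all assumptions made here. The boundary-elimination step is not sketched, since the abstract does not specify how the two extra boundaries or the heuristic measurement are defined.

```python
import numpy as np


def fit_pild(X, y):
    """Fit a linear discriminant by least squares via the pseudoinverse.

    X: (n_samples, n_features) array.
    y: targets in {-1, +1} (an assumed encoding, not taken from the paper).
    Returns the augmented weight vector [w_1, ..., w_d, b].
    """
    # Append a constant column so the bias is learned with the weights.
    X_aug = np.hstack([X, np.ones((X.shape[0], 1))])
    # pinv gives the minimum-norm least-squares solution of X_aug @ w = y.
    return np.linalg.pinv(X_aug) @ y


def predict_pild(weights, X):
    """Classify by the sign of the linear score (threshold 0 is an assumption)."""
    X_aug = np.hstack([X, np.ones((X.shape[0], 1))])
    scores = X_aug @ weights
    return np.where(scores >= 0, 1, -1)


# Hypothetical toy data: four majority-class points and one minority-class point.
X_train = np.array([[0.0, 0.0], [0.2, 0.1], [0.1, 0.3], [0.3, 0.2], [2.0, 2.0]])
y_train = np.array([-1.0, -1.0, -1.0, -1.0, 1.0])

weights = fit_pild(X_train, y_train)
predictions = predict_pild(weights, X_train)
```

On this toy set the single hyperplane separates the classes, but with heavier overlap a least-squares fit tends to favor the majority class, which is the kind of situation the BE model is described as targeting.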