Balanced self-paced learning with feature corruption

Yazhou Ren,Peng Zhao,Zenglin Xu,Dezhong Yao
DOI: https://doi.org/10.1109/IJCNN.2017.7966104
2017-01-01
Abstract:Self-paced learning (SPL), a recently proposed learning strategy, which progressively adds instances to train from simplicity to complexity, could typically reduce the risk of achieving local optima. SPL selects instances based on their losses among the entire data set in each iteration. This probably causes that the selected instances are highly imbalanced, e.g., very few (even on) instances of some classes are chosen, and further negatively affects the training process. To address this issue, we propose a balanced self-paced learning (BSPL) scenario, which iteratively selects training samples based on their loss values from each class, instead of from the entire data set. From another perspective, learning with marginalized corrupted features is an approach to control overfitting by artificially corrupting the training data. However, feature corruption techniques typically lead to that the classification problem is non-convex and easily traps in local optima. To alleviate this, we propose balanced self-paced learning with feature corruption (BSPL-FC), which considers the instance sampling and feature corruption simultaneously. BSPL-FC first treats the feature corruption as a regularizer and then applies BSPL to solve the regularized classification problem. BSPL-FC inherently has advantages in controlling overfitting and avoiding local optima. Experimental results show the effectiveness of the proposed model.
What problem does this paper attempt to address?