Learning Object from Small and Imbalanced Dataset with Boost-Bfko

Liansheng Zhuang,Wei Zhou,Qi Tian,Nenghai Yu
DOI: https://doi.org/10.1109/icme.2008.4607718
2008-01-01
Abstract:One of the main drawbacks of boosting is its overfitting and poor predictive accuracy when the training dataset is small and imbalanced. In this paper, we introduce a novel learning algorithm Boost-BFKO, which combines boosting and data generation. It is suitable for small and imbalanced training datasets. To enlarge training sets, Boost-BFKO uses the adaptive Balanced Feature Knockout procedure (BFKO) to generate new synthetic samples. To enrich the training sets, Boost-BFKO selects seed samples from the minority class, and rebalances the total weights of the different classes in the updated training dataset. Experiments on Caltech 101 database showed that our method achieves a desirable performance when only a few training samples are available for binary classification and multiple object classification.
What problem does this paper attempt to address?