MIS-Boost: Multiple Instance Selection Boosting

Emre Akbas,Bernard Ghanem,Narendra Ahuja
DOI: https://doi.org/10.48550/arXiv.1109.2388
2011-09-12
Abstract:In this paper, we present a new multiple instance learning (MIL) method, called MIS-Boost, which learns discriminative instance prototypes by explicit instance selection in a boosting framework. Unlike previous instance selection based MIL methods, we do not restrict the prototypes to a discrete set of training instances but allow them to take arbitrary values in the instance feature space. We also do not restrict the total number of prototypes and the number of selected-instances per bag; these quantities are completely data-driven. We show that MIS-Boost outperforms state-of-the-art MIL methods on a number of benchmark datasets. We also apply MIS-Boost to large-scale image classification, where we show that the automatically selected prototypes map to visually meaningful image regions.
Machine Learning,Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The problem that this paper attempts to solve is, in the framework of Multiple Instance Learning (MIL), how to learn discriminative instance prototypes through explicit instance selection. Specifically, MIL methods deal with the situation where the label of each "bag" in the training data is known, but the specific labels of individual instances within the bag are unknown. Traditional MIL methods are usually limited by the following points: 1. **Limitations of prototype selection**: Most methods limit prototypes to a discrete set of instances in the training set and cannot freely take any value in the instance feature space. 2. **Limitations on the number of instances selected per bag**: Some methods pre - define or fix the number of instances selected in each bag. 3. **Independence of the representation and classification stages**: Many methods consider prototype selection (the representation stage) and the final bag classifier learning (the classification stage) as independent processes. To address these problems, the paper proposes a new MIL method - MIS - Boost (Multiple Instance Selection Boosting). MIS - Boost improves traditional MIL methods in the following ways: - **No restriction on prototype sources**: It allows prototypes to take any value in the instance feature space, not limited to the instances in the training set. - **Data - driven determination of the number of prototypes**: It does not pre - set the total number of prototypes or the number of instances selected per bag, but automatically determines these parameters according to the data itself. - **Combination of the representation and classification stages**: It simultaneously optimizes prototype selection and classification performance during the learning process, making the whole process more efficient and consistent. Through these improvements, MIS - Boost can outperform existing MIL methods on multiple benchmark datasets and perform excellently in large - scale image classification tasks, and can automatically select prototypes that map to visually meaningful image regions.