Abstract:Multi-instance Learning (MIL) is a popular learning paradigm arising from many real applications. It assigns a label to a set of instances, named as a bag, and the bag’s label is determined by the instances within it. A bag is positive if and only if it has at least one positive instance. Since labeling bags is more complicated than labeling each instance, we will often face the mislabeling problem in MIL. Furthermore, it is more common that a negative bag has been mislabeled to a positive one since one mislabeled instance will lead to the change of the whole bag label. This is an important problem that originated from real applications, e.g., web mining and image classification, but little research has concentrated on it as far as we know. In this paper, we focus on this MIL problem with one side label noise that the negative bags are mislabeled as positive ones. To address this challenging problem, we propose a novel multi-instance learning method with One Side Label Noise (OSLN). We design a new double weighting approach under traditional framework to characterize the ’faithfulness’ of each instance and each bag in learning the classifier. Briefly, on the instance level, we employ a sparse weighting method to select the key instances, and the MIL problem with one size label noise is converted to a mislabeled supervised learning scenario. On the bag level, the weights of bags, together with the selected key instances, will be utilized to identify the real positive bags. In addition, we have solved our proposed model by an alternative iteration method with proved convergence behavior. Empirical studies on various datasets have validated the effectiveness of our method.

Multiple instance learning: A survey of problem characteristics and applications

Multiple-Instance Learning from Pairwise Comparison Bags

Multiple instance learning with bag dissimilarities

Multiple-Instance Learning from Similar and Dissimilar Bags

Multiple instance learning from similarity-confidence bags

Multiple-Instance Learning from Unlabeled Bags with Pairwise Similarity

A New Multiple Instance Algorithm Using Structural Information.

Instance-level Semisupervised Multiple Instance Learning.

Mild: Multiple-Instance Learning Via Disambiguation

Characterizing multiple instance datasets

Multiple Instance Learning Via Distance Metric Optimization

Rethinking Multiple Instance Learning: Developing an Instance-Level Classifier via Weakly-Supervised Self-Training

Scalable Multi-instance Learning

Reproducibility in Multiple Instance Learning: A Case For Algorithmic Unit Tests

Multiple-instance learning with instance selection via constructive covering algorithm

Multi-instance Positive and Unlabeled Learning with Bi-Level Embedding

Multiple-Instance Learning by Boosting Infinitely Many Shapelet-based Classifiers

Multi-Instance Learning with Any Hypothesis Class

Multi-Instance Learning: A Survey

Scalable Algorithms for Multi-Instance Learning.

Multi-Instance Learning with One Side Label Noise