Multi-Instance Learning with Any Hypothesis Class

Sivan Sabato, Naftali Tishby
DOI: https://doi.org/10.48550/arXiv.1107.2021
2012-08-14
Abstract: In the supervised learning setting termed Multiple-Instance Learning (MIL), the examples are bags of instances, and the bag label is a function of the labels of its instances. Typically, this function is the Boolean OR. The learner observes a sample of bags and the bag labels, but not the instance labels that determine the bag labels. The learner is then required to emit a classification rule for bags based on the sample. MIL has numerous applications, and many heuristic algorithms have been used successfully on this problem, each adapted to specific settings or applications. In this work we provide a unified theoretical analysis for MIL, which holds for any underlying hypothesis class, regardless of a specific application or problem domain. We show that the sample complexity of MIL is only poly-logarithmically dependent on the size of the bag, for any underlying hypothesis class. In addition, we introduce a new PAC-learning algorithm for MIL, which uses a regular supervised learning algorithm as an oracle. We prove that efficient PAC-learning for MIL can be generated from any efficient non-MIL supervised learning algorithm that handles one-sided error. The computational complexity of the resulting algorithm is only polynomially dependent on the bag size.
Machine Learning
What problem does this paper attempt to address?
This paper addresses theoretical and algorithmic questions in Multiple-Instance Learning (MIL). Specifically:

1. **Theoretical Analysis**: The paper provides a unified theoretical framework for analyzing MIL that applies to any underlying hypothesis class and is not tied to a specific application or problem domain. It proves that the sample complexity of MIL depends only poly-logarithmically on the bag size, for any underlying hypothesis class. It also provides distribution-dependent sample complexity bounds, which are useful when the average bag size is bounded.
2. **Algorithm Design**: The paper introduces a new PAC-learning algorithm that uses a conventional supervised learning algorithm as an "oracle". It proves that an efficient PAC-learning algorithm for MIL can be generated from any efficient non-MIL supervised learning algorithm that handles one-sided error. The computational complexity of the resulting algorithm depends only polynomially on the bag size.
3. **Generality**: The methods apply not only to specific hypothesis classes, such as Axis Parallel Rectangles (APRs), but also to a wider range of hypothesis classes, bag-labeling functions, and target loss functions.

### Main Contributions

- **Sample Complexity Analysis**: The paper proves that the sample complexity of MIL depends only poly-logarithmically on the bag size, which means that large bags do not, by themselves, require many more training examples.
- **PAC-Learning Algorithm**: The paper proposes a new PAC-learning algorithm that builds on existing non-MIL supervised learning algorithms and comes with provable performance guarantees.
- **Combination of Theory and Practice**: Alongside the theoretical analysis, the paper discusses practical application scenarios such as drug design, image classification, and web page recommendation.

### Application Background

- **Drug Design**: MIL is used to predict which molecules can bind to a specific binding site. Each molecule may take multiple conformations (shapes). If at least one conformation can bind to the site, the molecule is labeled as a positive example. The experiment does not reveal which conformation binds, so the molecule is treated as a bag of conformations.
- **Image Classification**: An image can be treated as a bag of regions (instances), each of which may contain different objects. MIL can be used to identify whether an image contains a specific object.
- **Web Page Recommendation**: A web page can be treated as a bag of keywords, and MIL can be used to predict whether a user will be interested in the page.

### Summary

By providing a general theoretical framework and a general method for constructing algorithms, the paper gives MIL a solid theoretical basis and practical tools that apply across multiple fields.
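The bag-labeling rule described in the abstract (a bag is positive if and only if at least one of its instances is positive) can be illustrated with a minimal Python sketch. This is not the paper's learning algorithm; it only shows how a bag label is derived from an instance-level hypothesis under the Boolean OR. The names `bag_label` and `in_rectangle`, and the toy two-dimensional data, are hypothetical and chosen for illustration; the axis-parallel rectangle stands in for the APR hypothesis class mentioned above.

```python
from typing import Callable, Sequence

Instance = Sequence[float]
Bag = Sequence[Instance]


def bag_label(bag: Bag, instance_hypothesis: Callable[[Instance], bool]) -> bool:
    """Label a bag with the Boolean OR of its instance labels."""
    return any(instance_hypothesis(x) for x in bag)


def in_rectangle(lower: Instance, upper: Instance) -> Callable[[Instance], bool]:
    """Hypothetical instance hypothesis: membership in an axis-parallel rectangle."""
    def h(x: Instance) -> bool:
        # Positive iff every coordinate lies within the rectangle's bounds.
        return all(lo <= xi <= hi for lo, xi, hi in zip(lower, x, upper))
    return h


# Toy example: instances are 2-D points, the "true" concept is the unit square.
h = in_rectangle(lower=(0.0, 0.0), upper=(1.0, 1.0))

positive_bag = [(2.0, 3.0), (0.5, 0.5), (-1.0, 4.0)]  # one instance falls inside
negative_bag = [(2.0, 3.0), (-1.0, 4.0)]              # no instance falls inside

labels = [bag_label(positive_bag, h), bag_label(negative_bag, h)]
```

In the MIL setting the learner would see only the bags and the resulting bag labels, not which instance (if any) made a bag positive, which mirrors the drug-design case where the binding conformation is unknown.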