$\propto$SVM for learning with label proportions

Felix X. Yu, Dong Liu, Sanjiv Kumar, Tony Jebara, Shih-Fu Chang
DOI: https://doi.org/10.48550/arXiv.1306.0886
IF: 5.414
2013-06-04
Machine Learning
Abstract: We study the problem of learning with label proportions, in which the training data is provided in groups and only the proportion of each class in each group is known. We propose a new method called proportion-SVM, or $\propto$SVM, which explicitly models the latent unknown instance labels together with the known group label proportions in a large-margin framework. Unlike existing work, our approach avoids making restrictive assumptions about the data. The $\propto$SVM model leads to a non-convex integer programming problem. To solve it efficiently, we propose two algorithms: one based on simple alternating optimization and the other based on a convex relaxation. Extensive experiments on standard datasets show that $\propto$SVM outperforms the state of the art, especially for larger group sizes.
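
The abstract does not spell out the alternating optimization, so the following is only a minimal sketch of what one half of such a scheme might look like: with the classifier held fixed, choose instance labels for a single group that trade off hinge loss against deviation from the known proportion. The function names (`hinge`, `update_bag_labels`), the weights `c` and `c_p`, and the absolute-difference proportion loss are all assumptions made for illustration, not details confirmed by the abstract. The other half of the alternation (retraining a standard SVM on the chosen labels) is omitted.

```python
def hinge(margin):
    """Hinge loss max(0, 1 - margin)."""
    return max(0.0, 1.0 - margin)


def update_bag_labels(scores, target_proportion, c=1.0, c_p=1.0):
    """Choose labels in {-1, +1} for one group, classifier held fixed.

    scores: decision values f(x_i) for the instances in the group.
    target_proportion: known fraction of positive labels in the group.
    c, c_p: hypothetical weights on the hinge and proportion terms.
    """
    n = len(scores)
    if n == 0:
        return []

    # Change in hinge loss when instance i is labelled +1 instead of -1.
    delta = [hinge(s) - hinge(-s) for s in scores]
    # For a fixed number k of positives, the cheapest choice is the k
    # instances with the smallest delta, so sort once and sweep over k.
    order = sorted(range(n), key=lambda i: delta[i])

    running = sum(hinge(-s) for s in scores)  # cost with all labels -1
    best_cost, best_k = None, 0
    for k in range(n + 1):
        if k > 0:
            running += delta[order[k - 1]]
        # Assumed proportion loss: absolute difference of proportions.
        cost = c * running + c_p * abs(k / n - target_proportion)
        if best_cost is None or cost < best_cost:
            best_cost, best_k = cost, k

    labels = [-1] * n
    for i in order[:best_k]:
        labels[i] = 1
    return labels
```

For example, with scores `[2.0, 0.5, -0.3, -1.5]`, `c_p=10.0`, and a target proportion of 0.5, the sketch labels the two highest-scoring instances positive. Raising the target to 0.75 also flips the third instance to positive even though its score is negative, which shows the proportion term overriding the classifier's sign.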