The Kth, Median and Average Margin Bounds for AdaBoost

Wei Gao,Zhi-Hua Zhou
2010-01-01
Abstract:Margin theory provides one of the most popular explanations to the success of AdaBoost, where the central point is that the margin is the key for characterizing the performance of AdaBoost. This has been very influential, e.g., it has been used to argue that AdaBoost does not overfit since it tends to enlarge the margin even after the training set error reaches zero. Previously the minimum margin bound was established for AdaBoost, however, researchers believe that this is far from sufficient since the minimum margin bound considers only a single margin. Intuitively, the margin distribution is crucial, but unfortunately there is no bound for AdaBoost based on the whole margin distribution. In this paper, we prove the kth margin bound, and show that the previous minimum margin bound and Emargin bound for AdaBoost are special cases of the kth margin bound; both of them are single margin bounds. We derive the median margin bound which theoretically supports previous empirical argument that, for large training sample, a larger median margin will lead to a stronger generalization. More important, we prove the average margin bound which is, to the best of our knowledge, the first bound for AdaBoost based on the whole margin distribution.
What problem does this paper attempt to address?