Blocked 3×2 Cross-Validated T-Test for Comparing Supervised Classification Learning Algorithms

Wang Yu, Wang Ruibo, Jia Huichen, Li Jihong
DOI: https://doi.org/10.1162/neco_a_00532
IF: 3.278
2013-01-01
Neural Computation
Abstract: In research on machine learning algorithms for classification tasks, comparing the performance of algorithms is extremely important, and a statistical significance test on the generalization error is often used for this purpose in the machine learning literature. In view of the randomness of partitions in cross-validation, this letter proposes a new blocked 3×2 cross-validation to estimate the generalization error. We then conduct an analysis of variance of the blocked 3×2 cross-validated estimator. We put forward a relatively conservative variance estimator that accounts for the correlation between any two two-fold cross-validations, a correlation previously neglected in the 5×2 cross-validated t- and F-tests. A corresponding test using this variance estimator is presented to compare the performance of algorithms. Simulation results show that the performance of our test is comparable with that of the 5×2 cross-validated tests but with lower computational complexity.
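The abstract does not spell out how the blocked partitions are built, so the following is only a minimal sketch under a stated assumption: the data are shuffled into four near-equal blocks, and the three two-fold splits are the three ways of pairing those blocks into halves. The function names `blocked_3x2_partitions` and `blocked_3x2_cv_estimate`, and the `error_fn` callback, are hypothetical and introduced for illustration. The sketch covers only the point estimate (the average of six fold errors), not the variance estimator or the test statistic, which the abstract does not give.

```python
import random


def blocked_3x2_partitions(n, seed=0):
    """Return three (half_a, half_b) index splits built from four blocks.

    Assumption (not stated in the abstract): the n indices are shuffled
    into four near-equal blocks, and each split pairs the blocks differently.
    """
    indices = list(range(n))
    random.Random(seed).shuffle(indices)
    # Four disjoint blocks of near-equal size.
    blocks = [indices[i::4] for i in range(4)]
    # The three distinct ways of pairing four blocks into two halves.
    pairings = [((0, 1), (2, 3)), ((0, 2), (1, 3)), ((0, 3), (1, 2))]
    partitions = []
    for (a, b), (c, d) in pairings:
        half_a = sorted(blocks[a] + blocks[b])
        half_b = sorted(blocks[c] + blocks[d])
        partitions.append((half_a, half_b))
    return partitions


def blocked_3x2_cv_estimate(n, error_fn, seed=0):
    """Average error_fn(train_idx, test_idx) over 3 splits x 2 folds.

    error_fn is a hypothetical user-supplied callback that trains on
    train_idx and returns the error measured on test_idx.
    """
    errors = []
    for half_a, half_b in blocked_3x2_partitions(n, seed):
        errors.append(error_fn(half_a, half_b))  # train on a, test on b
        errors.append(error_fn(half_b, half_a))  # train on b, test on a
    return sum(errors) / len(errors), errors
```

Under this assumed construction, any two training halves taken from different splits share exactly one block, so their overlap is fixed at about n/4 rather than left to chance. The whole procedure needs six model fits, compared with ten for a 5×2 cross-validation.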