Mutual Ensemble Learning for Brain Tumor Segmentation

Jingyu Hu,Xiaojing Gu,Xingsheng Gu
DOI: https://doi.org/10.1016/j.neucom.2022.06.058
IF: 6
2022-01-01
Neurocomputing
Abstract:It is challenging to reduce the generalization errors of brain tumor segmentation models on test data, as the nature of the high diversity of tumors. The model ensemble combining multiple models to make the final prediction is a reliable strategy to reduce generalization errors. Conventionally, these models that acted as member networks of the ensemble are trained separately. Each member network is trained independently on a data subset or a single perspective of 3D brain volumes. To enable each member network to be trained on the complete training dataset as well as 3D images, we propose a mutual learning method to mutually train multiple 3D segmentation networks of the ensemble, called mutual ensemble learning (MEL). Mutual learning can enable knowledge exchange between networks and let them teach each other during the training process so that each member can converge to a better local minimum compared with when trained separately. Meanwhile, mutual learning also keeps the partially independent errors made by different member networks, and combining these member networks can reduce overall errors on the test set. In a word, combining these stronger members gives better ensemble results and it would not bring more cost in the test stage. To realize the mutual learning of multiple 3D brain tumor segmentation networks, we introduce a novel loss called Consensus Dice loss to explicitly exchange information among different networks during training. This loss considers the overlap of prediction probabilities by multiple networks and ground truth to enhance the overlapping pixels’ probabilities. Extensive experiments are conducted on the brain tumor segmentation, i.e., BRATS 2015 and 2018 datasets. Results on the databases indicate that the proposed approach consistently improves the segmentation performance of the baseline network. Furthermore, our proposed method achieves state-of-the-art performance on the BRATS 2018 online validation set, and it is flexible to be extended by diverse homogeneous and heterogeneous state-of-the-art networks.
What problem does this paper attempt to address?