CrossMoDA 2021 Challenge: Benchmark of Cross-Modality Domain Adaptation Techniques for Vestibular Schwannoma and Cochlea Segmentation
Reuben Dorent,Aaron Kujawa,Marina Ivory,Spyridon Bakas,Nicola Rieke,Samuel Joutard,Ben Glocker,Jorge Cardoso,Marc Modat,Kayhan Batmanghelich,Arseniy Belkov,Maria Baldeon Calisto,Jae Won Choi,Benoit M. Dawant,Hexin Dong,Sergio Escalera,Yubo Fan,Lasse Hansen,Mattias P. Heinrich,Smriti Joshi,Victoriya Kashtanova,Hyeon Gyu Kim,Satoshi Kondo,Christian N. Kruse,Susana K. Lai-Yuen,Hao Li,Han Liu,Buntheng Ly,Ipek Oguz,Hyungseob Shin,Boris Shirokikh,Zixian Su,Guotai Wang,Jianghao Wu,Yanwu Xu,Kai Yao,Li Zhang,Sebastien Ourselin,Jonathan Shapey,Tom Vercauteren
DOI: https://doi.org/10.1016/j.media.2022.102628
IF: 10.9
2022-01-01
Medical Image Analysis
Abstract: Domain Adaptation (DA) has recently been of strong interest in the medical imaging community. While a large variety of DA techniques have been proposed for image segmentation, most of these techniques have been validated either on private datasets or on small publicly available datasets. Moreover, these datasets mostly addressed single-class problems. To tackle these limitations, the Cross-Modality Domain Adaptation (crossMoDA) challenge was organised in conjunction with the 24th International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI 2021). CrossMoDA is the first large and multi-class benchmark for unsupervised cross-modality Domain Adaptation. The goal of the challenge is to segment two key brain structures involved in the follow-up and treatment planning of vestibular schwannoma (VS): the VS and the cochleas. Currently, the diagnosis and surveillance in patients with VS are commonly performed using contrast-enhanced T1 (ceT1) MR imaging. However, there is growing interest in using non-contrast imaging sequences such as high-resolution T2 (hrT2) imaging. For this reason, we established an unsupervised cross-modality segmentation benchmark. The training dataset provides annotated ceT1 scans (N=105) and unpaired non-annotated hrT2 scans (N=105). The aim was to automatically perform unilateral VS and bilateral cochlea segmentation on hrT2 scans as provided in the testing set (N=137). This problem is particularly challenging given the large intensity distribution gap across the modalities and the small volume of the structures. A total of 55 teams from 16 countries submitted predictions to the validation leaderboard. Among them, 16 teams from 9 different countries submitted their algorithm for the evaluation phase.
The level of performance reached by the top-performing teams is strikingly high (best median Dice score - VS: 88.4%; Cochleas: 85.7%) and close to full supervision (median Dice score - VS: 92.5%; Cochleas: 87.7%). All top-performing methods made use of an image-to-image translation approach to transform the source-domain images into pseudo-target-domain images. A segmentation network was then trained using these generated images and the manual annotations provided for the source images.
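The median Dice scores quoted above summarise per-case overlap between predicted and reference segmentations. As a minimal sketch of how such a figure could be computed (not the challenge's official evaluation code), the snippet below evaluates the Dice coefficient per structure on flattened label maps and takes the median over cases. The label values (1 for VS, 2 for cochlea), the toy data, and the convention that two empty masks score 1.0 are assumptions made for illustration.

```python
from statistics import median
from typing import Sequence

# Assumed label values, for illustration only.
VS_LABEL = 1
COCHLEA_LABEL = 2


def dice_score(pred: Sequence[int], ref: Sequence[int], label: int) -> float:
    """Dice = 2|P n R| / (|P| + |R|) for one label over flattened label maps."""
    if len(pred) != len(ref):
        raise ValueError("prediction and reference must have the same length")
    pred_count = sum(1 for p in pred if p == label)
    ref_count = sum(1 for r in ref if r == label)
    overlap = sum(1 for p, r in zip(pred, ref) if p == label and r == label)
    denom = pred_count + ref_count
    # Assumed convention: if the label is absent from both maps, score 1.0.
    return 1.0 if denom == 0 else 2.0 * overlap / denom


# Toy (prediction, reference) pairs standing in for flattened hrT2 label maps.
cases = [
    ([0, 1, 1, 2, 0, 2], [0, 1, 1, 2, 2, 0]),
    ([1, 1, 0, 0, 2, 2], [1, 0, 0, 0, 2, 2]),
    ([0, 0, 1, 2, 0, 0], [0, 1, 1, 2, 0, 0]),
]

# One Dice value per case and structure, then the median across cases.
vs_scores = [dice_score(p, r, VS_LABEL) for p, r in cases]
cochlea_scores = [dice_score(p, r, COCHLEA_LABEL) for p, r in cases]
median_vs = median(vs_scores)
median_cochlea = median(cochlea_scores)

print(f"median Dice - VS: {median_vs:.3f}; Cochleas: {median_cochlea:.3f}")
```

In practice the label maps would be 3D arrays loaded from image files, and an array library would replace the pure-Python loops; the formula itself is unchanged.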