Advancing diagnostic performance and clinical usability of neural networks via adversarial training and dual batch normalization

Tianyu Han,Sven Nebelung,Federico Pedersoli,Markus Zimmermann,Maximilian Schulze-Hagen,Michael Ho,Christoph Haarburger,Fabian Kiessling,Christiane Kuhl,Volkmar Schulz,Daniel Truhn
DOI: https://doi.org/10.1038/s41467-021-24464-3
2020-11-26
Abstract:Unmasking the decision-making process of machine learning models is essential for implementing diagnostic support systems in clinical practice. Here, we demonstrate that adversarially trained models can significantly enhance the usability of pathology detection as compared to their standard counterparts. We let six experienced radiologists rate the interpretability of saliency maps in datasets of X-rays, computed tomography, and magnetic resonance imaging scans. Significant improvements were found for our adversarial models, which could be further improved by the application of dual batch normalization. Contrary to previous research on adversarially trained models, we found that the accuracy of such models was equal to standard models when sufficiently large datasets and dual batch norm training were used. To ensure transferability, we additionally validated our results on an external test set of 22,433 X-rays. These findings elucidate that different paths for adversarial and real images are needed during training to achieve state of the art results with superior clinical interpretability.
Machine Learning,Computer Vision and Pattern Recognition,Image and Video Processing
What problem does this paper attempt to address?