Efficient synthesis of 3D MR images for schizophrenia diagnosis classification with generative adversarial networks

Sebastian King, Yasmin Hollenbenders, Alexandra Reichenbach
DOI: https://doi.org/10.1101/2024.06.01.24308319
2024-06-04
Abstract: Schizophrenia and other psychiatric disorders can greatly benefit from objective decision support in diagnosis and therapy. Machine learning approaches based on neuroimaging, e.g. magnetic resonance imaging (MRI), have the potential to serve this purpose. However, the medical data sets these algorithms can be trained on are often rather small, leading to overfitting, and the resulting models therefore cannot be transferred into a clinical setting. The generation of synthetic images from real data is a promising approach to overcome this shortcoming. Due to the small data set size and the size and complexity of medical images, i.e. their three-dimensional nature, those algorithms are challenged on several levels. We develop four generative adversarial network (GAN) architectures that tackle these challenges and evaluate them systematically with a data set of 193 MR images of schizophrenia patients and healthy controls. The best architecture, a GAN with spectral normalization regulation and an additional encoder (α-SN-GAN), is then extended with an auxiliary classifier into an ensemble of networks capable of generating distinct image sets for the two diagnostic categories. The synthetic images increase the accuracy of a diagnostic classifier from a baseline accuracy of around 61% to 79%. This novel end-to-end pipeline for schizophrenia diagnosis demonstrates a data- and memory-efficient approach to support clinical decision-making that can also be transferred to support other psychiatric disorders.
What problem does this paper attempt to address?
The paper addresses how to synthesize three-dimensional magnetic resonance (3D MR) images with generative adversarial networks (GANs) so that a schizophrenia diagnosis classifier can be trained more accurately. The researchers face the following challenges:

1. **Small dataset size**: Medical imaging datasets are usually small, so deep learning models trained on them tend to overfit and do not transfer well to clinical settings.
2. **Limitations of data augmentation techniques**: Traditional data augmentation methods (such as affine transformations) have limited effectiveness on MR images, so new techniques are needed to generate high-quality synthetic images.
3. **Complexity of 3D images**: 3D MR images are large and structurally complex, which places high demands on generative models.

To address these challenges, the researchers developed four GAN architectures and evaluated them systematically on a data set of 193 MR images of schizophrenia patients and healthy controls. They selected the best one, a GAN with spectral normalization and an additional encoder (α-SN-GAN), and extended it with an auxiliary classifier into an ensemble of networks that generates distinct image sets for the two diagnostic categories. Training with the synthetic images increased the accuracy of the diagnostic classifier from a baseline of approximately 61% to 79%. The authors present this as a data- and memory-efficient approach to supporting clinical decision-making that could also be transferred to other psychiatric disorders.
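The summary names spectral normalization but does not show how it works. The following is a minimal sketch of the general technique, not the authors' implementation: it estimates the largest singular value of a weight matrix by power iteration and divides the weights by it, so the layer's spectral norm is approximately 1. The function names, the iteration count, and the use of a plain 2D list as the weight matrix are assumptions made for illustration; in a 3D convolutional GAN the kernel would first be flattened to a matrix with one row per output channel.

```python
import math
import random


def matvec(matrix, vec):
    """Multiply a 2D list (rows x cols) by a vector of length cols."""
    return [sum(a * b for a, b in zip(row, vec)) for row in matrix]


def normalize(vec):
    """Scale a vector to unit Euclidean length."""
    norm = math.sqrt(sum(a * a for a in vec))
    return [a / norm for a in vec]


def spectral_normalize(weight, n_iter=100, seed=0):
    """Return (weight / sigma, sigma), sigma being the estimated top singular value.

    Hypothetical helper for illustration; n_iter and seed are assumed defaults.
    """
    rng = random.Random(seed)
    weight_t = [list(col) for col in zip(*weight)]  # transpose
    u = normalize([rng.gauss(0.0, 1.0) for _ in weight])
    for _ in range(n_iter):
        v = normalize(matvec(weight_t, u))  # right singular vector estimate
        wv = matvec(weight, v)
        u = normalize(wv)                   # left singular vector estimate
    sigma = math.sqrt(sum(a * a for a in wv))  # ||W v|| approximates sigma_max
    scaled = [[a / sigma for a in row] for row in weight]
    return scaled, sigma


if __name__ == "__main__":
    w = [[3.0, 0.0], [0.0, 1.0]]  # singular values are 3 and 1
    w_sn, sigma = spectral_normalize(w)
    print(sigma)
    print(w_sn)
```

Deep learning frameworks ship their own versions of this operation (PyTorch, for example, provides `torch.nn.utils.spectral_norm`), which usually run a single power-iteration step per training update rather than iterating to convergence as this sketch does.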