Pretrained Deep 2.5D Models for Efficient Predictive Modeling from Retinal OCT

Taha Emre, Marzieh Oghbaie, Arunava Chakravarty, Antoine Rivail, Sophie Riedl, Julia Mai, Hendrik P.N. Scholl, Sobha Sivaprasad, Daniel Rueckert, Andrew Lotery, Ursula Schmidt-Erfurth, Hrvoje Bogunović
DOI: https://doi.org/10.1007/978-3-031-44013-7_14
2023-07-26
Abstract: In the field of medical imaging, 3D deep learning models play a crucial role in building powerful predictive models of disease progression. However, the size of these models presents significant challenges, both in terms of computational resources and data requirements. Moreover, achieving high-quality pretraining of 3D models proves to be even more challenging. To address these issues, hybrid 2.5D approaches provide an effective solution for utilizing 3D volumetric data efficiently using 2D models. Combining 2D and 3D techniques offers a promising avenue for optimizing performance while minimizing memory requirements. In this paper, we explore 2.5D architectures based on a combination of convolutional neural networks (CNNs), long short-term memory (LSTM), and Transformers. In addition, leveraging the benefits of recent non-contrastive pretraining approaches in 2D, we enhanced the performance and data efficiency of 2.5D techniques even further. We demonstrate the effectiveness of architectures and associated pretraining on a task of predicting progression to wet age-related macular degeneration (AMD) within a six-month period on two large longitudinal OCT datasets.
Computer Vision and Pattern Recognition, Machine Learning
What problem does this paper attempt to address?
The paper addresses the problem of efficiently predicting, from retinal optical coherence tomography (OCT) volumes, whether age-related macular degeneration (AMD) will progress to wet AMD. Specifically, the authors aim to develop an effective prognostic model over 3D OCT volumes that predicts whether patients with intermediate AMD (iAMD) will progress to late-stage wet AMD within 6 months.

### Background and Challenges

1. **Limitations of 3D Models**:
   - 3D deep learning models play a significant role in medical imaging, but they require high computational resources and a large amount of training data.
   - Pre-training 3D models is very challenging, especially in the medical field where labeled data is scarce.
2. **Advantages of 2.5D Methods**:
   - 2.5D methods combine 2D and 3D techniques, allowing the use of 3D volumetric data while reducing memory requirements.
   - 2D models can be pre-trained more effectively, either on large-scale natural image datasets or with self-supervised learning (SSL) on domain-specific datasets.

### Research Objectives

- Develop an efficient 2.5D architecture that combines Convolutional Neural Networks (CNN), Long Short-Term Memory networks (LSTM), and Transformers to predict AMD progression from OCT volumetric data.
- Improve model performance and data efficiency under limited data conditions through pre-training and fine-tuning.

### Main Contributions

- **2.5D Architecture**: Explored two 2.5D architectures, namely CNN + BiLSTM and CNN + Transformer, in which a 2D CNN generates one embedding per B-scan and a sequence model combines the embeddings into a volume-level prediction.
- **Pre-training Strategy**: Used Temporally Informed Non-Contrastive Learning (TINC) to pre-train the 2D CNNs, enhancing model performance and data efficiency.
- **Experimental Validation**: Conducted experiments on two large longitudinal OCT datasets, validating the effectiveness and advantages of the 2.5D approach in predicting AMD progression.
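
The CNN + BiLSTM variant listed under Main Contributions can be illustrated with a minimal PyTorch sketch. It is not the authors' implementation: the class names `SliceEncoder` and `CnnBiLstm25D`, the layer sizes, the mean pooling over B-scans, and the input shape are all assumptions chosen to keep the example small. In practice the toy encoder would be replaced by a pretrained 2D backbone.

```python
import torch
import torch.nn as nn


class SliceEncoder(nn.Module):
    """Toy 2D CNN standing in for a pretrained backbone (hypothetical)."""

    def __init__(self, embed_dim: int = 64):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(1, 16, kernel_size=3, stride=2, padding=1),
            nn.ReLU(inplace=True),
            nn.Conv2d(16, 32, kernel_size=3, stride=2, padding=1),
            nn.ReLU(inplace=True),
            nn.AdaptiveAvgPool2d(1),  # -> (N, 32, 1, 1)
        )
        self.proj = nn.Linear(32, embed_dim)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (N, 1, H, W) -> one embedding per B-scan: (N, embed_dim)
        return self.proj(self.features(x).flatten(1))


class CnnBiLstm25D(nn.Module):
    """2.5D model: shared 2D encoder per B-scan, BiLSTM across B-scans."""

    def __init__(self, embed_dim: int = 64, hidden_dim: int = 32):
        super().__init__()
        self.encoder = SliceEncoder(embed_dim)
        self.lstm = nn.LSTM(embed_dim, hidden_dim,
                            batch_first=True, bidirectional=True)
        self.head = nn.Linear(2 * hidden_dim, 1)

    def forward(self, volume: torch.Tensor) -> torch.Tensor:
        # volume: (batch, n_bscans, H, W), single-channel OCT B-scans
        b, s, h, w = volume.shape
        # Fold the B-scan axis into the batch so the 2D CNN sees plain images.
        slices = volume.reshape(b * s, 1, h, w)
        emb = self.encoder(slices).reshape(b, s, -1)   # (b, s, embed_dim)
        seq, _ = self.lstm(emb)                        # (b, s, 2 * hidden_dim)
        # Assumption: average the BiLSTM outputs over B-scans before the head.
        pooled = seq.mean(dim=1)
        return self.head(pooled).squeeze(-1)           # one logit per volume


model = CnnBiLstm25D()
volume = torch.randn(2, 8, 64, 64)  # 2 volumes, 8 B-scans of 64x64 (made up)
logits = model(volume)
print(logits.shape)
```

Because the encoder only ever processes 2D images, its weights can come from any 2D pretraining scheme, which is the property the summary credits for the approach's data efficiency. A CNN + Transformer variant would swap the `nn.LSTM` for a Transformer encoder over the same sequence of B-scan embeddings.
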
### Conclusion

- The 2.5D approach is not only more efficient in terms of memory and label usage but also outperforms larger 3D models when appropriately pre-trained.
- The BiLSTM method remains advantageous under limited and imbalanced 3D data conditions, especially in terms of the PRAUC metric.
- Domain-specific pre-training (TINC) outperforms the use of ImageNet pre-trained weights in all cases, highlighting the importance of domain information for prediction tasks.

These findings provide valuable insights for developing accurate and efficient models for predicting AMD progression.
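
The PRAUC metric mentioned in the conclusion is the area under the precision-recall curve, which is informative when positives (here, eyes that progress) are rare. The sketch below computes average precision, one common step-wise estimate of that area, in plain Python. The function name `average_precision` and the toy labels and scores are illustrative assumptions, not values from the paper, and the code assumes the scores contain no ties.

```python
def average_precision(labels, scores):
    """Step-wise estimate of PR-AUC: mean precision at each true positive.

    labels: iterable of 0/1 ground-truth values.
    scores: iterable of predicted scores (higher = more likely positive).
    Assumes distinct scores; ties are not handled specially.
    """
    labels = list(labels)
    n_pos = sum(labels)
    if n_pos == 0:
        raise ValueError("need at least one positive label")
    # Rank samples from most to least confident.
    ranked = sorted(zip(scores, labels), key=lambda p: p[0], reverse=True)
    true_pos = 0
    total = 0.0
    for rank, (_, label) in enumerate(ranked, start=1):
        if label == 1:
            true_pos += 1
            total += true_pos / rank  # precision at this recall step
    return total / n_pos


# Made-up imbalanced example: 2 positives among 10 samples.
labels = [1, 0, 0, 1, 0, 0, 0, 0, 0, 0]
scores = [0.9, 0.8, 0.7, 0.6, 0.5, 0.4, 0.3, 0.2, 0.1, 0.05]
ap = average_precision(labels, scores)
prevalence = sum(labels) / len(labels)
print(ap, prevalence)  # positives at ranks 1 and 4: (1/1 + 2/4) / 2 = 0.75
```

A classifier that ranks at random scores close to the positive prevalence (0.2 in this toy example), so PRAUC values should be read against that baseline and not against 0.5 as with ROC AUC.
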