Unveiling the Power of High-Quality OCT: an Effective Fundus-based Modality Fusion Network for Fovea Localization

Huaqing He,Li Lin,Zhiyuan Cai,Pujin Cheng,Xiaoying Tang
DOI: https://doi.org/10.1109/isbi56570.2024.10635780
2024-01-01
Abstract:Accurately locating the fovea, the central point of the macula, is crucial for the development of computer-aided diagnosis systems in the field of retinal diseases. However, achieving accurate localization can be challenging, especially with poor-quality fundus images. To address this, we propose a multi-modal fusion network that leverages high-quality optical coherence tomography (OCT) for robust fovea lo-calization. We demonstrate that the quality of OCT data is more critical for model performance than merely increasing the quantity of data. Our multi-modal framework demonstrates modality robustness, enabling stable performance even in the absence of the OCT modality during training. By employing the dual representation of Convolution Neural Network and Vision Transformer networks, our approach effectively extracts both global information of the fundus and local information of the macula region, facilitating the interaction and fusion of features. This comprehensive approach enhances the reliability and robustness of retinal image analysis, with a focus on the importance of OCT data quality. Our approach establishes new state-of-the-art results on the GAMMA dataset. We make our code available at https://github.com/HuaqingHe/EFMFuse.
What problem does this paper attempt to address?