OCTDL: Optical Coherence Tomography Dataset for Image-Based Deep Learning Methods

Mikhail Kulyabin,Aleksei Zhdanov,Anastasia Nikiforova,Andrey Stepichev,Anna Kuznetsova,Mikhail Ronkin,Vasilii Borisov,Alexander Bogachev,Sergey Korotkich,Paul A. Constable,Andreas Maier
DOI: https://doi.org/10.1038/s41597-024-03182-7
2024-04-11
Scientific Data
Abstract:Abstract Optical coherence tomography (OCT) is a non-invasive imaging technique with extensive clinical applications in ophthalmology. OCT enables the visualization of the retinal layers, playing a vital role in the early detection and monitoring of retinal diseases. OCT uses the principle of light wave interference to create detailed images of the retinal microstructures, making it a valuable tool for diagnosing ocular conditions. This work presents an open-access OCT dataset (OCTDL) comprising over 2000 OCT images labeled according to disease group and retinal pathology. The dataset consists of OCT records of patients with Age-related Macular Degeneration (AMD), Diabetic Macular Edema (DME), Epiretinal Membrane (ERM), Retinal Artery Occlusion (RAO), Retinal Vein Occlusion (RVO), and Vitreomacular Interface Disease (VID). The images were acquired with an Optovue Avanti RTVue XR using raster scanning protocols with dynamic scan length and image resolution. Each retinal b-scan was acquired by centering on the fovea and interpreted and cataloged by an experienced retinal specialist. In this work, we applied Deep Learning classification techniques to this new open-access dataset.
multidisciplinary sciences
What problem does this paper attempt to address?
The main problem this paper attempts to address is the creation of an open-access Optical Coherence Tomography (OCT) dataset (OCTDL) to support the application of image-based deep learning methods in the diagnosis and monitoring of ophthalmic diseases. Specifically, this dataset contains over 2000 OCT images, annotated according to different pathological conditions, covering various eye diseases such as Age-related Macular Degeneration (AMD), Diabetic Macular Edema (DME), Epiretinal Membrane (ERM), Retinal Artery Occlusion (RAO), Retinal Vein Occlusion (RVO), and Vitreomacular Interface Disease (VID). ### Main Issues: 1. **Creation of the Dataset**: Construct a high-quality, large-scale OCT image dataset so that researchers and clinicians can use deep learning techniques for the automatic detection and classification of eye diseases. 2. **Diversity and Annotation of the Dataset**: Ensure the dataset includes a variety of different ophthalmic diseases, and that each image is meticulously annotated for training and testing deep learning models. 3. **Open Access to the Dataset**: Publicly release the dataset to promote scientific research and algorithm development, particularly in the early diagnosis and monitoring of retinal diseases. ### Background and Significance: - **Importance of OCT Technology**: OCT is a non-invasive imaging technology that is crucial in ophthalmic clinical applications, especially in the early detection and monitoring of retinal diseases. - **Limitations of Existing Datasets**: Although existing OCT datasets are numerous, they often suffer from incomplete annotations and limited disease types, which restricts the performance and generalization ability of deep learning models. - **Advantages of the New Dataset**: The OCTDL dataset not only includes a large number of high-quality OCT images but also provides detailed annotation information, which helps improve the accuracy and reliability of deep learning models. ### Methods and Results: - **Data Collection**: OCT images were acquired using the Optovue Avanti RTVue XR device through a dynamic scanning protocol, with each image focused on the macular region and interpreted and classified by experienced retinal specialists. - **Data Processing**: Images were preprocessed and augmented, including random cropping, horizontal and vertical flipping, rotation, translation, and Gaussian blurring, to increase the robustness of the model. - **Model Evaluation**: The performance of the OCTDL dataset was evaluated using two classic deep learning architectures, VGG16 and ResNet50, for classification tasks, and the results showed that these models performed well on the OCTDL dataset. ### Conclusion: The release of the OCTDL dataset provides an important resource for the application of image-based deep learning methods in the diagnosis and monitoring of ophthalmic diseases. The open access to this dataset helps advance research in the related field and improves the early diagnosis and treatment outcomes of retinal diseases.