A label-free and data-free training strategy for vasculature segmentation in serial sectioning OCT data

Etienne Chollet, Yael Balbastre, Caroline Magnain, Bruce Fischl, Hui Wang
2024-05-22
Abstract: Serial sectioning Optical Coherence Tomography (sOCT) is a high-throughput, label-free microscopic imaging technique that is becoming increasingly popular to study post-mortem neurovasculature. Quantitative analysis of the vasculature requires highly accurate segmentation; however, sOCT has a low signal-to-noise ratio and displays a wide range of contrasts and artifacts that depend on acquisition parameters. Furthermore, labeled data is scarce and extremely time-consuming to generate. Here, we leverage synthetic datasets of vessels to train a deep learning segmentation model. We construct the vessels with semi-realistic splines that simulate the vascular geometry and compare our model with realistic vascular labels generated by constrained constructive optimization. Both approaches yield similar Dice scores, although with very different false positive and false negative rates. This method addresses the complexity inherent in OCT images and paves the way for more accurate and efficient analysis of neurovascular structures.
Image and Video Processing, Computer Vision and Pattern Recognition, Machine Learning
What problem does this paper attempt to address?
The paper asks how to segment vasculature in serial sectioning optical coherence tomography (sOCT) data without manual labels and without real training data. Specifically, the authors aim to overcome the following challenges:

1. **Low signal-to-noise ratio and complex noise**: sOCT images have a low signal-to-noise ratio and, depending on the acquisition parameters, show a wide range of contrasts and artifacts. These factors make it difficult for traditional knowledge-based methods (such as Hessian filtering, morphological operations, or region-growing techniques) to segment blood vessels effectively.
2. **Scarcity of labeled data**: High-quality labeled data is scarce and time-consuming to generate. Convolutional neural network (CNN) models therefore lack diverse training data, which limits their ability to generalize.

To solve these problems, the authors propose training on synthetic datasets, built from the following components:

- **Spline-based label synthesis**: Generate a vascular tree by randomly drawing cubic splines in 3D space, and introduce branch points to simulate more complex vascular geometries.
- **Label synthesis based on constrained constructive optimization (CCO)**: Use the existing CCO method to generate more realistic vascular labels.
- **Domain-randomized OCT synthesis**: Generate textures specific to blood vessels and brain parenchyma, combined with random intensity non-uniformity and speckle noise, to simulate the characteristics of real OCT images.

With this method, the authors aim to train an accurate and efficient vascular segmentation model without real labeled data, and thereby improve the quantitative analysis of neurovascular structures.

### Key Formulas

The article does not centre on mathematical derivations, but it does mention the loss function used for model training, the Dice loss proposed by Tilborghs et al.:

\[ \text{Dice Loss} = 1-\frac{2\sum_i P_i G_i+\epsilon}{\sum_i P_i^2+\sum_i G_i^2+\epsilon} \]

where \(P_i\) is the predicted value at voxel \(i\), \(G_i\) is the true value at voxel \(i\), and \(\epsilon\) is a very small constant that avoids division by zero.

### Summary

This study trains a deep learning model on synthetic datasets to segment vasculature in sOCT data without real labels or real training data, with the goal of improving the accuracy and efficiency of neurovascular analysis.
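
As a rough illustration of the Dice loss quoted under Key Formulas, the sketch below evaluates it on flattened lists of voxel values using only the Python standard library. The function name `soft_dice_loss` and the default `eps=1e-6` are assumptions made for this example; this is not the authors' implementation, which would normally operate on tensors inside a deep learning framework.

```python
from typing import Sequence


def soft_dice_loss(pred: Sequence[float],
                   target: Sequence[float],
                   eps: float = 1e-6) -> float:
    """Dice loss with squared terms in the denominator, over flattened voxels.

    pred   : predicted probabilities P_i, one per voxel
    target : ground-truth values G_i (0 or 1), one per voxel
    eps    : small constant that avoids division by zero (assumed value)
    """
    if len(pred) != len(target):
        raise ValueError("pred and target must have the same number of voxels")

    # Numerator: 2 * sum_i(P_i * G_i) + eps
    overlap = sum(p * g for p, g in zip(pred, target))
    # Denominator: sum_i(P_i^2) + sum_i(G_i^2) + eps
    denominator = sum(p * p for p in pred) + sum(g * g for g in target)

    return 1.0 - (2.0 * overlap + eps) / (denominator + eps)


if __name__ == "__main__":
    labels = [0.0, 1.0, 1.0, 0.0]
    print(soft_dice_loss(labels, labels))                # perfect overlap
    print(soft_dice_loss([1.0, 0.0, 0.0, 1.0], labels))  # no overlap
```

A perfect prediction gives a loss of zero and a prediction with no overlap gives a loss close to one. Because \(\epsilon\) appears in both numerator and denominator, an empty prediction on an empty label patch also gives a loss of zero rather than an undefined value.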