Segmentation-based cardiomegaly detection based on semi-supervised estimation of cardiothoracic ratio

Patrick Thiam, Christopher Kloth, Daniel Blaich, Andreas Liebold, Meinrad Beer, Hans A. Kestler
DOI: https://doi.org/10.1038/s41598-024-56079-1
IF: 4.6
2024-03-10
Scientific Reports
Abstract: The successful integration of neural networks in a clinical setting is still uncommon despite major successes achieved by artificial intelligence in other domains. This is mainly due to the black box characteristic of most optimized models and the undetermined generalization ability of the trained architectures. The current work tackles both issues in the radiology domain by focusing on developing an effective and interpretable cardiomegaly detection architecture based on segmentation models. The architecture consists of two distinct neural networks performing the segmentation of both cardiac and thoracic areas of a radiograph. The respective segmentation outputs are subsequently used to estimate the cardiothoracic ratio, and the corresponding radiograph is classified as a case of cardiomegaly based on a given threshold. Due to the scarcity of pixel-level labeled chest radiographs, both segmentation models are optimized in a semi-supervised manner. This results in a significant reduction in the costs of manual annotation. The resulting segmentation outputs significantly improve the interpretability of the architecture's final classification results. The generalization ability of the architecture is assessed in a cross-domain setting. The assessment shows the effectiveness of the semi-supervised optimization of the segmentation models and the robustness of the ensuing classification architecture.
multidisciplinary sciences
What problem does this paper attempt to address?
The paper aims to improve the efficiency and accuracy of cardiomegaly detection while making the detection results more interpretable. Specifically, it proposes a segmentation-based, semi-supervised method that estimates the cardiothoracic ratio (CTR) and detects cardiomegaly automatically by comparing the CTR against a threshold. The method is intended to overcome the performance degradation of existing methods in cross-domain settings and to reduce the dependence on large amounts of manually labeled data, thereby lowering costs and improving the generalization ability of the model.

### Main problems

1. **Improve the efficiency and accuracy of cardiomegaly detection**:
   - Existing methods for cardiomegaly detection usually rely on expert knowledge to calculate and interpret the cardiothoracic ratio, which is time-consuming and susceptible to subjective factors.
   - The paper proposes a segmentation-based deep-learning method that detects cardiomegaly by calculating the cardiothoracic ratio automatically, thereby improving the efficiency and accuracy of detection.
2. **Enhance the interpretability of detection results**:
   - Many existing deep-learning models are regarded as "black boxes": their internal operating mechanisms are opaque, so their results are difficult to interpret.
   - The paper makes the final classification results more intuitive and interpretable by using segmentation models to generate segmentation masks for the heart and thoracic regions.
3. **Reduce the dependence on a large amount of manually labeled data**:
   - Manually annotating chest X-rays at the pixel level is time-consuming and costly.
   - The paper adopts a semi-supervised learning method that uses a small amount of labeled data and a large amount of unlabeled data to optimize the segmentation models, thereby significantly reducing the cost of manual labeling.
4. **Improve the generalization ability of the model**:
   - Existing methods for cardiomegaly detection often perform poorly in cross-domain settings; that is, their performance declines significantly on datasets other than the one they were trained on.
   - The paper verifies the robustness and generalization ability of the proposed method under different data distributions by evaluating the model on multiple different datasets.

### Method overview

The architecture proposed in the paper consists of two main parts:

1. **Segmentation model**:
   - Two neural networks segment the heart and thoracic regions respectively.
   - The segmentation outputs are used to calculate the cardiothoracic ratio (CTR).
2. **Semi-supervised learning**:
   - A small amount of labeled data and a large amount of unlabeled data are used to optimize the segmentation models.
   - The Cross-Consistency Training method is used to enhance the generalization ability and robustness of the models.

### Formulas

- **Calculation formula of the cardiothoracic ratio (CTR)**:
  \[ \text{CTR}=\frac{\text{Maximum horizontal diameter of the heart}}{\text{Maximum horizontal diameter of the thorax}} \]
- **Classification threshold**:
  \[ \text{Output}=\begin{cases} \text{Cardiomegaly}&\text{if }\text{CTR}>\pi\\ \text{Normal}&\text{otherwise} \end{cases} \]
  Here \(\pi\) denotes the given classification threshold, not the mathematical constant.
- **Loss function**:
  - Supervised loss \(L_S\):
    \[ L_S = \frac{1}{bs_l}\sum_{i = 1}^{bs_l}\left(H(y_l^i,\hat{y}_l^i)+\text{dice}(y_l^i,\hat{y}_l^i)\right) \]
  - Unsupervised loss \(L_U\):
    \[ L_U=\frac{1}{k\times bs_u}\sum_{j = 1}^{bs_u}\sum_{p = 1}^{k}d(\hat{y}_u^j,\hat{y}_{u,p}^j) \]
  - Total loss \(L\):
    \[ L = L_S+\omega_U L_U \]
  - Pixel-level squared error \(d\)
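
As a rough illustration of the formulas above, the following Python sketch computes a CTR from two binary masks, applies the threshold rule, and evaluates the supervised, unsupervised, and total losses on flattened pixel lists. It is not the authors' implementation, and several choices are assumptions made only for this example: every function name, the list-based mask representation, measuring the horizontal diameter as the bounding-box width of the mask, the default threshold of 0.5 (a commonly used clinical cut-off that the summary does not state), reading \(H\) as binary cross-entropy, reading \(\text{dice}\) as one minus the Dice coefficient, reading \(d\) as the mean squared pixel difference, and treating \(k\) as the number of auxiliary predictions per unlabeled sample.

```python
import math
from typing import Sequence

Mask = Sequence[Sequence[int]]   # rows of 0/1 pixels (hypothetical representation)
Pixels = Sequence[float]         # one flattened mask or probability map


def horizontal_extent(mask: Mask) -> int:
    """Width between the leftmost and rightmost foreground columns (assumed diameter)."""
    cols = [c for row in mask for c, v in enumerate(row) if v]
    return max(cols) - min(cols) + 1 if cols else 0


def cardiothoracic_ratio(heart: Mask, thorax: Mask) -> float:
    """CTR = horizontal diameter of the heart / horizontal diameter of the thorax."""
    thorax_width = horizontal_extent(thorax)
    if thorax_width == 0:
        raise ValueError("thorax mask is empty")
    return horizontal_extent(heart) / thorax_width


def classify(ctr: float, threshold: float = 0.5) -> str:
    """Threshold rule; the default 0.5 is an assumption, not taken from the paper."""
    return "Cardiomegaly" if ctr > threshold else "Normal"


def cross_entropy(y: Pixels, p: Pixels, eps: float = 1e-7) -> float:
    """Mean binary cross-entropy over pixels (assumed meaning of H)."""
    total = 0.0
    for t, q in zip(y, p):
        q = min(max(q, eps), 1.0 - eps)  # clip to avoid log(0)
        total -= t * math.log(q) + (1.0 - t) * math.log(1.0 - q)
    return total / len(y)


def dice_loss(y: Pixels, p: Pixels, eps: float = 1e-7) -> float:
    """One minus the soft Dice coefficient (assumed meaning of dice)."""
    inter = sum(t * q for t, q in zip(y, p))
    return 1.0 - (2.0 * inter + eps) / (sum(y) + sum(p) + eps)


def squared_error(a: Pixels, b: Pixels) -> float:
    """Mean squared pixel difference (assumed form of d)."""
    return sum((x - z) ** 2 for x, z in zip(a, b)) / len(a)


def supervised_loss(targets: Sequence[Pixels], preds: Sequence[Pixels]) -> float:
    """L_S: average of H + dice over the labeled batch."""
    per_sample = [cross_entropy(y, p) + dice_loss(y, p) for y, p in zip(targets, preds)]
    return sum(per_sample) / len(per_sample)


def unsupervised_loss(main: Sequence[Pixels], aux: Sequence[Sequence[Pixels]]) -> float:
    """L_U: average of d between each main prediction and its k auxiliary predictions."""
    k = len(aux[0])
    total = sum(squared_error(m, a) for m, branch in zip(main, aux) for a in branch)
    return total / (k * len(main))


def total_loss(l_s: float, l_u: float, omega_u: float) -> float:
    """L = L_S + omega_U * L_U."""
    return l_s + omega_u * l_u


if __name__ == "__main__":
    heart = [[0, 0, 0, 0, 0, 0], [0, 1, 1, 1, 1, 0]]
    thorax = [[1, 1, 1, 1, 1, 1], [1, 1, 1, 1, 1, 1]]
    ctr = cardiothoracic_ratio(heart, thorax)
    print(round(ctr, 3), classify(ctr))
```

Because the masks themselves drive the ratio, a reader can check a classification by looking at the two widths that produced it, which is the interpretability argument the summary makes for the segmentation-based design.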