Development and application of a deep learning-based comprehensive early diagnostic model for chronic obstructive pulmonary disease

Zecheng Zhu, Shunjin Zhao, Jiahui Li, Yuting Wang, Luopiao Xu, Yubing Jia, Zihan Li, Wenyuan Li, Gang Chen, Xifeng Wu
DOI: https://doi.org/10.1186/s12931-024-02793-3
IF: 5.8
2024-04-18
Respiratory Research
Abstract:
Background: Chronic obstructive pulmonary disease (COPD) is a frequently diagnosed yet treatable condition, provided it is identified early and managed effectively. This study aims to develop an advanced COPD diagnostic model by integrating deep learning and radiomics features.
Methods: We utilized a dataset comprising CT images from 2,983 participants, of which 2,317 participants also provided epidemiological data through questionnaires. Deep learning features were extracted using a Variational Autoencoder, and radiomics features were obtained using the PyRadiomics package. Multi-Layer Perceptrons were used to construct models based on deep learning and radiomics features independently, as well as a fusion model integrating both. Subsequently, epidemiological questionnaire data were incorporated to establish a more comprehensive model. The diagnostic performance of standalone models, the fusion model and the comprehensive model was evaluated and compared using metrics including accuracy, precision, recall, F1-score, Brier score, receiver operating characteristic curves, and area under the curve (AUC).
Results: The fusion model exhibited outstanding performance with an AUC of 0.952, surpassing the standalone models based solely on deep learning features (AUC = 0.844) or radiomics features (AUC = 0.944). Notably, the comprehensive model, incorporating deep learning features, radiomics features, and questionnaire variables, demonstrated the highest diagnostic performance among all models, yielding an AUC of 0.971.
Conclusion: We developed and implemented a data fusion strategy to construct a state-of-the-art COPD diagnostic model integrating deep learning features, radiomics features, and questionnaire variables. Our data fusion strategy proved effective, and the model can be easily deployed in clinical settings.
Trial registration: Not applicable.
This study is NOT a clinical trial; it does not report the results of a health care intervention on human participants.
respiratory system
What problem does this paper attempt to address?
The paper addresses the challenge of diagnosing Chronic Obstructive Pulmonary Disease (COPD) early. Specifically, it aims to develop an advanced COPD diagnostic model that integrates deep learning and radiomics features to improve diagnostic accuracy and reliability. Traditional diagnostic methods, such as pulmonary function tests, have limited sensitivity and specificity, especially in the early stages of COPD. Therefore, the research team extracted deep learning and radiomics features from CT images to construct a fusion model, and then incorporated epidemiological questionnaire data to establish a comprehensive diagnostic model.

### Main Objectives of the Paper:

1. **Develop an advanced COPD diagnostic model**: Improve the accuracy of early COPD diagnosis by integrating deep learning and radiomics features.
2. **Evaluate the performance of different models**: Compare the diagnostic performance of standalone models (based on deep learning or radiomics features), the fusion model (combining deep learning and radiomics features), and the comprehensive model (combining deep learning features, radiomics features, and questionnaire data).
3. **Validate the effectiveness of the data fusion strategy**: Verify experimentally that data fusion improves COPD diagnostic performance.

### Research Background:

- **Prevalence and harm of COPD**: COPD is a common chronic lung disease that causes approximately 3.23 million deaths globally each year. In China, the prevalence among people over 40 years old is as high as 13.7%.
- **Importance of early diagnosis**: Early diagnosis of COPD is crucial for effective management and treatment, but current diagnostic methods have limitations, such as the low sensitivity and specificity of pulmonary function tests.
- **Advantages of CT imaging**: CT imaging shows higher sensitivity and specificity in the early diagnosis of COPD, but traditional manual reading is subjective and time-consuming.

### Methods:

- **Dataset**: The study used CT image data from 2,983 participants, of which 2,317 participants also provided epidemiological questionnaire data.
- **Feature extraction**: A Variational Autoencoder (VAE) was used to extract deep learning features, and the PyRadiomics package was used to extract radiomics features.
- **Model construction**: Multi-Layer Perceptrons were used to build standalone models on the deep learning features and on the radiomics features, a fusion model combining both, and a comprehensive model that also incorporated the epidemiological questionnaire data.
- **Performance evaluation**: The models were evaluated using accuracy, precision, recall, F1 score, Brier score, the Receiver Operating Characteristic (ROC) curve, and the Area Under the Curve (AUC).

### Results:

- **Performance of the fusion model**: The fusion model reached an AUC of 0.952, higher than the standalone models based on deep learning features (AUC = 0.844) or radiomics features (AUC = 0.944).
- **Performance of the comprehensive model**: The comprehensive model (combining deep learning features, radiomics features, and questionnaire data) showed the highest diagnostic performance, with an AUC of 0.971.

### Conclusion:

The research team developed and implemented a data fusion strategy to construct an advanced COPD diagnostic model that combines deep learning features, radiomics features, and questionnaire data. The authors report that the data fusion strategy proved effective and that the model can be easily deployed in clinical settings.
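
### Illustrative Sketch:

The fusion and evaluation steps summarized above can be illustrated with a minimal, standard-library Python sketch. It is not the authors' implementation. The function names (`zscore_columns`, `fuse_features`, `brier_score`, `roc_auc`, `classification_metrics`) are hypothetical, and the per-column z-score scaling and the 0.5 decision threshold are assumptions not stated in the source. The VAE, PyRadiomics, and Multi-Layer Perceptron stages are not reproduced here; the sketch only shows how per-participant feature blocks could be concatenated and how several of the listed metrics are defined.

```python
import math


def zscore_columns(rows):
    """Standardize each column to zero mean and unit variance (assumed preprocessing)."""
    scaled_cols = []
    for col in zip(*rows):
        mean = sum(col) / len(col)
        var = sum((v - mean) ** 2 for v in col) / len(col)
        std = math.sqrt(var) or 1.0  # constant column: avoid division by zero
        scaled_cols.append([(v - mean) / std for v in col])
    return [list(row) for row in zip(*scaled_cols)]


def fuse_features(*blocks):
    """Feature-level fusion: scale each block, then concatenate row by row."""
    n = len(blocks[0])
    if any(len(block) != n for block in blocks):
        raise ValueError("every block needs exactly one row per participant")
    scaled = [zscore_columns(block) for block in blocks]
    return [sum((block[i] for block in scaled), []) for i in range(n)]


def brier_score(y_true, y_prob):
    """Mean squared difference between predicted probability and the 0/1 label."""
    return sum((p - y) ** 2 for y, p in zip(y_true, y_prob)) / len(y_true)


def roc_auc(y_true, y_prob):
    """AUC as the fraction of positive/negative pairs ranked correctly (ties count 0.5)."""
    pos = [p for y, p in zip(y_true, y_prob) if y == 1]
    neg = [p for y, p in zip(y_true, y_prob) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative label")
    wins = sum(1.0 if a > b else 0.5 if a == b else 0.0 for a in pos for b in neg)
    return wins / (len(pos) * len(neg))


def classification_metrics(y_true, y_prob, threshold=0.5):
    """Accuracy, precision, recall and F1 at a fixed threshold (0.5 is an assumption)."""
    y_pred = [1 if p >= threshold else 0 for p in y_prob]
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for y, h in pairs if y == 1 and h == 1)
    fp = sum(1 for y, h in pairs if y == 0 and h == 1)
    fn = sum(1 for y, h in pairs if y == 1 and h == 0)
    tn = sum(1 for y, h in pairs if y == 0 and h == 0)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return {
        "accuracy": (tp + tn) / len(y_true),
        "precision": precision,
        "recall": recall,
        "f1": f1,
    }


if __name__ == "__main__":
    # Toy values only; they are not data from the study.
    deep_features = [[1.0, 2.0], [3.0, 4.0]]   # stand-in for VAE latent features
    radiomics_features = [[10.0], [20.0]]      # stand-in for PyRadiomics features
    print(fuse_features(deep_features, radiomics_features))

    labels = [1, 0, 1, 0]
    probabilities = [0.9, 0.2, 0.4, 0.6]       # stand-in for classifier outputs
    print(roc_auc(labels, probabilities), brier_score(labels, probabilities))
    print(classification_metrics(labels, probabilities))
```

In this sketch the comprehensive model would correspond to passing a third block of encoded questionnaire variables to `fuse_features`, for the participants who have all three data sources. How the paper actually encodes and combines those variables is not described in this summary.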