Two-layer Ensemble of Deep Learning Models for Medical Image Segmentation

Truong Dang,Tien Thanh Nguyen,John McCall,Eyad Elyan,Carlos Francisco Moreno-García
DOI: https://doi.org/10.1007/s12559-024-10257-5
IF: 4.89
2024-01-31
Cognitive Computation
Abstract:Abstract One of the most important areas in medical image analysis is segmentation, in which raw image data is partitioned into structured and meaningful regions to gain further insights. By using Deep Neural Networks (DNN), AI-based automated segmentation algorithms can potentially assist physicians with more effective imaging-based diagnoses. However, since it is difficult to acquire high-quality ground truths for medical images and DNN hyperparameters require significant manual tuning, the results by DNN-based medical models might be limited. A potential solution is to combine multiple DNN models using ensemble learning. We propose a two-layer ensemble of deep learning models in which the prediction of each training image pixel made by each model in the first layer is used as the augmented data of the training image for the second layer of the ensemble. The prediction of the second layer is then combined by using a weight-based scheme which is found by solving linear regression problems. To the best of our knowledge, our paper is the first work which proposes a two-layer ensemble of deep learning models with an augmented data technique in medical image segmentation. Experiments conducted on five different medical image datasets for diverse segmentation tasks show that proposed method achieves better results in terms of several performance metrics compared to some well-known benchmark algorithms. Our proposed two-layer ensemble of deep learning models for segmentation of medical images shows effectiveness compared to several benchmark algorithms. The research can be expanded in several directions like image classification.
computer science, artificial intelligence,neurosciences
What problem does this paper attempt to address?
The paper primarily proposes a new solution to the problem of medical image segmentation. Specifically, the proposed solution in the paper aims to address the following key issues: 1. **Scarcity of high-quality annotated data**: In the field of medical imaging, obtaining high-quality annotated data is very difficult because it requires a high level of expertise. Compared to common image datasets (such as ImageNet), most medical image datasets have fewer annotated samples. 2. **Difficulty in tuning deep neural network (DNN) hyperparameters**: The performance of DNNs often relies on carefully tuned hyperparameter settings, which usually need to be determined through trial and error, increasing the complexity and uncertainty of model development. 3. **Limitations of a single model**: A single DNN model may encounter limitations when handling medical image segmentation tasks, especially in the absence of a large amount of training data. To address the above challenges, the paper proposes a two-layer ensemble deep learning model. The main contributions of this model include: - Proposing a heterogeneous ensemble deep segmentation model for medical image segmentation problems. - Using the prediction results of the deep segmentation models in each ensemble on the training image pixels as additional channels added to the original training images to create the training data for the second layer ensemble. - Proposing a weight-based scheme to combine the prediction results in the second layer ensemble. These weights are obtained by solving a linear regression problem to reflect the relationship between the prediction results and the true labels of the training observations. - Experiments conducted on five different medical image datasets show that the proposed method outperforms some known benchmark algorithms on various performance metrics. In summary, the goal of this paper is to improve the accuracy of medical image segmentation tasks by proposing a novel two-layer ensemble framework, especially in the context of limited datasets and difficult hyperparameter tuning. This approach leverages the advantages of ensemble learning to significantly improve segmentation results and has broad application potential.