Investigation of Energy-efficient AI Model Architectures and Compression Techniques for "Green" Fetal Brain Segmentation

Szymon Mazurek, Monika Pytlarz, Sylwia Malec, Alessandro Crimi
2024-04-03
Abstract: Artificial intelligence has contributed to advancements across various industries. However, the rapid growth of artificial intelligence technologies also raises concerns about their environmental impact, due to the carbon footprint associated with training computational models. Fetal brain segmentation in medical imaging is challenging due to the small size of the fetal brain and the limited image quality of fast 2D sequences. Deep neural networks are a promising method to overcome this challenge. In this context, the construction of larger models requires extensive data and computing power, leading to high energy consumption. Our study aims to explore model architectures and compression techniques that promote energy efficiency by optimizing the trade-off between accuracy and energy consumption through various strategies such as lightweight network design, architecture search, and optimized distributed training tools. We have identified several effective strategies, including optimization of data loading, modern optimizers, distributed training strategy implementation, and reduced-precision floating-point operations with light model architectures, while tuning parameters according to available computer resources. Our findings demonstrate that these methods lead to satisfactory model performance with low energy consumption during deep neural network training for medical image segmentation.
Image and Video Processing, Artificial Intelligence, Computer Vision and Pattern Recognition, Machine Learning, Performance
What problem does this paper attempt to address?
### Problems the Paper Aims to Solve

This paper aims to explore efficient artificial intelligence model architectures and compression techniques for fetal brain segmentation to achieve "green" medical image processing. Specifically, the research focuses on optimizing model architecture, data loading, training strategies, and other methods to reduce energy consumption while ensuring model accuracy, thereby reducing the environmental impact of deep neural network training.

### Background and Challenges

1. **Challenges of Fetal Brain Segmentation**:
   - The fetal brain is small in volume, and the image quality is limited.
   - The quality of fast 2D sequence images is poor, increasing the difficulty of segmentation.
   - Publicly available datasets are limited, and the image acquisition parameters are inconsistent, affecting the training effectiveness of deep learning algorithms.
2. **Need for "Green" Deep Learning**:
   - The complexity of current deep learning models is continuously increasing, leading to a significant rise in computational load and energy consumption.
   - High energy consumption not only brings economic burdens but also has negative environmental impacts.
   - In resource-limited or high-energy-cost situations (such as developing countries or battery-dependent portable devices), lightweight models are needed to achieve efficient medical image analysis.

### Research Objectives

- **Evaluate the Effectiveness of Different Optimization Techniques**: Conduct a comprehensive evaluation of various energy-saving techniques on model performance and energy consumption from aspects such as model architecture selection, data usage efficiency, and training acceleration.
- **Develop a Low-Energy Fetal Brain Segmentation Model**: Achieve a balance of high accuracy and low energy consumption through a series of optimization strategies, ultimately selecting the best model architecture and optimization methods.
### Main Contributions

- **Systematic Evaluation**: Unlike existing solutions, this paper does not focus on a single category (such as architecture design or hardware acceleration) but comprehensively examines the combined effects of multiple optimization techniques.
- **Practical Application Value**: The research results help promote advanced medical image analysis technology in resource-constrained environments, fostering the development of sustainable computing.

### Methods and Experiments

- **Baseline Model**: U-Net is selected as the baseline model for preprocessing and model evaluation.
- **Optimization Techniques**: Include data loading caching, modern optimizers, distributed training strategies, low-precision floating-point operations, etc.
- **Model Architecture**: Evaluated various lightweight network architectures, such as MobileNetV3, MicroNet, EfficientNet, Squeeze-UNet, etc.
- **Performance Evaluation**: Assessed the effects of different optimization methods using the Dice coefficient and energy consumption metrics.

### Results and Discussion

- **Initial Configuration and Model Selection**: Attention-Squeeze-UNet significantly reduced training time while maintaining high accuracy, becoming the foundational model for subsequent optimization.
- **Effectiveness of Optimization Techniques**: Techniques such as data caching, the Novograd optimizer, and automatic learning rate adjustment significantly improved training speed and energy efficiency.
- **Final Model Selection**: Attention-Squeeze-UNet achieved a high Dice score on the test dataset and had the best energy consumption per training epoch.

### Conclusions and Recommendations

- **Data Loading Optimization**: Configure the data loading pipeline reasonably, especially using caching when memory resources are sufficient.
- **Optimizer Selection**: Modern optimizers perform comparably to traditional optimizers, occupy less memory, and are more robust in hyperparameter selection.
- **Distributed Strategy and Precision Optimization**: Optimal distributed training strategies and low-precision floating-point operations can significantly increase throughput without sacrificing performance.
- **Model Architecture**: Using existing or custom model architectures with fewer parameters can speed up training, reduce computational demands, and maintain satisfactory performance.
- **Parameter Tuning Methods**: Consider using parameter tuning methods when computational resources allow, as they can yield significant benefits in long-term operations.

In summary, improving the efficiency of machine learning algorithms through optimization techniques can not only accelerate model training but also significantly reduce the carbon footprint associated with computational resources. The research results of this paper provide important references for achieving sustainable computing.
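
The summary evaluates models by two quantities: the Dice coefficient and energy consumption. As a rough illustration of what these measure, here is a minimal sketch in plain Python. The function names `dice_coefficient` and `energy_kwh` are hypothetical, as is the assumption that power is sampled at a fixed interval. The summary does not describe the measurement tooling the authors actually used, so this is not their implementation.

```python
from typing import Sequence


def dice_coefficient(pred: Sequence[int], target: Sequence[int]) -> float:
    """Dice = 2|A ∩ B| / (|A| + |B|) for two flattened binary masks (0/1 values)."""
    if len(pred) != len(target):
        raise ValueError("masks must contain the same number of voxels")
    # For 0/1 masks, the product is 1 only where both masks mark foreground.
    intersection = sum(p * t for p, t in zip(pred, target))
    total = sum(pred) + sum(target)
    # Assumed convention: two empty masks count as a perfect match.
    if total == 0:
        return 1.0
    return 2.0 * intersection / total


def energy_kwh(power_watts: Sequence[float], interval_s: float) -> float:
    """Approximate energy from evenly spaced power readings (rectangle rule)."""
    joules = sum(power_watts) * interval_s  # W * s = J
    return joules / 3.6e6  # 1 kWh = 3.6e6 J


if __name__ == "__main__":
    print(dice_coefficient([1, 1, 0, 0], [1, 0, 0, 0]))
    print(energy_kwh([100.0] * 36, 100.0))
```

In the first call the two masks share one foreground voxel out of three marked in total, so the score is 2·1/(2+1) ≈ 0.667. In the second, a constant 100 W draw over one hour corresponds to 0.1 kWh. Dividing such an energy figure by the number of epochs gives a per-epoch value comparable to the metric mentioned in the results.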