EfficientBioAI: Making Bioimaging AI Models Efficient in Energy, Latency and Representation

Yu Zhou,Justin Sonneck,Sweta Banerjee,Stefanie Dörr,Anika Grüneboom,Kristina Lorenz,Jianxu Chen
2023-06-09
Abstract:Artificial intelligence (AI) has been widely used in bioimage image analysis nowadays, but the efficiency of AI models, like the energy consumption and latency is not ignorable due to the growing model size and complexity, as well as the fast-growing analysis needs in modern biomedical studies. Like we can compress large images for efficient storage and sharing, we can also compress the AI models for efficient applications and deployment. In this work, we present EfficientBioAI, a plug-and-play toolbox that can compress given bioimaging AI models for them to run with significantly reduced energy cost and inference time on both CPU and GPU, without compromise on accuracy. In some cases, the prediction accuracy could even increase after compression, since the compression procedure could remove redundant information in the model representation and therefore reduce over-fitting. From four different bioimage analysis applications, we observed around 2-5 times speed-up during inference and 30-80$\%$ saving in energy. Cutting the runtime of large scale bioimage analysis from days to hours or getting a two-minutes bioimaging AI model inference done in near real-time will open new doors for method development and biomedical discoveries. We hope our toolbox will facilitate resource-constrained bioimaging AI and accelerate large-scale AI-based quantitative biological studies in an eco-friendly way, as well as stimulate further research on the efficiency of bioimaging AI.
Machine Learning,Image and Video Processing
What problem does this paper attempt to address?
The main problem that this paper attempts to solve is the efficiency of AI models in biomedical image analysis, specifically including the following aspects: 1. **Energy Consumption**: As the complexity and scale of AI models grow, the energy required for the models during the inference process increases significantly. For example, from 2012 to 2021, the energy consumption for one forward propagation of the state - of - the - art AI models increased from 0.1 joules to approximately 30 joules. 2. **Inference Latency**: When complex AI models run on CPUs or edge devices, the inference time may increase significantly, especially when processing very large images. This not only affects real - time performance but also limits the efficiency of large - scale biomedical research. 3. **Representation Efficiency**: Since the growth rate of training data cannot keep up with the growth of model complexity, the models may become inefficient in training and more prone to overfitting. To solve these problems, the authors proposed a toolkit named EfficientBioAI, aiming to improve the efficiency of AI models in terms of energy, latency, and representation by compressing them. Specific methods include: - **Quantization**: Quantize the weights and activations of the model from commonly used floating - point numbers (such as 32 - bit or 16 - bit) to lower precision (such as 8 - bit integers), thereby reducing energy consumption and inference latency. - **Pruning**: Reduce the model complexity by removing unimportant weights or neurons in the neural network, thereby reducing the computational overhead. - **Knowledge Distillation**: Transfer the knowledge of complex models to simpler and faster models to improve the inference speed and energy efficiency. Through these techniques, EfficientBioAI can significantly improve the efficiency of biomedical image analysis tasks without affecting the accuracy of the models. Experimental results show that in multiple biomedical image analysis tasks, EfficientBioAI can achieve 2 - 5 times of inference acceleration and save 30% - 80% of energy consumption. In addition, EfficientBioAI also has good compatibility and ease of use, supports multiple hardware platforms, and can be seamlessly integrated with existing biomedical image analysis tools, thereby promoting efficient biomedical research in resource - constrained environments.