OpenCarbonEval: A Unified Carbon Emission Estimation Framework in Large-Scale AI Models

Zhaojian Yu, Yinghao Wu, Zhuotao Deng, Yansong Tang, Xiao-Ping Zhang
2024-05-21
Abstract: In recent years, large-scale auto-regressive models have made significant progress in various tasks, such as text or video generation. However, the environmental impact of these models has been largely overlooked, with a lack of assessment and analysis of their carbon footprint. To address this gap, we introduce OpenCarbonEval, a unified framework for integrating large-scale models across diverse modalities to predict carbon emissions, which could provide AI service providers and users with a means to estimate emissions beforehand and help mitigate the environmental pressure associated with these models. In OpenCarbonEval, we propose a dynamic throughput modeling approach that could capture workload and hardware fluctuations in the training process for more precise emissions estimates. Our evaluation results demonstrate that OpenCarbonEval can more accurately predict training emissions than previous methods, and can be seamlessly applied to different modal tasks. Specifically, we show that OpenCarbonEval achieves superior performance in predicting carbon emissions for both visual models and language models. By promoting sustainable AI development and deployment, OpenCarbonEval can help reduce the environmental impact of large-scale models and contribute to a more environmentally responsible future for the AI community.
Computers and Society, Machine Learning
What problem does this paper attempt to address?
### Problems the Paper Attempts to Solve

This paper addresses the lack of evaluation of the environmental impact of large-scale autoregressive models, such as those used for text and video generation. Specifically, effective methods for assessing and analyzing the carbon footprint of these models are missing. To fill this gap, the authors propose a unified framework, OpenCarbonEval, that predicts the carbon emissions of large-scale models across different modalities. The framework gives AI service providers and users a way to estimate carbon emissions before training, which helps mitigate the environmental pressure these models create.

### Key Points

1. **Background**:
   - Large-scale autoregressive models have made significant progress in various tasks, but their environmental impact has been overlooked.
   - Existing carbon footprint assessment methods mainly target small-scale models and cannot keep up with the rapid expansion of large-scale models.
2. **Objectives**:
   - Propose a unified framework (OpenCarbonEval) that can accurately predict the carbon emissions of large-scale models before training.
   - Improve prediction accuracy by capturing workload and hardware fluctuations during training through a dynamic throughput modeling approach.
3. **Methods**:
   - **Dynamic Throughput Modeling**: Building on Little’s Law, the authors propose a new dynamic throughput modeling method that adapts to different computational workloads and hardware configurations.
   - **Evaluation Metrics**: The performance of OpenCarbonEval is validated against actual data and compared with existing static modeling methods.
4. **Results**:
   - Experimental results show that OpenCarbonEval predicts the carbon emissions of vision models and language models more accurately than existing methods.
   - OpenCarbonEval allows for a more comprehensive analysis of the carbon emissions of large-scale models, providing valuable insights for reducing environmental impact.
5. **Significance**:
   - Promote the development and deployment of sustainable AI, reducing the environmental impact of large-scale models.
   - Help the AI community develop and use AI technology more responsibly, promoting environmental protection.

### Conclusion

OpenCarbonEval provides a unified and accurate framework for assessing the carbon emissions of large-scale AI models, contributing to a more transparent and environmentally friendly AI training process. With OpenCarbonEval, researchers and users can better understand and manage the environmental impact of AI models, promoting the sustainable development of AI technology.
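
To make the contrast between static and dynamic throughput modeling concrete, here is a minimal Python sketch. It is not OpenCarbonEval's actual model, which this summary does not specify. It assumes the common accounting in which training time comes from dividing total compute by throughput, and emissions are energy times grid carbon intensity. The function names, the exponential ramp-up curve, the `pue` overhead factor, and every numeric value are hypothetical and chosen only for illustration.

```python
import math


def training_time_s(total_flops, throughput_fn, dt_s=60.0, max_steps=10_000_000):
    """Step through time until the accumulated compute reaches total_flops.

    throughput_fn(t) returns the aggregate throughput in FLOP/s at time t.
    """
    done = 0.0
    t = 0.0
    for _ in range(max_steps):
        rate = throughput_fn(t)
        if rate <= 0:
            raise ValueError("throughput must be positive")
        step = rate * dt_s
        if done + step >= total_flops:
            # Finish the last partial step exactly.
            return t + (total_flops - done) / rate
        done += step
        t += dt_s
    raise RuntimeError("max_steps reached before training finished")


def emissions_kg(time_s, num_devices, device_power_w,
                 carbon_intensity_kg_per_kwh, pue=1.0):
    """Energy (kWh) times carbon intensity; pue is an assumed overhead factor."""
    energy_kwh = num_devices * device_power_w * time_s / 3.6e6
    return energy_kwh * pue * carbon_intensity_kg_per_kwh


def ramp_up_throughput(peak_flops_per_s, u_min, u_max, tau_s):
    """Hypothetical utilization curve rising from u_min toward u_max."""
    def fn(t):
        u = u_min + (u_max - u_min) * (1.0 - math.exp(-t / tau_s))
        return peak_flops_per_s * u
    return fn


if __name__ == "__main__":
    total = 1e18   # illustrative training compute in FLOPs
    peak = 1e15    # illustrative aggregate peak throughput in FLOP/s

    def static(t):
        # Static model: utilization fixed at 50% for the whole run.
        return peak * 0.5

    dynamic = ramp_up_throughput(peak, u_min=0.1, u_max=0.5, tau_s=600.0)
    for name, fn in [("static", static), ("dynamic", dynamic)]:
        t = training_time_s(total, fn)
        kg = emissions_kg(t, num_devices=10, device_power_w=400.0,
                          carbon_intensity_kg_per_kwh=0.5)
        print(f"{name}: {t:.0f} s, {kg:.3f} kg CO2e")
```

Under these assumptions the dynamic curve yields a longer training time, and so a higher emissions estimate, than the static one, because utilization starts below its plateau. A static estimate that assumes plateau utilization throughout would under-predict in this scenario.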