OpenCarbonEval: A Unified Carbon Emission Estimation Framework in Large-Scale AI Models

Zhaojian Yu,Yinghao Wu,Zhuotao Deng,Yansong Tang,Xiao-Ping Zhang

2024-05-21

Abstract:In recent years, large-scale auto-regressive models have made significant progress in various tasks, such as text or video generation. However, the environmental impact of these models has been largely overlooked, with a lack of assessment and analysis of their carbon footprint. To address this gap, we introduce OpenCarbonEval, a unified framework for integrating large-scale models across diverse modalities to predict carbon emissions, which could provide AI service providers and users with a means to estimate emissions beforehand and help mitigate the environmental pressure associated with these models. In OpenCarbonEval, we propose a dynamic throughput modeling approach that could capture workload and hardware fluctuations in the training process for more precise emissions estimates. Our evaluation results demonstrate that OpenCarbonEval can more accurately predict training emissions than previous methods, and can be seamlessly applied to different modal tasks. Specifically, we show that OpenCarbonEval achieves superior performance in predicting carbon emissions for both visual models and language models. By promoting sustainable AI development and deployment, OpenCarbonEval can help reduce the environmental impact of large-scale models and contribute to a more environmentally responsible future for the AI community.

Computers and Society,Machine Learning

What problem does this paper attempt to address?

### Problems the Paper Attempts to Solve This paper aims to address the lack of evaluation regarding the environmental impact of large-scale autoregressive models (such as tasks in text generation and video generation). Specifically, there is a lack of effective methods for assessing and analyzing the carbon footprint of these models. To fill this gap, the authors propose a unified framework called OpenCarbonEval, which is used to predict the carbon emissions of large-scale models across different modalities. This framework provides AI service providers and users with a means to estimate carbon emissions before training, thereby helping to mitigate the environmental pressure brought by these models. ### Key Points 1. **Background**: - Large-scale autoregressive models have made significant progress in various tasks, but their environmental impact has been overlooked. - Existing carbon footprint assessment methods mainly target small-scale models and cannot keep up with the rapid expansion of large-scale models. 2. **Objectives**: - Propose a unified framework (OpenCarbonEval) that can accurately predict the carbon emissions of large-scale models before training. - Improve prediction accuracy by capturing workload and hardware fluctuations during the training process through a dynamic throughput modeling approach. 3. **Methods**: - **Dynamic Throughput Modeling**: Combining Little’s Law, a new dynamic throughput modeling method is proposed to adapt to different computational workloads and hardware configurations. - **Evaluation Metrics**: Validate the performance of OpenCarbonEval with actual data and compare it with existing static modeling methods. 4. **Results**: - Experimental results show that OpenCarbonEval performs excellently in predicting the carbon emissions of vision models and language models, being more accurate than existing methods. - OpenCarbonEval allows for a more comprehensive analysis of the carbon emissions of large-scale models, providing valuable insights for reducing environmental impact. 5. **Significance**: - Promote the development and deployment of sustainable AI, reducing the environmental impact of large-scale models. - Help the AI community develop and use AI technology more responsibly, promoting environmental protection. ### Conclusion OpenCarbonEval provides a unified and accurate framework for assessing the carbon emissions of large-scale AI models, contributing to a more transparent and environmentally friendly AI training process. By using OpenCarbonEval, researchers and users can better understand and manage the environmental impact of AI models, promoting the sustainable development of AI technology.

OpenCarbonEval: A Unified Carbon Emission Estimation Framework in Large-Scale AI Models

Green AI: Exploring Carbon Footprints, Mitigation Strategies, and Trade Offs in Large Language Model Training

Revisit the Environmental Impact of Artificial Intelligence: the Overlooked Carbon Emission Source?

Generative AI for Low-Carbon Artificial Intelligence of Things with Large Language Models

Method and evaluations of the effective gain of artificial intelligence models for reducing CO2 emissions

LLMCarbon: Modeling the end-to-end Carbon Footprint of Large Language Models

Carbon Monitor AutoForecast-Asia: a real-time emission estimates of the residential sector for Asian major emitters with an automatic machine learning framework

eco2AI: Carbon Emissions Tracking of Machine Learning Models as the First Step Towards Sustainable AI

Carbon Emissions and Large Neural Network Training

AI-Driven Carbon Emissions Tracking and Mitigation Model

The Optuna–LightGBM–XGBoost Model: A Novel Approach for Estimating Carbon Emissions Based on the Electricity–Carbon Nexus

Modeling and predicting city-level CO2 emissions using open access data and machine learning

Modeling and Estimation of CO2 Emissions in China Based on Artificial Intelligence

Full-Cycle Energy Consumption Benchmark for Low-Carbon Computer Vision

Carbon Footprint Accounting Driven by Large Language Models and Retrieval-augmented Generation

Modeling and Spatio-Temporal Analysis on CO 2 Emissions in the Guangdong-Hong Kong-Macao Greater Bay Area and Surrounding Cities Based on Neural Network and Autoencoder

Beyond Efficiency: Scaling AI Sustainably

Decoupling representation contrastive learning for carbon emission prediction and analysis based on time series

Near Real-time CO$_2$ Emissions Based on Carbon Satellite and Artificial Intelligence

ISSA-enhanced GRU-Transformer: integrating sports wisdom into the frontier exploration of carbon emission prediction

A new multiregional carbon emissions forecasting model based on a multivariable information fusion mechanism and hybrid spatiotemporal graph convolution network