Skip to content

mlco2/codecarbon

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

banner

Estimate and track carbon emissions from your compute, quantify and analyze their impact.

Documentation


DOI

About CodeCarbon πŸ’‘

While computing currently represents roughly 0.5% of the world’s energy consumption, that percentage is projected to grow beyond 2% in the coming years, which will entail a significant rise in global CO2 emissions if not done properly. Given this increase, it is important to quantify and track the extent and origin of this energy usage, and to minimize the emissions incurred as much as possible.

For this purpose, we created CodeCarbon, a Python package for tracking the carbon emissions produced by various kinds of computer programs, from straightforward algorithms to deep neural networks.

By taking into account your computing infrastructure, location, usage and running time, CodeCarbon can provide an estimate of how much CO2 you produced, and give you some comparisons with common modes of transportation to give you an order of magnitude.

Our hope is that this package will be used widely for estimating the carbon footprint of computing, and for establishing best practices with regards to the disclosure and reduction of this footprint.

Follow the steps below to set up the package and don't hesitate to open an issue if you need help!

Installation πŸ”‹

Create a virtual environment using conda for easier management of dependencies and packages. For installing conda, follow the instructions on the official conda website

conda create --name codecarbon
conda activate codecarbon

Install from PyPI repository

pip install codecarbon

or

pip install codecarbon[viz]

if you wish to use the built-in visualization tool described below.

Install from Conda repository

conda install -c conda-forge codecarbon

codecarbon is now installed in your the local environment

Quickstart πŸš€

Online mode

This is the most straightforward usage of the package, which is possible if you have access to the Internet, which is necessary to gather information regarding your geographical location.

from codecarbon import EmissionsTracker

tracker = EmissionsTracker()
tracker.start()
# GPU Intensive code goes here
tracker.stop()

You can also use the decorator:

from codecarbon import track_emissions

@track_emissions
def training_loop():
   pass

The EmissionsTracker also works as a context manager:

from codecarbon import EmissionsTracker

with EmissionsTracker() as tracker:
    # GPU Intensive code goes here

This will write a csv file of the CO2 emissions in the current directory.

Code Carbon API (ALPHA)

(This feature is currently in ALPHA stage, meaning it's only available for developers installing CodeCarbon from this repository. The documentation and implementation are is still under development.)

This mode use the Code Carbon API to upload the timeseries of your emissions.

Before using it, you need an experiment_id, to get one, run:

codecarbon init

It will ask the API for an experiment_id on the default project and save it to .codecarbon.config for you.

Then, you could tell Code Carbon to monitor your machine:

codecarbon monitor

Or use the API in your code:

from codecarbon import track_emissions

@track_emissions(save_to_api=True)
def train_model():
    # Your code here

if __name__ == "__main__":
    train_model()

This is the simplest way. You could specify others options in @track_emissions or in .codecarbon.config.

Offline mode

This mode can be used in setups without internet access, but requires a manual specification of your country code. A complete list of country ISO codes can be found on Wikipedia.

The offline tracker can be used as follows:

from codecarbon import OfflineEmissionsTracker

tracker = OfflineEmissionsTracker(country_iso_code="CAN")
tracker.start()
# GPU Intensive code goes here
tracker.stop()

or by using the decorator:

from codecarbon import track_emissions

@track_emissions(offline=True, country_iso_code="CAN")
def training_loop():
   pass

or by using the context manager:

from codecarbon import EmissionsTracker

with EmissionsTracker() as tracker:
    # GPU Intensive code goes here

Flush data when running

By default, Code Carbon only writes the CVS output file when it stops. But for a long run you may want to save intermediate data. It is possible to call the flush() method to do so.

Have a look to ./examples/mnist_callback.py for an example.

Note that if you use the API it will also call it when you call flush(). You could set api_call_interval to -1, so that it will not be called automatically and then force a call at the end of each epoch to get the emission of each epoch.

Using comet.ml

Nothing to do here πŸ’― ! Comet automatically starts a tracker and logs your emissions if you have CodeCarbon installed. More about comet and adding the CodeCarbon panel to your project in Comet Integration.

Configuration

CodeCarbon is developed with flexibility in mind. This means you don't have to keep passing the same arguments over and over to EmissionTracker objects in your scripts. Any and all arguments can be set from configuration files. CodeCarbon will look sequentially for arguments in:

  • ~/.codecarbon.config (global user config)
  • ./.codecarbon.config (local directory config)
  • CODECARBON_ARG (environment variables)
  • EmissionsTracker(arg=value)

Config files are INI text files which should look like:

[codecarbon]
arg=value

You can specify many parameters, for example :

[codecarbon]
experiment_id = 3f53714b-c1ed-4bc7-b791-0b3f076b81a2
api_endpoint = http://localhost:8008
#api_endpoint = https://api.codecarbon.io
log_level = DEBUG
save_to_api = True
api_call_interval = 600
logger_preamble = MySuperServer
misfire_grace_time = 15

More details in the documentation.

Examples 🐀

As an illustration of how to use CodeCarbon, we created a simple example using TensorFlow for digit classification on the MNIST dataset:

First, install Tensorflow 2.0:

pip install tensorflow

Then, run the examples in the examples/ folder, which will call the online version of the Emissions tracker by default:

python examples/mnist.py
python examples/mnist_decorator.py

This will create a .csv file with information about the energy that you used to carry out the classification task, and an estimate of the CO2 that you generated, complete with comparisons to common modes of transportation to give you a better idea of the order of magnitude of your emissions.

Built-in Visualization Tool 😍

To track the evolution of your CO2 emissions or just to see some nice graphs and charts, you can use the visualization tool that we created. As input, it takes a .csv file in the format generated by the Emissions Tracker, and it generates a visualization of the emissions incurred.

You can run try it out on a sample data file such as the one in examples/emissions.csv, and run it with the following command from the code base:

python codecarbon/viz/carbonboard.py --filepath="examples/emissions.csv"

If you have the package installed, you can run the CLI command:

carbonboard --filepath="examples/emissions.csv" --port=xxxx

Once you have generated your own .csv file based on your computations, you can feed that into the visualization tool to see a more visual representation of your own emissions.

Dashboard Summary

You can also see the carbon intensity of different regions and countries:

Global Equivalents

As well as the relative carbon intensity of different compute regions of cloud providers:

Cloud Emissions

Comet Integration πŸ₯‚

CodeCarbon automatically integrates with Comet for experiment tracking and visualization. Comet provides data scientists with powerful tools to track, compare, explain and reproduce their experiments, and now with CodeCarbon you can easily track the carbon footprint of your jobs along with your training metrics, hyperparameters, dataset samples, artifacts and more.

To get started with the Comet-CodeCarbon integration, make sure you have comet-ml installed:

pip install comet_ml>=3.2.2

The minimum Comet version

Go to Comet's website and create a free account. From your account settings page, copy your personal API key.

In the mnist-comet.py example file, replace the placeholder code with your API key:

experiment = Experiment(api_key="YOUR API KEY")

Run your experiment and click on the link in stdout to be taken back to the Comet UI. Automatically you'll see your metrics, hyperparameters, graph definition, system and environment details and more.

Comet will automatically create an EmissionsTracker object from the codecarbon package when your code runs. To visualize the carbon footprint of your experiment, go to the Panels tab in the left sidebar and click Add Panel.

From the Panel Gallery click the Public tab and search for CodeCarbon Footprint. Once you've found it, add it to your Experiment.

Now back in the Panels tab you'll see your CodeCarbon Footprint visualization in the Comet UI. To render the visualization by default, save your View. And voilΓ ! Every time you run your experiments, you'll be able to visualize your CodeCarbon emissions data alongside everything else you need to track for your research.

Report your emissions: LateX template πŸ“»

We believe that an important step towards reducing carbon emissions is the generalization of emissions reporting in papers, blog posts and publications in general. Here's an example LateX snippet you might want to use:

\usepackage{hyperref}

\subsection{CO2 Emission Related to Experiments}

Experiments were conducted using {cloud provider} in {region}, which has a carbon efficiency of {carbon efficiency} kgCO$_2$eq/kWh. A cumulative of {hours used} hours of computation was performed on hardware of type {hardware type} (TDP of W).

Total emissions are estimated to be {emission} kgCO$_2$eq of which {percentage offset} percents were directly offset by the cloud provider.

%Uncomment if you bought additional offsets:
%XX kg CO2eq were manually offset through \href{link}{Offset Provider}.

Estimations were conducted using the \href{https://github.com/mlco2/codecarbon}{CodeCarbon emissions tracker}, a joint effort from authors of \cite{lacoste2019quantifying} and \cite{lottick2019nergy}.

@article{lacoste2019quantifying,
  title={Quantifying the Carbon Emissions of Machine Learning},
  author={Lacoste, Alexandre and Luccioni, Alexandra and Schmidt, Victor and Dandres, Thomas},
  journal={Workshop on Tackling Climate Change with Machine Learning at NeurIPS 2019},
  year={2019}
}

@article{lottick2019nergy,
    title={Energy Usage Reports: Environmental awareness as part of algorithmic
  accountability},
    author={Kadan Lottick and Silvia Susai and Sorelle A. Friedler and Jonathan P. Wilson},
    year={2019},
    journal={Workshop on Tackling Climate Change with Machine Learning at NeurIPS 2019}
}

Citing CodeCarbon

@article{codecarbon,
  author={Victor Schmidt and Kamal Goyal and Aditya Joshi and Boris Feld and Liam Conell and Nikolas Laskaris and Doug Blank and Jonathan Wilson and Sorelle Friedler and Sasha Luccioni},
  title={{CodeCarbon: Estimate and Track Carbon Emissions from Machine Learning Computing}},
  year={2021},
  howpublished={\url{https://github.com/mlco2/codecarbon}},
  DOI={10.5281/zenodo.4658424},
  publisher={Zenodo},
}

Infrastructure Support πŸ–₯️

Currently the package supports following hardware infrastructure.

GPU

  • Tracks Nvidia GPUs power consumption using pynvml library, (which is installed with the package).

CPU

On Windows and Mac

  • Tracks Intel processors power consumption using the Intel Power Gadget
  • You need to install it independently for CodeCarbon to function.

Note: Please ensure that the Intel Power Gadget has the required security permissions on MacOS.

On Linux

Note: The Power Consumption will be tracked only if the RAPL files exist at the above mentioned path.

On all platforms

If CodeCarbon cannot find the appropriate software to track the CPUs' energy consumption, it will use a fallback strategy:

  • Find TDP:
    • If it can match your cpu type to a list of 2000+ Intel and AMD CPUs it will retrieve its thermal design power (TDPs)
    • If it really can't tell anything about your CPU it will use a constant TDP of 85W
  • CodeCarbon assumes that the (average) actual power consumption of CPUs will be 50% of their TDP.

Data Sources πŸ—’οΈ

To find the carbon efficiency of your cloud region, you can look into CodeCarbon's cloud data. If you are using a private infrastructure you can look into the CodeCarbon's private infrastructure. A number of resources can help you find the carbon efficiency of you local grid if you cannot find it in the previous links.

Contributing 🀝

We are hoping that the open-source community will help us edit the code and make it better!

You are welcome to open issues, even suggest solutions and better still contribute the fix/improvement! We can guide you if you're not sure where to start but want to help us out πŸ₯‡

In order to contribute a change to our code base, please submit a pull request (PR) via GitHub and someone from our team will go over it and accept it.

Check out our contribution guidelines ↗️

Contact @vict0rsch to be added to our slack workspace if you want to contribute regularly!

Build Documentation πŸ–¨οΈ

No software is complete without great documentation! To make generating documentation easier, install the sphinx package and use it to edit and improve the existing documentation:

pip install -U sphinx sphinx_rtd_theme
cd docs/edit
make docs

In order to make changes, edit the .rst files that are in the /docs/edit folder, and then run:

make docs

to regenerate the html files.

Contact πŸ“

Maintainers are @vict0rsch @benoit-cty and @SaboniAmine. Codecarbon is developed by volunteers from Mila and the DataForGoodFR community alongside donated professional time of engineers at Comet.ml and BCG GAMMA.