PUDL Example Notebooks

This repository contains a collection of Jupyter notebooks with examples of how to use the data and software distributed by Catalyst Cooperative's Public Utility Data Liberation (PUDL) project.

Run PUDL Notebooks on Kaggle

The easiest way to get up and running with these examples and a fresh copy of all the PUDL data is on Kaggle.

Kaggle offers substantial free computing resources and convenient data storage, so you can start playing with the PUDL data without needing to set up any software or download any data.

You'll find the PUDL data dictionary helpful for interpreting the data.

Running Jupyter locally

If you're already familiar with git, Python environments, filesystem paths, and running upyter notebooks locally, you can also work with these notebooks and the PUDL data locally:

Create a Python environment that includes common data science packages. We like to use the mamba package manager and the conda-forge channel.
Clone this repository.
Download the PUDL dataset from Kaggle (it's ~20GB!) and unzip it somewhere conveniently accessible from the notebooks in the cloned repo.
Start your JupyterLab or Jupyter Notebook server and navigate to the notebooks in the cloned repo.
You'll need to adjust the file paths in the notebooks to point at the directory where you put the PUDL data, and might need to adjust the packages installed in your Python environment to work with the notebooks.

Other Data Access Methods

See the PUDL documentation for other data access methods.

If you're familiar with cloud services, you can check out:

PUDL in the AWS Open Data Registry: s3://pudl.catalyst.coop (free access)
Google Cloud Storage: gs://pudl.catalyst.coop (requester pays)

Stalk us on the Internet

Supporting PUDL

These example notebooks are part of the Public Utility Data Liberation Project (PUDL), a project of Catalyst Cooperative. PUDL has been made possible by the generous support of our sustainers, grant funders, and volunteer open source contributors.

If you would like to support the ongoing development of PUDL, please consider becoming a sustainer.

Name		Name	Last commit message	Last commit date
Latest commit History 321 Commits
.github		.github
.gitignore		.gitignore
01-pudl-data-access.ipynb		01-pudl-data-access.ipynb
02-state-hourly-electricity-demand.ipynb		02-state-hourly-electricity-demand.ipynb
03-eia930-sanity-checks.ipynb		03-eia930-sanity-checks.ipynb
04-renewable-generation-profiles.ipynb		04-renewable-generation-profiles.ipynb
LICENSE.txt		LICENSE.txt
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PUDL Example Notebooks

Run PUDL Notebooks on Kaggle

Running Jupyter locally

Other Data Access Methods

Stalk us on the Internet

Supporting PUDL

About

Releases

Sponsor this project

Contributors 8

Languages

License

catalyst-cooperative/pudl-examples

Folders and files

Latest commit

History

Repository files navigation

PUDL Example Notebooks

Run PUDL Notebooks on Kaggle

Running Jupyter locally

Other Data Access Methods

Stalk us on the Internet

Supporting PUDL

About

Topics

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

Sponsor this project

Contributors 8

Languages