Improve CI workflows, add autoformatters / linters #236

zaneselvans · 2022-05-26T20:41:04Z

No description provided.

zaneselvans · 2022-05-26T20:58:32Z

Okay @cmgosnell whenever you have a chance to look, this seems to work as well as it can given the current state of the main branch. The tests fail locally with Index mismatch errors that I think we've already talked about on the other long-running development branch, but the CI and other infrastructure stuff seems to work fine.

zaneselvans · 2022-07-22T17:05:25Z

@katie-lamb I updated the CI and package requirements to only ever use Python 3.10 (and to use the new bot automerge PR workflows) so hopefully your 3.8/3.9 dependency resolution issues can be ignored.

review-notebook-app · 2022-07-27T13:10:23Z

Check out this pull request on ReviewNB.

See visual diffs & provide feedback on Jupyter Notebooks.

zaneselvans · 2022-07-29T21:17:50Z

What's the max memory usage of the tests at this point? Is it close to 7GB? I imagine the runner uses some memory on its own just to exist and have an OS, and even if the tests themselves were a bit under 7GB, the sum of the overhead and the tests could still be too much.
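One rough way to answer the peak-memory question is to wrap a test step with the standard library's `tracemalloc`. The sketch below is only an illustration: the 7 GiB limit comes from the comment above, while `ASSUMED_OVERHEAD_GIB`, `measure_peak_gib`, and `fits_on_runner` are hypothetical names and values, not anything in this repo. `tracemalloc` only sees allocations made through Python's allocators, so the figure is best treated as a lower bound on real process memory.

```python
import tracemalloc

RUNNER_LIMIT_GIB = 7.0      # figure quoted in the comment above
ASSUMED_OVERHEAD_GIB = 1.0  # hypothetical allowance for the OS and the runner itself


def measure_peak_gib(func, *args, **kwargs):
    """Run func and return (result, peak traced memory in GiB)."""
    tracemalloc.start()
    try:
        result = func(*args, **kwargs)
        _, peak_bytes = tracemalloc.get_traced_memory()
    finally:
        tracemalloc.stop()
    return result, peak_bytes / 2**30


def fits_on_runner(peak_gib, limit_gib=RUNNER_LIMIT_GIB, overhead_gib=ASSUMED_OVERHEAD_GIB):
    """True if the measured peak plus the assumed overhead stays under the limit."""
    return peak_gib + overhead_gib < limit_gib


if __name__ == "__main__":
    # Stand-in workload; a real check would call the expensive test step instead.
    _, peak = measure_peak_gib(lambda: len(bytearray(10_000_000)))
    print(f"peak traced memory: {peak:.4f} GiB, fits: {fits_on_runner(peak)}")
```

Even a test suite that peaks a bit under 7 GiB would be flagged here once the overhead allowance is added, which is the concern raised above.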

codecov · 2022-10-18T19:16:51Z

Codecov Report

❗ No coverage uploaded for pull request base (main@d9b708d).
Patch has no changes to coverable lines.

Additional details and impacted files

@@           Coverage Diff           @@
##             main     #236   +/-   ##
=======================================
  Coverage        ?   74.39%           
=======================================
  Files           ?       10           
  Lines           ?     1183           
  Branches        ?        0           
=======================================
  Hits            ?      880           
  Misses          ?      303           
  Partials        ?        0
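As a quick sanity check on the table above, the coverage percentage can be recomputed from the hit, miss, and partial counts. This assumes coverage is hits divided by the total of hits, misses, and partials; the variable names are just for illustration.

```python
# Counts copied from the Codecov table above.
hits, misses, partials = 880, 303, 0

# Assumption: total lines = hits + misses + partials, coverage = hits / total.
lines = hits + misses + partials
coverage_pct = round(100 * hits / lines, 2)

print(lines, coverage_pct)
```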

katie-lamb · 2022-10-19T21:38:33Z

Update update update (for @cmgosnell):

The CI now passes wowww! Major changes:

- I just made the GitHub workflow CI run on 5 years of data coverage (2015 - 2020). I did this instead of 1 year because with 5 years the matching models only perform slightly worse than with all the data, so the CI is somewhat testing how well the models are performing while still staying under the GitHub memory limit. Running tox or pytest without the --five-year-coverage argument will still run the tests on all the years of data.
- I added new expected_errors CSV files for the data validation tests for the five year coverage. These roughly match the original expected_errors CSV files for all the data, so I just went with it. But maybe there's a better way to check these expected errors files and make sure there's nothing wrong. The expected_errors for all years of data probably also need to be updated so that the validation tests pass.
- The pudl installation in the pudl-rmi environment now pulls from the dev branch instead of the released version of PUDL. This might lead to more/faster maintenance down the line, but it's also way nicer for modifying PUDL and having changes appear in this repo.
- For clarity, I renamed the plant_name_new and ownership columns in the plant part list to plant_name_ppe and ownership_record_type respectively when it's created in PUDL. These name changes are reflected in this PR. Additionally, pudl.analysis.plant_parts_eia.PLANT_PARTS_ORDERED was taken out and now the PLANT_PARTS global dictionary does all the work.
- Some clean up was done now that we're not memory constrained:
  - The distinct plant parts list is now an output instead of being made as part of the plant part list output.
  - The training data connections were moved back into the FERC to EIA connection module.
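For readers unfamiliar with custom pytest flags, here is a minimal sketch of how a `--five-year-coverage` option could be registered in a `conftest.py` and used to narrow the years under test. Only the flag name and the 2015 - 2020 range come from the comment above; `FIVE_YEAR_RANGE`, `select_years`, and treating both endpoints as inclusive are assumptions for illustration, not the code in this PR.

```python
# Hypothetical conftest.py sketch; not the actual implementation in this repo.
FIVE_YEAR_RANGE = (2015, 2020)  # range quoted in the comment; inclusive endpoints assumed


def pytest_addoption(parser):
    """Register the command line flag with pytest."""
    parser.addoption(
        "--five-year-coverage",
        action="store_true",
        default=False,
        help="Run the tests on a reduced range of years to limit memory use.",
    )


def select_years(five_year_coverage, all_years):
    """Return the years to process, restricted when the flag is set."""
    if not five_year_coverage:
        return list(all_years)
    start, end = FIVE_YEAR_RANGE
    return [year for year in all_years if start <= year <= end]
```

A fixture could then read the flag with `request.config.getoption("--five-year-coverage")` and pass it to `select_years`, so running without the flag keeps the full set of years.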

I'm not entirely sure why one of the data validation tests isn't passing. It seems like a weird type error because I'm pretty sure the expected and actual data is the same. Need to look into this.

src/pudl_rmi/connect_deprish_to_ferc1.py

katie-lamb · 2022-10-20T22:47:28Z

I'm confused about why the validation tests are failing when checking whether the indexes are equal. It seems to be an issue with the type of the report_year index level (the actual index is an Index while the index read in from the CSV is an Int64Index). I set the argument exact=False and am still failing the assert, even though it's supposed to ignore type differences.
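One way to sidestep index-level dtype mismatches like this is to move the index levels into ordinary columns, cast the key column to a common dtype, and compare the resulting DataFrames. The sketch below is only an assumption about what such a check could look like; `assert_same_records`, the `record_id` and `value` columns, and the sample data are hypothetical, and this is not the test code from this PR.

```python
import pandas as pd
import pandas.testing as pdt


def assert_same_records(actual, expected, int_levels=("report_year",)):
    """Compare two indexed frames after normalizing integer index levels."""
    flattened = []
    for df in (actual, expected):
        # Turn the index levels into columns so their dtypes can be set directly.
        flat = df.reset_index()
        flat = flat.astype({level: "int64" for level in int_levels})
        flattened.append(flat)
    pdt.assert_frame_equal(flattened[0], flattened[1])


idx_cols = ["report_year", "record_id"]

# "actual" starts from an object-typed year column, standing in for computed output.
actual = pd.DataFrame(
    {
        "report_year": pd.Series([2019, 2020], dtype="object"),
        "record_id": ["a", "b"],
        "value": [1.0, 2.0],
    }
).set_index(idx_cols)

# "expected" uses plain integers, standing in for data read back from a CSV.
expected = pd.DataFrame(
    {"report_year": [2019, 2020], "record_id": ["a", "b"], "value": [1.0, 2.0]}
).set_index(idx_cols)

assert_same_records(actual, expected)
```

Comparing flattened frames trades away a direct index comparison, but it keeps the dtype handling in one visible place.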

katie-lamb · 2022-10-26T01:49:08Z

test/integration/rmi_out_test.py

+                    ],
+                    level=levels_set_types,
+                )
+            )


This whole index-level data type setting business is super janky, but it fixes the weird one-off error I was getting with the type mismatch. Maybe I should look into a more general/lasting solution here.

cmgosnell

Hey @katie-lamb! These changes look good overall, but I added a smattering of suggestions/questions.

src/pudl_rmi/connect_deprish_to_ferc1.py

src/pudl_rmi/coordinate.py

src/pudl_rmi/connect_ferc1_to_eia.py

src/pudl_rmi/deprish.py

test/integration/rmi_out_test.py

cmgosnell · 2022-11-02T17:48:25Z

test/integration/rmi_out_test.py

+                    ],
+                    level=levels_set_types,
+                )
+            )


Yeah, this looks janky and it's hard to understand what is happening. If you are going to keep it, I'd suggest at least adding some why/what comments in here.

src/pudl_rmi/connect_ferc1_to_eia.py

cmgosnell

this looks great!

Improve CI workflows, add autoformatters / linters

5e6529d

zaneselvans requested a review from cmgosnell May 26, 2022 20:41

Avoid accidentally triggering comment type annotation pre-commit hook

5aa3c69

cmgosnell self-assigned this Jun 6, 2022

cmgosnell added the rmi label (use on all issues in this repo, for project management tools) Jun 6, 2022

zaneselvans and others added 5 commits June 23, 2022 18:32

Update pre-commit hooks.

8ad072f

Merge branch 'main' into update-ci

709cb98

changed to use dev branch, this might be temporary, took out refs to true grans

39e41ea

fixes to make integration tests pass

6c62629

Restrict to Python 3.10+ and add bot-automerge workflow.

b03e04d

katie-lamb and others added 11 commits July 26, 2022 23:23

change to rmi ci branch so cache clears

6087b87

clear cache in integration tests to see if it passes

d6a8f5a

Switch to bot-auto-merge workflow and auto-format YAML.

c941973

[pre-commit.ci] auto fixes from pre-commit.com hooks

1fc871e

For more information, see https://pre-commit.ci

Remove test matrix since we're only using Python 3.10

0d9cd64

Merge branch 'bot-auto-merge' of github.com:catalyst-cooperative/rmi-ferc1-eia into bot-auto-merge

3a03cc1

Remove stale pudl_out directory.

8cddcad

Bump github action cache to v3.0.5

bb142fc

Update pandera version

eb74b8d

pre-commit autoupdate

615124e

Merge pull request #258 from catalyst-cooperative/bot-auto-merge

9d824c0

Switch to bot-auto-merge workflow and auto-format YAML.

katie-lamb added 4 commits July 28, 2022 11:11

delete dfs after creation in tests

827ff2c

github not running ci-test

8227c97

actually clear the cache

021ebd2

move clear cache to ppl in pudl

e466793

fix cache clearing

52c5867

katie-lamb added 6 commits September 25, 2022 21:05

switch to new pudl branch

ab866d1

updated tox settings to allow for small coverage arg

27ef3b8

get all but ferc to eia working with one year

123ab83

add parameter to consistency checks for ferc to eia

1a831f0

fix consistency checks bug

04136b9

fixed validation tests for the five year test

67838ca

katie-lamb linked an issue Oct 19, 2022 that may be closed by this pull request

enable 1-year of data processing #226

Closed

5 tasks

katie-lamb added 3 commits October 19, 2022 14:40

remove deletion of pudl out dfs in rmi out object

73c9fb4

moved prep train connections back into ferc to eia module

5e6e367

clean up pickled ppe flow

5e53121

katie-lamb reviewed Oct 20, 2022

src/pudl_rmi/connect_deprish_to_ferc1.py (outdated)

katie-lamb added 2 commits October 20, 2022 15:42

took out prep train connections from coordinate

4cb4077

make index equal check not exact

e343876

maybe fixed the validation index level types issue

b6db55f

katie-lamb reviewed Oct 26, 2022

cmgosnell requested changes Nov 2, 2022

katie-lamb added 8 commits November 7, 2022 08:31

removed df_to_scale thats no longer being used

61a3fa5

remove five year test flag from ferc to eia and start and end args from deprish

3fa2114

took out early pickling of train connections, all done in ferc to eia connection

8481a50

updated expected errors

fb22b72

updated expected error with fresh db

f40ae78

fix multiindex dtype issue by comparing dataframes instead

b213bb7

take out flag for cached distinct ppl

27df7d9

take out distinct ppe clobber

48befa7

cmgosnell approved these changes Nov 9, 2022

katie-lamb merged commit 87150eb into main Nov 10, 2022

katie-lamb deleted the update-ci branch November 10, 2022 02:42
