Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Engineer features for eia-ferc1 entity matching #1535

Open
1 of 4 tasks
katie-lamb opened this issue Mar 14, 2022 · 0 comments
Open
1 of 4 tasks

Engineer features for eia-ferc1 entity matching #1535

katie-lamb opened this issue Mar 14, 2022 · 0 comments
Labels
ccai Tasks related to CCAI grant for entity matching epic Any issue whose primary purpose is to organize other issues into a group.

Comments

@katie-lamb
Copy link
Member

katie-lamb commented Mar 14, 2022

Some feature engineering needs to be done on the PPL and FERC1 so that these datasets can be better used by Panda. These preprocessing steps for Panda should be put into a PUDL output layer.

EIA Plant Part List Column Descriptions

Tasks

  • Pare down the columns in the DFs we hand off to Chu lab - remove columns that are definitely not relevant in entity matching
  • Construct new columns in EIA data
  • (From weekly call notes) Add reference to the FERC 1 plant ID assignment in labeling column doc
  • create mapping from code abbreviations to their full length string

Other Ideas

  • add inter year data columns for feature comparison
@katie-lamb katie-lamb added epic Any issue whose primary purpose is to organize other issues into a group. ccai Tasks related to CCAI grant for entity matching labels Mar 14, 2022
@katie-lamb katie-lamb changed the title Create CCAI output layer Feature engineering Mar 15, 2022
@zaneselvans zaneselvans changed the title Feature engineering Engineer features for eia-ferc1 entity matching Mar 23, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
ccai Tasks related to CCAI grant for entity matching epic Any issue whose primary purpose is to organize other issues into a group.
Projects
None yet
Development

No branches or pull requests

1 participant