-
Notifications
You must be signed in to change notification settings - Fork 3
Integration with Transforms #14
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
smatzana
commented
Nov 30, 2021
- Adding some necessary attributes to a Flow in order to enable transforms
- Only affect Batch and OnlineFlows
65d5efd
to
ce564d9
Compare
ce564d9
to
c269a75
Compare
|
||
|
||
if __name__ == "__main__": | ||
TxTestFlow() |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Could you please add a step for how the Transform is expected to be used? This would be used as an example and test of expected customer usage for us to review. thanks!
|
||
if len(self._txf_callbacks): | ||
for _, callback in self._txf_callbacks.items(): | ||
callback() |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is this callback actually calling the transform? better named txf_callback
.
It'll help to see an example transform in unit tests and/or tutorials.
|
||
add_txf_attributes(self) | ||
if not self._txf_registered_datasets.get(flow_name, None): | ||
self._txf_registered_datasets[flow_name] = [dataset] |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Why do we maintain a dictionary of _txf_registered_datasets[flow_name]
where flow_name
is always the name of the flow?
I think you mean _txf_registered_datasets[dataset.name]
. However, that may not work because one could reference the multiple Flow dataset variable/parameter to the same logically named dataset.name
.
What is _txf_registered_datasets
used for? Could you please reference a PR for what it is used for?