Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve resume suggestions #3719

Merged
merged 13 commits into from
Apr 2, 2024

Conversation

ondrejzacha
Copy link
Contributor

Description

Supersedes #3026

Addresses #3002:

  • Parameter inputs are considered persistent, therefore a failed node with a parameter input won't necessarily trigger re-run of their ancestor nodes.

Additionally:

  • Only nodes producing non-persistent inputs are suggested for rerun (as opposed to all parent nodes of a node with 1+ non-persistent inputs)
  • Number of suggested nodes to re-run from is minimised (-> shorter message for the same pipeline)
  • Refactored into a testable function (therefore more applicable to other runners too)

Development notes

Additional tests added in test_sequential_runner.py – both to check the message in logs for sequential runner specifically, as well as a new suite of tests for the suggestion logic itself. I appreciate that the logic is not exactly straightforward, so please double check the tests / let me know if more tests are needed.

Developer Certificate of Origin

We need all contributions to comply with the Developer Certificate of Origin (DCO). All commits must be signed off by including a Signed-off-by line in the commit message. See our wiki for guidance.

If your PR is blocked due to unsigned commits, then you must follow the instructions under "Rebase the branch" on the GitHub Checks page for your PR. This will retroactively add the sign-off to all unsigned commits and allow the DCO check to pass.

Checklist

  • Read the contributing guidelines
  • Signed off each commit with a Developer Certificate of Origin (DCO)
  • Opened this PR as a 'Draft Pull Request' if it is work-in-progress
  • Updated the documentation to reflect the code changes
  • Added a description of this change in the RELEASE.md file
  • Added tests to cover my changes
  • Checked if this change will affect Kedro-Viz, and if so, communicated that with the Viz team

- if dataset (or param) is persistent & shared, don't keep looking for ancestors
- only look for ancestors producing impersistent inputs
- minimize number of suggested nodes (= shorter message for the same pipeline)
- testable logic, tests cases outside of scenarios for sequential runner

Signed-off-by: Ondrej Zacha <ondrej.zacha@okra.ai>
Signed-off-by: Ondrej Zacha <ondrej.zacha@okra.ai>
kedro/runner/runner.py Outdated Show resolved Hide resolved
Signed-off-by: Ondrej Zacha <ondrej.zacha@okra.ai>
Comment on lines 289 to 308
def _nodes_with_external_inputs(
pipeline: Pipeline, nodes_of_interest: Iterable[Node]
) -> set[Node]:
"""For given ``Node``s in a ``Pipeline``, find their
subset which depends on external inputs of the ``Pipeline``.

Args:
pipeline: the ``Pipeline`` to search for direct parents in.
child: the ``Node`` to find parents of.
pipeline: the ``Pipeline`` to search for nodes in.
nodes_of_interest: the ``Node``s to analyze.

Returns:
A list of all ``Node``s that are direct parents of ``child``.
A set of ``Node``s that depend on external inputs
of nodes of interest.

"""
parent_pipeline = pipeline.only_nodes_with_outputs(*child.inputs)
return parent_pipeline.nodes
p_nodes_of_interest = pipeline.only_nodes(*(n.name for n in nodes_of_interest))
p_nodes_with_external_inputs = p_nodes_of_interest.only_nodes_with_inputs(
*p_nodes_of_interest.inputs()
)
return set(p_nodes_with_external_inputs.nodes)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The function name is _node_with_external_inputs but its arguments is a list of node. If you are trying to search for pipeline node that has external_inputs, can you use pipeline.inputs and then pipeline.only_nodes_with_inputs?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Good point, simplified that

kedro/runner/runner.py Outdated Show resolved Hide resolved
the run.

"""
all_nodes_that_need_to_run = find_all_required_nodes(
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
all_nodes_that_need_to_run = find_all_required_nodes(
nodes_to_be_run = find_all_required_nodes(

?

Comment on lines 244 to 245
# Find which of the remaining nodes would need to run first (in topo sort)
persistent_ancestors = find_initial_node_group(pipeline, all_nodes_that_need_to_run)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is this needed for "Number of suggested nodes to re-run from is minimised (-> shorter message for the same pipeline)" this?

Comment on lines 244 to 247
# Find which of the remaining nodes would need to run first (in topo sort)
persistent_ancestors = _find_initial_node_group(
pipeline, all_nodes_that_need_to_run
)
Copy link
Contributor

@noklam noklam Mar 18, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I find this a bit hard to understand the purpose of this, is it similar to this?

p = pipeline(all_nodes_tht_need_to_run)
p_inputs = p.inputs()
input_nodes = p.only_nodes_with_input(p_inputs)
input_nodes_names = [n.name for n in input_nodes]

It's just pseudocode so I don't guarantee it works

Copy link
Contributor Author

@ondrejzacha ondrejzacha Mar 18, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is not exactly the same. The difference appears when nodes share the same external inputs:

Pipeline([
    node(
        name="first_node",
        func=...,
        inputs=["external_input"],
        outputs="intermediate_output",
    ),
    node(
        name="second_node",
        func=...,
        inputs=["external_input", "intermediate_output"],
        outputs="final_output",
    )
])

Your suggestion would produce ["first_node", "second_node"] here, whereas with the topo sort group approach, only the ["first_node"] is produced (as this pipeline will have two topo sort groups, each containing a single node).

Copy link
Contributor

@noklam noklam left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I left some comments, and manually tested and it works well.

I have some concerns to open up a lot of public methods which is unclear to me how will it be useful outside of this context (Maybe for using it in notebook? Would love to see some example of this if that's the case)

Signed-off-by: Ondrej Zacha <ondrej.zacha@okra.ai>
@ondrejzacha
Copy link
Contributor Author

I have some concerns to open up a lot of public methods which is unclear to me how will it be useful outside of this context (Maybe for using it in notebook? Would love to see some example of this if that's the case)

Sure, I don't have a strong preference here. In my work I've found myself pulling these methods out of the runner code to figure out how to continue a run of a pipeline which did not fail gracefully (ran into an OOM error and never showed this suggestion) – but I can imagine that can very much be an exception

Copy link
Member

@merelcht merelcht left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for this improved PR @ondrejzacha ! 🙂 On first review, things look generally good to me. I've left a couple of clarification questions.

The tests look good as well, but I agree with your comment on moving it to a separate test file. The last set of tests aren't actually using any specific runner so it might confuse others coming across them to understand whether this is SequentialRunner specific or not.

kedro/runner/runner.py Outdated Show resolved Hide resolved
kedro/runner/runner.py Outdated Show resolved Hide resolved
kedro/runner/runner.py Show resolved Hide resolved
kedro/runner/runner.py Outdated Show resolved Hide resolved
- Use _EPHEMERAL attribute
- Move tests to separate file
- Docstring updates

Signed-off-by: Ondrej Zacha <ondrej.zacha@okra.ai>
Copy link
Contributor

@noklam noklam left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for making all the changes and it is taking so long. This is a great improvement and it's one step closer to rerunning pipeline in an efficient way.

Copy link
Contributor

@DimedS DimedS left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you, @ondrejzacha , for the significant update to the resume suggestions algorithm and the development of high-quality tests! It's fantastic that the suggestions are now effectively minimised, excluding nodes that have already been successfully processed. I've left some minor comments, but overall, the PR looks great to me!

kedro/runner/runner.py Outdated Show resolved Hide resolved
kedro/runner/runner.py Show resolved Hide resolved
kedro/runner/runner.py Show resolved Hide resolved
Copy link
Member

@merelcht merelcht left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for addressing my comments and this contribution @ondrejzacha ⭐ ⭐

Would you mind updating the release notes with this change as well and adding your name under the contributors section: https://github.com/kedro-org/kedro/blob/main/RELEASE.md#community-contributions ?

…tions-2

Signed-off-by: Ondrej Zacha <ondrej.zacha@okra.ai>
Signed-off-by: Ondrej Zacha <ondrej.zacha@okra.ai>
Signed-off-by: Ondrej Zacha <ondrej.zacha@okra.ai>
@ondrejzacha ondrejzacha requested a review from DimedS March 28, 2024 20:55
Copy link
Contributor

@DimedS DimedS left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you for addressing comments, @ondrejzacha !

kedro/runner/runner.py Show resolved Hide resolved
kedro/runner/runner.py Show resolved Hide resolved
kedro/runner/runner.py Outdated Show resolved Hide resolved
Signed-off-by: Ondrej Zacha <ondrej.zacha@okra.ai>
@ondrejzacha ondrejzacha requested a review from DimedS March 29, 2024 10:53
@ondrejzacha
Copy link
Contributor Author

I think from my side this is ready; the docs build is still failing so let me know if there's anything I can still do. Excited to see this merged, thanks for the reviews! 🤩

@noklam
Copy link
Contributor

noklam commented Mar 29, 2024

@ondrejzacha amazing, thanks again really appreciate the effort🙏 I will take over and merge this.

Copy link
Contributor

@DimedS DimedS left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you for responding to the comments, @ondrejzacha. Fantastic work on the PR!

@merelcht merelcht mentioned this pull request Apr 2, 2024
10 tasks
Signed-off-by: Nok Lam Chan <nok.lam.chan@quantumblack.com>
@merelcht merelcht merged commit 44b2ad8 into kedro-org:main Apr 2, 2024
39 of 40 checks passed
doxenix pushed a commit to doxenix/kedro that referenced this pull request Apr 3, 2024
* Improve suggestions to resume a failed pipeline

- if dataset (or param) is persistent & shared, don't keep looking for ancestors
- only look for ancestors producing impersistent inputs
- minimize number of suggested nodes (= shorter message for the same pipeline)
- testable logic, tests cases outside of scenarios for sequential runner

- Use _EPHEMERAL attribute
- Move tests to separate file
- Docstring updates

---------

Signed-off-by: Ondrej Zacha <ondrej.zacha@okra.ai>
Co-authored-by: Nok Lam Chan <nok.lam.chan@quantumblack.com>
AhdraMeraliQB pushed a commit that referenced this pull request Apr 17, 2024
* Improve suggestions to resume a failed pipeline

- if dataset (or param) is persistent & shared, don't keep looking for ancestors
- only look for ancestors producing impersistent inputs
- minimize number of suggested nodes (= shorter message for the same pipeline)
- testable logic, tests cases outside of scenarios for sequential runner

- Use _EPHEMERAL attribute
- Move tests to separate file
- Docstring updates

---------

Signed-off-by: Ondrej Zacha <ondrej.zacha@okra.ai>
Co-authored-by: Nok Lam Chan <nok.lam.chan@quantumblack.com>
Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>
AhdraMeraliQB added a commit that referenced this pull request Apr 17, 2024
)

* Update kedro-catalog-0.19.json (#3724)

* Update kedro-catalog-0.19.json

Signed-off-by: Anthony De Bortoli <anthony.debortoli@protonmail.com>

* Update set_up_vscode.md

Signed-off-by: Anthony De Bortoli <anthony.debortoli@protonmail.com>

---------

Signed-off-by: Anthony De Bortoli <anthony.debortoli@protonmail.com>
Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>

* Update project tests directory structure in docs

Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>

* Add docs on writing tests

Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>

* Drop dependency on toposort in favour of built-in graphlib (#3728)

* Replace toposort with graphlib (built-in from Python 3.9)

Signed-off-by: Ivan Danov <idanov@users.noreply.github.com>

* Create toposort groups only when needed

Signed-off-by: Ivan Danov <idanov@users.noreply.github.com>

* Update RELEASE.md and graphlib version constraints

Signed-off-by: Ivan Danov <idanov@users.noreply.github.com>

* Remove mypy-toposort

Signed-off-by: Ivan Danov <idanov@users.noreply.github.com>

* Ensure that the suggest resume test has no node ordering requirement

Signed-off-by: Ivan Danov <idanov@users.noreply.github.com>

* Ensure stable toposorting by grouping and ungrouping the result

Signed-off-by: Ivan Danov <idanov@users.noreply.github.com>

---------

Signed-off-by: Ivan Danov <idanov@users.noreply.github.com>
Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>

* Optimise pipeline addition and creation (#3730)

* Create toposort groups only when needed
* Ensure that the suggest resume test has no node ordering requirement
* Ensure stable toposorting by grouping and ungrouping the result
* Delay toposorting until pipeline.nodes is used
* Avoid using .nodes when topological order or new copy is unneeded
* Copy the nodes only if tags are provided
* Remove unnecessary condition in self.nodes

Signed-off-by: Ivan Danov <idanov@users.noreply.github.com>
Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>

* Expand robots.txt for Kedro-Viz and Kedro-Datasets docs (#3729)

* Add project to robots.txt

Signed-off-by: Dmitry Sorokin <dmd40in@gmail.com>

* Add EOF

Signed-off-by: Dmitry Sorokin <dmd40in@gmail.com>

---------

Signed-off-by: Dmitry Sorokin <dmd40in@gmail.com>
Co-authored-by: Juan Luis Cano Rodríguez <juan_luis_cano@mckinsey.com>
Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>

* Kedro need more uv (#3740)

* Kedro need more uv

Signed-off-by: Nok <nok.lam.chan@quantumblack.com>

* remove docker

Signed-off-by: Nok <nok.lam.chan@quantumblack.com>

---------

Signed-off-by: Nok <nok.lam.chan@quantumblack.com>
Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>

* Resolve all path in Kedro (#3742)

* Kedro need more uv

Signed-off-by: Nok <nok.lam.chan@quantumblack.com>

* remove docker

Signed-off-by: Nok <nok.lam.chan@quantumblack.com>

* fix broken type hint and resolve project path

Signed-off-by: Nok Lam Chan <nok.lam.chan@quantumblack.com>

* fix type hint

Signed-off-by: Nok Lam Chan <nok.lam.chan@quantumblack.com>

* remove duplicate logic

Signed-off-by: Nok Lam Chan <nok.lam.chan@quantumblack.com>

* adding nok.py is definitely an accident

Signed-off-by: Nok Lam Chan <nok.lam.chan@quantumblack.com>

* fix test

Signed-off-by: Nok Lam Chan <nok.lam.chan@quantumblack.com>

* remove print

Signed-off-by: Nok Lam Chan <nok.lam.chan@quantumblack.com>

* add test

Signed-off-by: Nok <nok.lam.chan@quantumblack.com>

---------

Signed-off-by: Nok <nok.lam.chan@quantumblack.com>
Signed-off-by: Nok Lam Chan <nok.lam.chan@quantumblack.com>
Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>

* Remove settings of rate limits and retries (#3769)

* double linkcheck limits

Signed-off-by: Nok Lam Chan <nok.lam.chan@quantumblack.com>

* fix ratelimit

Signed-off-by: Nok <nok.lam.chan@quantumblack.com>

---------

Signed-off-by: Nok Lam Chan <nok.lam.chan@quantumblack.com>
Signed-off-by: Nok <nok.lam.chan@quantumblack.com>
Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>

* Improve resume suggestions (#3719)

* Improve suggestions to resume a failed pipeline

- if dataset (or param) is persistent & shared, don't keep looking for ancestors
- only look for ancestors producing impersistent inputs
- minimize number of suggested nodes (= shorter message for the same pipeline)
- testable logic, tests cases outside of scenarios for sequential runner

- Use _EPHEMERAL attribute
- Move tests to separate file
- Docstring updates

---------

Signed-off-by: Ondrej Zacha <ondrej.zacha@okra.ai>
Co-authored-by: Nok Lam Chan <nok.lam.chan@quantumblack.com>
Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>

* Build docs fix (#3773)

* Ignored forbidden url

Signed-off-by: Elena Khaustova <ymax70rus@gmail.com>

* Returned linkscheck retries

Signed-off-by: Elena Khaustova <ymax70rus@gmail.com>

* Removed odd comment

Signed-off-by: Elena Khaustova <ymax70rus@gmail.com>

---------

Signed-off-by: Elena Khaustova <ymax70rus@gmail.com>
Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>

* Clarify docs around custom resolvers (#3759)

* Updated custom resolver docs section

Signed-off-by: Elena Khaustova <ymax70rus@gmail.com>

* Updated advanced configuration section for consistency

Signed-off-by: Elena Khaustova <ymax70rus@gmail.com>

* Updated RELEASE.md

Signed-off-by: Elena Khaustova <ymax70rus@gmail.com>

* Updated RELEASE.md

Signed-off-by: Elena Khaustova <ymax70rus@gmail.com>

* Test linkcheck_workers decrease

Signed-off-by: Elena Khaustova <ymax70rus@gmail.com>

* Increased the By default, the linkcheck_rate_limit_timeout to default

Signed-off-by: Elena Khaustova <ymax70rus@gmail.com>

* Returned old docs build settings

Signed-off-by: Elena Khaustova <ymax70rus@gmail.com>

* Fixed typo

Signed-off-by: Elena Khaustova <ymax70rus@gmail.com>

* Ignore forbidden url

Signed-off-by: Elena Khaustova <ymax70rus@gmail.com>

* Returned linkcheck retries

Signed-off-by: Elena Khaustova <ymax70rus@gmail.com>

---------

Signed-off-by: Elena Khaustova <ymax70rus@gmail.com>
Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>

* Add mlruns to gitignore to avoid pushing  mlflow local runs to github (#3765)

* Add mlruns to gitignore to avoid pushing  mlflow local runs to github

Signed-off-by: Yolan Honoré-Rougé <yolan.honore.rouge@gmail.com>

* update release.md

Signed-off-by: Yolan Honoré-Rougé <yolan.honore.rouge@gmail.com>

---------

Signed-off-by: Yolan Honoré-Rougé <yolan.honore.rouge@gmail.com>
Signed-off-by: Merel Theisen <49397448+merelcht@users.noreply.github.com>
Co-authored-by: Merel Theisen <49397448+merelcht@users.noreply.github.com>
Co-authored-by: Nok Lam Chan <nok.lam.chan@quantumblack.com>
Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>

* Update the dependencies page in the docs (#3772)

* Update the dependencies page

Signed-off-by: Ankita Katiyar <ankitakatiyar2401@gmail.com>

* Update docs/source/kedro_project_setup/dependencies.md

Signed-off-by: Ankita Katiyar <110245118+ankatiyar@users.noreply.github.com>

* Apply suggestions from code review

Co-authored-by: Jo Stichbury <jo_stichbury@mckinsey.com>
Signed-off-by: Ankita Katiyar <110245118+ankatiyar@users.noreply.github.com>

* Fix lint

Signed-off-by: Ankita Katiyar <ankitakatiyar2401@gmail.com>

* Move the last line to notes

Signed-off-by: Ankita Katiyar <ankitakatiyar2401@gmail.com>

---------

Signed-off-by: Ankita Katiyar <ankitakatiyar2401@gmail.com>
Signed-off-by: Ankita Katiyar <110245118+ankatiyar@users.noreply.github.com>
Co-authored-by: Jo Stichbury <jo_stichbury@mckinsey.com>
Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>

* Change pipeline test location to project root/tests (#3731)

* Change pipeline test location to project root/tests

Signed-off-by: lrcouto <laurarccouto@gmail.com>

* Fix some test_pipeline tests

Signed-off-by: lrcouto <laurarccouto@gmail.com>

* Change delete pipeline to account for new structure

Signed-off-by: lrcouto <laurarccouto@gmail.com>

* Fix some tests

Signed-off-by: lrcouto <laurarccouto@gmail.com>

* Change tests path on micropkg

Signed-off-by: lrcouto <laurarccouto@gmail.com>

* Fix remaining tests

Signed-off-by: lrcouto <laurarccouto@gmail.com>

* Add changes to release notes

Signed-off-by: lrcouto <laurarccouto@gmail.com>

* Update file structure on micropackaging doc page

Signed-off-by: lrcouto <laurarccouto@gmail.com>

---------

Signed-off-by: lrcouto <laurarccouto@gmail.com>
Signed-off-by: L. R. Couto <57910428+lrcouto@users.noreply.github.com>
Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>

* Add an option for kedro new to skip telemetry (#3701)

* First draft for telemetry consent flag on kedro new

Signed-off-by: lrcouto <laurarccouto@gmail.com>

* Add functioning --telemetry option to kedro new

Signed-off-by: lrcouto <laurarccouto@gmail.com>

* Update tests to acknowledge new flag

Signed-off-by: lrcouto <laurarccouto@gmail.com>

* Add tests for kedro new --telemetry flag

Signed-off-by: lrcouto <laurarccouto@gmail.com>

* Add changes to documentation and release notes

Signed-off-by: lrcouto <laurarccouto@gmail.com>

* Minor change to docs

Signed-off-by: lrcouto <laurarccouto@gmail.com>

* Lint

Signed-off-by: lrcouto <laurarccouto@gmail.com>

* Remove outdated comment and correct type hint

Signed-off-by: lrcouto <laurarccouto@gmail.com>

* Update docs/source/get_started/new_project.md

Co-authored-by: Merel Theisen <49397448+merelcht@users.noreply.github.com>
Signed-off-by: L. R. Couto <57910428+lrcouto@users.noreply.github.com>

* Lint

Signed-off-by: lrcouto <laurarccouto@gmail.com>

* Minor change on release note

Signed-off-by: lrcouto <laurarccouto@gmail.com>

---------

Signed-off-by: lrcouto <laurarccouto@gmail.com>
Signed-off-by: L. R. Couto <57910428+lrcouto@users.noreply.github.com>
Co-authored-by: Merel Theisen <49397448+merelcht@users.noreply.github.com>
Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>

* Update documentation for OmegaConfigLoader (#3778)

* Update documentation for OmegaConfigLoader

Signed-off-by: Puneet Saini <99470400+puneeter@users.noreply.github.com>

* Update RELEASE.md

Signed-off-by: Puneet Saini <99470400+puneeter@users.noreply.github.com>

* Update RELEASE.md

Signed-off-by: Puneet Saini <99470400+puneeter@users.noreply.github.com>

* Update ignore-names.txt

Signed-off-by: Juan Luis Cano Rodríguez <juan_luis_cano@mckinsey.com>

---------

Signed-off-by: Puneet Saini <99470400+puneeter@users.noreply.github.com>
Signed-off-by: Juan Luis Cano Rodríguez <juan_luis_cano@mckinsey.com>
Co-authored-by: Juan Luis Cano Rodríguez <juan_luis_cano@mckinsey.com>
Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>

* Fix path

Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>

* Lint

Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>

* Add changes to RELEASE.md

Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>

* Address comments from code review

Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>

* Empty

Signed-off-by: Juan Luis Cano Rodríguez <juan_luis_cano@mckinsey.com>
Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>

* Remove unneeded imports

Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>

* Change recommendation from pytest config to editable install

Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>

* Add negative testing example

Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>

* Replace Dict with dict

Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>

* Remove test classes

Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>

* Change the assert step for the integration test

Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>

* Fix error handling for OmegaConfigLoader (#3784)

* Update omegaconf_config.py

Signed-off-by: Puneet Saini <99470400+puneeter@users.noreply.github.com>

* Update RELEASE.md

Signed-off-by: Puneet Saini <99470400+puneeter@users.noreply.github.com>

* add a more complicated test case

Signed-off-by: Nok <nok.lam.chan@quantumblack.com>

---------

Signed-off-by: Puneet Saini <99470400+puneeter@users.noreply.github.com>
Signed-off-by: Nok <nok.lam.chan@quantumblack.com>
Co-authored-by: Nok <nok.lam.chan@quantumblack.com>
Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>

* Add Simon Brugman to TSC (#3780)

Signed-off-by: Juan Luis Cano Rodríguez <juan_luis_cano@mckinsey.com>
Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>

* Update technical_steering_committee.md (#3796)

Signed-off-by: Marcin Zabłocki <m.zablo@gmail.com>
Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>

* Remove jmespath dependency (#3797)

Signed-off-by: Merel Theisen <merel.theisen@quantumblack.com>
Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>

* Update spaceflights tutorial and starter requirements for kedro-datasets optional dependencies (#3664)

* Update spaceflights tutorial and starter requirements

Signed-off-by: lrcouto <laurarccouto@gmail.com>

* fix e2e tests

Signed-off-by: lrcouto <laurarccouto@gmail.com>

* Fix e2e tests by distinguishing `kedro-datasets` dependency for different python versions (#3802)

Signed-off-by: Merel Theisen <merel.theisen@quantumblack.com>

* Update docs/source/tutorial/tutorial_template.md

Co-authored-by: Merel Theisen <49397448+merelcht@users.noreply.github.com>
Signed-off-by: L. R. Couto <57910428+lrcouto@users.noreply.github.com>

---------

Signed-off-by: lrcouto <laurarccouto@gmail.com>
Signed-off-by: L. R. Couto <57910428+lrcouto@users.noreply.github.com>
Signed-off-by: Merel Theisen <merel.theisen@quantumblack.com>
Co-authored-by: Merel Theisen <49397448+merelcht@users.noreply.github.com>
Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>

* Consider Vale's suggestions

Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>

* Hide test in details

Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>

* Quick fix

Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>

* Typo (and wording changes)

Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>

* Update robots.txt (#3803)

Signed-off-by: Ankita Katiyar <ankitakatiyar2401@gmail.com>
Co-authored-by: Juan Luis Cano Rodríguez <juan_luis_cano@mckinsey.com>
Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>

* Add a test for transcoding loops of 1 or more nodes (#3810)

Signed-off-by: Ivan Danov <idanov@users.noreply.github.com>
Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>

* Ensure no nodes can depend on themselves even when transcoding is used (#3812)

* Factor out transcoding helpers into a private module

Signed-off-by: Ivan Danov <idanov@users.noreply.github.com>

* Ensure node input/output validation doesn't allow transcoded self-loops

Signed-off-by: Ivan Danov <idanov@users.noreply.github.com>

* Updated release note to avoid github warning

Signed-off-by: Elena Khaustova <ymax70rus@gmail.com>

---------

Signed-off-by: Ivan Danov <idanov@users.noreply.github.com>
Signed-off-by: Elena Khaustova <ymax70rus@gmail.com>
Co-authored-by: Elena Khaustova <ymax70rus@gmail.com>
Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>

* Update UUID telemetry docs (#3805)

Signed-off-by: Dmitry Sorokin <dmd40in@gmail.com>
Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>

* Change path to starters test (#3816)

Signed-off-by: lrcouto <laurarccouto@gmail.com>
Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>

* Move changes in RELEASE.md to docs section

Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>

* Change formatting

Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>

* Revert "Change formatting"

This reverts commit 9582a22.

Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>

* Apply changes from code review

Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>

* Add explanation on why cleanup isn't needed

Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>

* Change assert on successful pipeline to check logs

Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>

* Update description of integration test under pipeline slicing

Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>

* Missing formatting

Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>

* Update tests directory structure

Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>

---------

Signed-off-by: Anthony De Bortoli <anthony.debortoli@protonmail.com>
Signed-off-by: Ahdra Merali <ahdra.merali@quantumblack.com>
Signed-off-by: Ivan Danov <idanov@users.noreply.github.com>
Signed-off-by: Dmitry Sorokin <dmd40in@gmail.com>
Signed-off-by: Nok <nok.lam.chan@quantumblack.com>
Signed-off-by: Nok Lam Chan <nok.lam.chan@quantumblack.com>
Signed-off-by: Ondrej Zacha <ondrej.zacha@okra.ai>
Signed-off-by: Elena Khaustova <ymax70rus@gmail.com>
Signed-off-by: Yolan Honoré-Rougé <yolan.honore.rouge@gmail.com>
Signed-off-by: Merel Theisen <49397448+merelcht@users.noreply.github.com>
Signed-off-by: Ankita Katiyar <ankitakatiyar2401@gmail.com>
Signed-off-by: Ankita Katiyar <110245118+ankatiyar@users.noreply.github.com>
Signed-off-by: lrcouto <laurarccouto@gmail.com>
Signed-off-by: L. R. Couto <57910428+lrcouto@users.noreply.github.com>
Signed-off-by: Puneet Saini <99470400+puneeter@users.noreply.github.com>
Signed-off-by: Juan Luis Cano Rodríguez <juan_luis_cano@mckinsey.com>
Signed-off-by: Marcin Zabłocki <m.zablo@gmail.com>
Signed-off-by: Merel Theisen <merel.theisen@quantumblack.com>
Signed-off-by: Ahdra Merali <90615669+AhdraMeraliQB@users.noreply.github.com>
Co-authored-by: Anthony De Bortoli <anthony.debortoli@protonmail.com>
Co-authored-by: Ivan Danov <idanov@users.noreply.github.com>
Co-authored-by: Dmitry Sorokin <40151847+DimedS@users.noreply.github.com>
Co-authored-by: Juan Luis Cano Rodríguez <juan_luis_cano@mckinsey.com>
Co-authored-by: Nok Lam Chan <nok.lam.chan@quantumblack.com>
Co-authored-by: Ondrej Zacha <ondrej.zacha@okra.ai>
Co-authored-by: ElenaKhaustova <157851531+ElenaKhaustova@users.noreply.github.com>
Co-authored-by: Yolan Honoré-Rougé <29451317+Galileo-Galilei@users.noreply.github.com>
Co-authored-by: Merel Theisen <49397448+merelcht@users.noreply.github.com>
Co-authored-by: Ankita Katiyar <110245118+ankatiyar@users.noreply.github.com>
Co-authored-by: Jo Stichbury <jo_stichbury@mckinsey.com>
Co-authored-by: L. R. Couto <57910428+lrcouto@users.noreply.github.com>
Co-authored-by: Puneet Saini <99470400+puneeter@users.noreply.github.com>
Co-authored-by: Marcin Zabłocki <m.zablo@gmail.com>
Co-authored-by: Elena Khaustova <ymax70rus@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants