[feature] Ability to disable caching for a particular pipeline run via the UI #6578

jackwhelpton · 2021-09-16T21:00:05Z

Feature Area

/area frontend

What feature would you like to see?

The ability to enable or disable (v2) caching for a pipeline run via the UI.

What is the use case or pain point?

When developing pipelines, it is possible to end up in a position where the component "succeeds" (does not error), but returns erroneous results. Once this has happened, correcting the component code and re-executing does not suffice, as the previous (incorrect) results are cached.

Is there a workaround currently?

We separate authoring pipelines from executing them, so typically our executions are via the Vertex Pipelines UI, by uploading the compiled JSON.

One workaround would be to extend the pipeline code to support executing the pipeline from code, which does allow the caching to be disabled. This change would have to be made to all our pipelines, and is onerous.

Either move the input files and change the components to bust the cache, or execute the pipeline using Python.

Neither of these approaches are ideal, and neither would be easily available to non-coders when they execute the pipeline (people who can upload a given JSON file and set parameters, but not make code changes).

Love this idea? Give it a 👍. We prioritize fulfilling features with the most 👍.

capri-xiyue · 2021-09-21T18:20:26Z

@zijianjoy
I talked with sdk side and vertex side, the agreement is to implement disable/enable caching level at client side first.
Later when we have a clear goal of caching CUJ, we can talk about implementing this in server side which means change pipeline job proto.
SDK already implemented such logic in

pipelines/sdk/python/kfp/v2/google/client/client.py

Lines 140 to 152 in 55a2fb5

    
           def _set_enable_caching_value(pipeline_spec: Dict[str, Any], 
        
                                         enable_caching: bool) -> None: 
        
               """Sets pipeline tasks caching options. 
        
               Args: 
        
                pipeline_spec: The dictionary of pipeline spec. 
        
                enable_caching: Whether to enable caching. 
        
               """ 
        
               for component in [pipeline_spec['root']] + list( 
        
                       pipeline_spec['components'].values()): 
        
                   if 'dag' in component: 
        
                       for task in component['dag']['tasks'].values(): 
        
                           task['cachingOptions'] = {'enableCache': enable_caching}

, you can follow such logic in UI.

zijianjoy · 2021-10-11T03:42:07Z

@capri-xiyue Sounds good on providing the ability to disable pipeline level caching on UI side, once we updated pipeline job proto to support this field. But question: currently when you create a run from pipeline template, UI doesn't use the PipelineJob payload itself, instead UI will create a run using pipeline_version_id for PIPELINE_VERSION resource_reference. So I guess it should be backend which exposes caching configuration first?

capri-xiyue · 2021-10-11T18:00:39Z

@capri-xiyue Sounds good on providing the ability to disable pipeline level caching on UI side, once we updated pipeline job proto to support this field. But question: currently when you create a run from pipeline template, UI doesn't use the PipelineJob payload itself, instead UI will create a run using pipeline_version_id for PIPELINE_VERSION resource_reference. So I guess it should be backend which exposes caching configuration first?

Can UI manipulate the template before UI call backend?

capri-xiyue · 2021-12-13T19:17:02Z

Discussed offline, backend needs to expose cache configuration first and then front end can disable caching for a particular pipeline run via the UI

capri-xiyue · 2021-12-24T00:04:03Z

Reassigned it to James Wu to further discuss the priority and the assignee

stale · 2022-04-17T06:27:12Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

juliusvonkohout · 2022-09-13T10:15:00Z

We will cover this and more in #8177 hopefully we can present it tomorrow in the KFP meeting

thesuperzapper · 2024-04-19T17:47:22Z

@zijianjoy it would be great to disable caching for a specific run via the UI, what do you think?

Many users don't understand what the cache does, and having that option in the UI would help them understand that sometimes their runs will pull from a cache.

Note, we already have a feature to do this when submitting runs from the SDK:

https://www.kubeflow.org/docs/components/pipelines/v2/caching/#how-to-use-caching

juliusvonkohout · 2024-04-22T09:34:17Z

There are also some older issues and PRs for exactly that. For example #8177

github-actions · 2024-06-22T07:41:46Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

juliusvonkohout · 2024-06-24T06:56:46Z

/lifecycle frozen

jackwhelpton added the kind/feature label Sep 16, 2021

google-oss-robot added the area/frontend label Sep 16, 2021

capri-xiyue assigned zijianjoy Sep 21, 2021

zijianjoy assigned capri-xiyue and unassigned zijianjoy Oct 11, 2021

capri-xiyue assigned james-jwu and unassigned capri-xiyue Dec 24, 2021

stale bot added the lifecycle/stale The issue / pull request is stale, any activities remove this label. label Apr 17, 2022

stale bot removed the lifecycle/stale The issue / pull request is stale, any activities remove this label. label Sep 13, 2022

github-actions bot added the lifecycle/stale The issue / pull request is stale, any activities remove this label. label Jun 22, 2024

google-oss-prow bot added lifecycle/frozen and removed lifecycle/stale The issue / pull request is stale, any activities remove this label. labels Jun 24, 2024

gregsheremeta mentioned this issue Aug 9, 2024

[feature] Option to disable Caching in V2 at the KFP, Pipeline, and Run level #10839

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[feature] Ability to disable caching for a particular pipeline run via the UI #6578

[feature] Ability to disable caching for a particular pipeline run via the UI #6578

jackwhelpton commented Sep 16, 2021

capri-xiyue commented Sep 21, 2021

zijianjoy commented Oct 11, 2021 •

edited

Loading

capri-xiyue commented Oct 11, 2021

capri-xiyue commented Dec 13, 2021

capri-xiyue commented Dec 24, 2021

stale bot commented Apr 17, 2022

juliusvonkohout commented Sep 13, 2022

thesuperzapper commented Apr 19, 2024

juliusvonkohout commented Apr 22, 2024 •

edited

Loading

github-actions bot commented Jun 22, 2024

juliusvonkohout commented Jun 24, 2024

[feature] Ability to disable caching for a particular pipeline run via the UI #6578

[feature] Ability to disable caching for a particular pipeline run via the UI #6578

Comments

jackwhelpton commented Sep 16, 2021

Feature Area

What feature would you like to see?

What is the use case or pain point?

Is there a workaround currently?

Neither of these approaches are ideal, and neither would be easily available to non-coders when they execute the pipeline (people who can upload a given JSON file and set parameters, but not make code changes).

capri-xiyue commented Sep 21, 2021

zijianjoy commented Oct 11, 2021 • edited Loading

capri-xiyue commented Oct 11, 2021

capri-xiyue commented Dec 13, 2021

capri-xiyue commented Dec 24, 2021

stale bot commented Apr 17, 2022

juliusvonkohout commented Sep 13, 2022

thesuperzapper commented Apr 19, 2024

juliusvonkohout commented Apr 22, 2024 • edited Loading

github-actions bot commented Jun 22, 2024

juliusvonkohout commented Jun 24, 2024

zijianjoy commented Oct 11, 2021 •

edited

Loading

juliusvonkohout commented Apr 22, 2024 •

edited

Loading