
Support for heterogeneous concurrency limits #7834

Open
3 tasks done
felix-ht opened this issue Dec 9, 2022 · 4 comments
Labels: concurrency, enhancement (An improvement of an existing feature)

Comments


felix-ht commented Dec 9, 2022

First check

  • I added a descriptive title to this issue.
  • I used the GitHub search to find a similar request and didn't find it.
  • I searched the Prefect documentation for this feature.

Prefect Version

2.x

Describe the current behavior

Currently, I can only set concurrency limits flatly per agent or per queue.

Describe the proposed behavior

I want to specify that a machine can support one flow that requires a GPU, but many flows that do not require a GPU.

Example Use

The typical use case for this is a GPU node.

  • Run at most one GPU workflow on the GPU node
  • Run many CPU-only workflows concurrently on the GPU node

Let's take the following example:
Preprocessing -> Training -> Postprocessing

Preprocessing and Postprocessing would launch many concurrent flows (on the same machine), while Training might have only one flow that runs on the GPU. It must be ensured that no more than one GPU flow runs at the same time; otherwise there will be resource conflicts.

So the CPU-only concurrency limit would be, say, 10, while the limit for GPU-enabled tasks would be 1 (see the sketch below).
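For illustration, a minimal Python sketch of this pipeline expressed with Prefect 2 task tags (the task and tag names are hypothetical); the point of the request is that the "gpu" tag's limit should apply per agent/machine rather than globally:

from prefect import flow, task

@task(tags=["cpu"])
def preprocess(chunk):
    ...

@task(tags=["gpu"])  # desired: at most 1 concurrent run on the GPU node
def train(datasets):
    ...

@task(tags=["cpu"])
def postprocess(model):
    ...

@flow
def pipeline(chunks):
    # many CPU-bound task runs may execute concurrently (limit of, say, 10)
    datasets = [preprocess.submit(c) for c in chunks]
    # at most one GPU-bound task run at a time
    model = train.submit(datasets)
    return postprocess.submit(model)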

Additional context

No response

felix-ht added the enhancement and status:triage labels on Dec 9, 2022

felix-ht commented Dec 15, 2022

I just noticed that you added https://docs.prefect.io/concepts/tasks/?h=conc#task-run-concurrency-limits

If we could specify task concurrency limits per agent as well, this issue would be resolved. That would also align quite nicely with how queue concurrency can be limited per queue or per agent, e.g.:

prefect agent start --tag-concurrency-limit GPU 1
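For reference, tag-based task run concurrency limits can already be created with the Prefect 2 CLI per the linked docs, though they apply globally rather than per agent; the --tag-concurrency-limit flag above is the proposal, not an existing option. A rough sketch with the existing commands (the GPU tag is just the example from above):

# existing, global task-run concurrency limit for the "GPU" tag
prefect concurrency-limit create GPU 1

# inspect or remove it later
prefect concurrency-limit inspect GPU
prefect concurrency-limit delete GPU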

felix-ht commented

@madkinsz just tagging you so this doesn't get lost.

zanieb (Contributor) commented Jan 12, 2023

This should be addressed fully by the "Work pool" concept that is currently experimental.


felix-ht commented Feb 13, 2023

@madkinsz so work pools just dropped; however, I cannot see how I am supposed to use them to achieve the desired result.

The only option I see is to keep doing it the way we are currently doing it:

Each node has two agents running:

  • one agent that polls from a GPU queue, with the agent's concurrency limit set to 1
  • another agent that polls from CPU queues (sketched below)
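A sketch of that two-agent setup with the current CLI, assuming hypothetical queue names gpu-queue and cpu-queue (an agent-level flow run limit, if available in your Prefect version, could be used instead of the queue-level limit):

# cap the GPU work queue at one concurrent flow run
prefect work-queue set-concurrency-limit gpu-queue 1

# agent dedicated to the GPU queue
prefect agent start -q gpu-queue

# second agent on the same node for the CPU-only work
prefect agent start -q cpu-queue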

This has the big shortcoming that the whole flow running on the GPU machine has to run with a concurrency limit of 1, and that on a machine that might have as many as 240 CPU threads and 2 TB of RAM. The training itself uses the cores, but the fact that the rest of the flow cannot is weird, to say the least.
