Add `concurrency.{max,current}` to `/health-check` endpoint

- `max` is from `cog.yaml` and `current` is `len(pending)` of the sole runner
- `max` == `maxRunners` and `current` is the sum of `len(pending)` across all runners

For procedure mode:

- `maxRunners` = # CPU x 4, this is also the Cog server max concurrency
- `concurrency.max` in each runner's `cog.yaml`, which is 1 for non-async runners
- `concurrency.max` has been reached, e.g. 4 copies for an async runner with `concurrency.max` = 2 for a total of 8 concurrent predictions
- `maxRunners`, to prevent context switching, even if any runner might claim a higher `concurrency.max`, e.g. with # CPU = 1, `maxRunners` = 4, 4 predictions on a single runner even if its `concurrency.max` = 16
- `concurrency.current` < `concurrency.max`, we're guaranteed to have an evict-able runner slot.
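
The concurrency rules described above can be illustrated with a minimal Python sketch. It is not the PR's implementation: the `Runner` class and the functions `max_runners`, `effective_limit`, and `health_check_concurrency` are hypothetical names introduced for illustration, and reading the `maxRunners` cap as `min(concurrency.max, maxRunners)` is an assumption based on the "# CPU = 1" example.

```python
from dataclasses import dataclass, field


@dataclass
class Runner:
    """Hypothetical stand-in for a runner; not the actual Cog type."""
    max_concurrency: int = 1  # concurrency.max from cog.yaml (1 for non-async)
    pending: list = field(default_factory=list)  # in-flight predictions


def max_runners(cpu_count: int) -> int:
    # Procedure mode: maxRunners = # CPU x 4
    return cpu_count * 4


def effective_limit(runner_max: int, cpu_count: int) -> int:
    # Assumption: a runner's declared concurrency.max is capped by maxRunners.
    return min(runner_max, max_runners(cpu_count))


def health_check_concurrency(runners, procedure_mode: bool, cpu_count: int) -> dict:
    """Compute the concurrency.{max,current} pair reported by the health check."""
    if procedure_mode:
        # max == maxRunners; current is the sum of len(pending) across all runners
        return {
            "max": max_runners(cpu_count),
            "current": sum(len(r.pending) for r in runners),
        }
    # Otherwise: max is from cog.yaml; current is len(pending) of the sole runner
    sole = runners[0]
    return {"max": sole.max_concurrency, "current": len(sole.pending)}
```

With one CPU, `effective_limit(16, 1)` gives 4, matching the example of a single runner limited to 4 predictions even though its `concurrency.max` is 16.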