Increase ControlQueueBufferThreshold defaults in Consumption #1846
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
We recently learned that some customers are experiencing increased latency in their orchestration completion times in the Consumption plan. This is because we recently (#1706) changed our concurrency defaults in the Consumption plan to accommodate its resource constraints and scaling behavior.
In that process, we decreased the value of
ControlQueueBufferThreshold
from 256 to just 32. That decrease appears to blame for the increased latency.This PR increases the default for JS and C# to be 128, a less dramatic decrease from 256. For Python, we keep the default as 32 as we theorize that the Python language worker would benefit from a small
ControlQueueBufferThreshold
value.This PR comes as a bit of a hot-patch and is subject to change in the future. In particular, we have a pending work item (#1700) to come up with PL-aware concurrency defaults, as well as improving the design of the
IPlatformInformationService
interface. Those changes are still pending.Issue describing the changes in this PR
Relates to #1842
Pull request checklist
pending_docs.md
release_notes.md