Skip to content

[BUG] Not possible to enable dynamic throttling/quota for OpenAI deployments? #30507

Open

Description

API Spec link

https://github.com/Azure/azure-rest-api-specs/blob/main/specification/cognitiveservices/resource-manager/Microsoft.CognitiveServices/preview/2024-06-01-preview/cognitiveservices.json

API Spec version

2024-06-01-preview

Describe the bug

The Azure portal has a switch "Enable Dynamic Quota" under "Edit Deployment" for Cognitive Services Deployments.

The spec doesn't have anything called "dynamic quota". It does have property dynamicThrottlingEnabled, but it's only settable for CS accounts. For deployments, it (or the containing property) is marked "readOnly": true.

How can API clients "Enable Dynamic Quota"?

Expected behavior

The API offers the same functionality than the portal, including "Enable Dynamic Quota".

Actual behavior

dynamicThrottlingEnabled is read-only.

Reproduction Steps

Just based on the linked API spec.

Environment

No response

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Metadata

Assignees

Labels

MgmtThis issue is related to a management-plane library.bugThis issue requires a change to an existing behavior in the product in order to be resolved.customer-reportedIssues that are reported by GitHub users external to the Azure organization.questionThe issue doesn't require a change to the product in order to be resolved. Most issues start as that

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions