Open AI chat completion API "Retry-After" header is not exposed to the browser

Hi, I'm trying to stay within the tokens-per-minute limit enforced by the OpenAI Chat Completion API.

When the server responds with a 429 error, I expect the `Retry-After` response header to be exposed to the browser so the caller can retry after the period the server suggests. The [documentation here](https://learn.microsoft.com/en-us/azure/ai-services/openai/quotas-limits#general-best-practices-to-remain-within-rate-limits) specifically calls for such a retry strategy:

> General best practices to remain within rate limits
>
> To minimize issues related to rate limits, it's a good idea to use the following techniques:
> - Implement retry logic in your application.
> - ...


The problem is that the response headers are set such that the `Retry-After` field is hidden from scripts running in the browser. See the screenshot below:

![image](https://github.com/Azure/azure-rest-api-specs/assets/1895289/5780ab8a-f1f2-48ad-91af-07088967025e)

This makes it difficult for the client to set the right timeout. Please consider adding `Retry-After` to the `Access-Control-Expose-Headers` field.
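
To illustrate the impact, here is a minimal sketch of the client-side retry delay logic I have in mind. For a cross-origin response, `headers.get("retry-after")` returns `null` in the browser unless the header is listed in `Access-Control-Expose-Headers`, so today the code always ends up in the fallback branch. The names `HeaderReader`, `parseRetryAfterMs`, and `retryDelayMs`, as well as the 1 second base and 60 second cap, are my own assumptions for illustration and are not part of any SDK.

```typescript
// Minimal structural type so the sketch works with fetch's Headers or a test stub.
interface HeaderReader {
  get(name: string): string | null;
}

// Parse a Retry-After value (delay in seconds, or an HTTP-date) into milliseconds.
// Returns null when the value is missing or cannot be parsed.
function parseRetryAfterMs(value: string | null, nowMs: number = Date.now()): number | null {
  if (value === null) return null;
  const trimmed = value.trim();
  // Form 1: a non-negative integer number of seconds.
  if (/^\d+$/.test(trimmed)) return Number(trimmed) * 1000;
  // Form 2: an HTTP-date; wait until that moment (never a negative delay).
  const dateMs = Date.parse(trimmed);
  if (Number.isNaN(dateMs)) return null;
  return Math.max(0, dateMs - nowMs);
}

// Prefer the server's suggestion; otherwise fall back to capped exponential backoff.
// The base and cap values are assumptions, not documented service behavior.
function retryDelayMs(headers: HeaderReader, attempt: number, nowMs: number = Date.now()): number {
  const fromServer = parseRetryAfterMs(headers.get("retry-after"), nowMs);
  if (fromServer !== null) return fromServer;
  const baseMs = 1000;
  const capMs = 60000;
  return Math.min(capMs, baseMs * 2 ** attempt);
}
```

With the header exposed, a caller could pass `response.headers` from a 429 response to `retryDelayMs` and wait exactly as long as the server asks. Without it, the client can only guess via backoff, which may retry too early or wait longer than needed.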

References:
- [Understanding rate limits](https://learn.microsoft.com/en-us/azure/ai-services/openai/how-to/quota#understanding-rate-limits)
- [General best practices to remain within rate limits](https://learn.microsoft.com/en-us/azure/ai-services/openai/quotas-limits)





Open AI chat completion API "Retry-After" header is not exposed to the browser #24904

chuanqisun opened on Jul 20, 2023
