
Fix BYOK Quota exceeded caused by Intent Detection with Copilot model #228


Open · wants to merge 3 commits into base: main

Conversation

@fvclaus fvclaus commented Jul 12, 2025

Fixes microsoft/vscode#251944

Issue

Currently, intent detection is hard-coded to use the Copilot gpt-4o-mini model. This is a problem when you have exceeded your chat quota.

This results in one of two errors.

Sorry, your request failed. Reason: Canceled

This happens after the BYOK chat request returns with a 200: the code resets the Copilot token because it doesn't check whether the response originated from the Copilot API.

```ts
if (response.status === 200 && authenticationService.copilotToken?.isFreeUser && authenticationService.copilotToken?.isChatQuotaExceeded) {
	authenticationService.resetCopilotToken();
}
```
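A stricter guard could be sketched as follows. This is only an illustration: the interfaces and the `fromCopilotApi` flag are hypothetical stand-ins, not the extension's real API.

```typescript
// Hypothetical shapes standing in for the extension's real services.
interface CopilotToken {
  isFreeUser: boolean;
  isChatQuotaExceeded: boolean;
}

interface AuthenticationService {
  copilotToken?: CopilotToken;
  resetCopilotToken(): void;
}

// Only reset the Copilot token for responses that actually came from the
// Copilot API; a 200 from a BYOK endpoint is left alone.
function maybeResetToken(
  response: { status: number; fromCopilotApi: boolean }, // flag is hypothetical
  authenticationService: AuthenticationService
): boolean {
  if (
    response.fromCopilotApi && // new check: ignore BYOK responses
    response.status === 200 &&
    authenticationService.copilotToken?.isFreeUser &&
    authenticationService.copilotToken?.isChatQuotaExceeded
  ) {
    authenticationService.resetCopilotToken();
    return true;
  }
  return false;
}
```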

This also causes unnecessary network requests.

The second error I have seen (possibly on older versions of the extension) is the quota exceeded error:


Solution

I have tried to fix this in a way that remains extensible:

  • Decouple the intent models from the regular models (they are currently the same, but this leaves the option to change them later)
  • Design the API of the new service so that the intent detection model can be set per language model (currently there is only one intent detection model for all models)
  • The simplest fix would have been to use the current model of the chat request for intent detection, but that is potentially wasteful with expensive models, and it isn't transparent to the user that another API request is sent behind the scenes
  • Keep the BYOK code separate from the rest, so the BYOK code registers its models with the new service on startup
  • For all Copilot models, gpt-4o-mini is still used. I mentioned this in the quick pick message for users who want the same behaviour as before
  • I didn't want to mix intent detection with a Copilot model into a chat request with a BYOK model, because that makes the system more complicated and harder to reason about
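The per-model registry described above could look roughly like this. All names (`IntentModelService`, `registerIntentModel`, the model IDs) are hypothetical; this is a sketch of the design, not the actual code in this PR.

```typescript
// Sketch of a per-model intent detection registry. BYOK code registers an
// intent model for each of its chat models on startup; Copilot models fall
// back to the previous default, gpt-4o-mini.
class IntentModelService {
  private readonly byModel = new Map<string, string>();
  private readonly defaultModel = 'gpt-4o-mini'; // same behaviour as before

  // Called by the BYOK code on startup for each model it provides.
  registerIntentModel(chatModelId: string, intentModelId: string): void {
    this.byModel.set(chatModelId, intentModelId);
  }

  // Resolve which model to use for intent detection of a given chat model.
  getIntentModel(chatModelId: string): string {
    return this.byModel.get(chatModelId) ?? this.defaultModel;
  }
}
```

This keeps the BYOK path from ever issuing an intent detection request against the Copilot API, which is what triggered the token reset.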

With this PR, an error message is shown the first time a BYOK model is used.

The button opens a quick pick

@fvclaus fvclaus changed the title Fix BYOK Quota exceeded caused by Intent Detection Fix BYOK Quota exceeded caused by Intent Detection with Copilot model Jul 12, 2025
@fvclaus (Author) commented Jul 13, 2025

I have moved the new service to common to fix the tests; vscode dependencies are now imported dynamically. Maybe there is a better way. The tests run fine in my repo now.

@fvclaus (Author) commented Jul 13, 2025

I have released the code. You can download it from here: https://github.com/fvclaus/vscode-copilot-chat/releases/tag/v0.29.0-byok-fix-v1

To install it, go to settings, click the three dots, and select "Install from VSIX...".

Successfully merging this pull request may close these issues.

Copilot Free users with exhausted Chat requests can no longer use BYOK