feat(openai): support for gpt-4o-transcribe-diarize #12408

psinha40898 · 2026-02-10T21:26:58Z

Background

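Currently there is no reasonable way to use the gpt-4o-transcribe-diarize model through the AI SDK: the SDK defaults response_format to verbose_json, which this model does not support.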

Summary

Default the providerOptions responseFormat to diarized_json for this model, following the convention established in fix(provider/openai): do not set response_format to verbose_json if model is gpt-4o-transcribe #8246 (comment).
Extend the zod validation to accept the TranscriptionDiarized type returned by the OpenAI API:
https://developers.openai.com/api/reference/resources/audio/subresources/transcriptions/methods/create

Together, these two changes let users receive diarized_json responses, which is what makes the model meaningfully usable.

Allow users to pass chunking_strategy, which the model requires for audio longer than 30 seconds.
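As a rough illustration of the defaulting rule described above, here is a minimal sketch. The helper names (`defaultResponseFormat`, `needsChunkingStrategy`) are hypothetical and are not taken from the SDK source, and the list of models that reject verbose_json is an assumption based on #8246.

```typescript
// Hypothetical helpers for illustration only; not the SDK's actual internals.
type ResponseFormat = 'json' | 'verbose_json' | 'diarized_json';

// Pick a default response_format from the model id.
function defaultResponseFormat(modelId: string): ResponseFormat {
  switch (modelId) {
    // The diarize model only returns speaker labels with diarized_json.
    case 'gpt-4o-transcribe-diarize':
      return 'diarized_json';
    // Assumption: these models reject verbose_json (see #8246).
    case 'gpt-4o-transcribe':
    case 'gpt-4o-mini-transcribe':
      return 'json';
    // Every other model keeps the existing verbose_json default.
    default:
      return 'verbose_json';
  }
}

// The diarize model needs a chunking strategy for audio longer than 30 seconds.
function needsChunkingStrategy(
  modelId: string,
  durationSeconds: number,
): boolean {
  return modelId === 'gpt-4o-transcribe-diarize' && durationSeconds > 30;
}
```

Keeping the rule in one small function means a new model id only needs one extra case, and the existing default for other models stays untouched.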

Manual Verification

I confirmed that on main, code like

import { openai } from '@ai-sdk/openai';
import { experimental_transcribe as transcribe } from 'ai';
import { readFile } from 'fs/promises';

const transcript = await transcribe({
  model: openai.transcription('gpt-4o-transcribe-diarize'),
  audio: await readFile('audio.mp3'),
  providerOptions: {
    openai: {
      chunking_strategy: 'auto',
    },
  },
});

returns a response_format error, because the AI SDK defaults response_format to verbose_json, which this model does not support.

With this PR, the same call passes chunking_strategy through and returns a fully diarized JSON response.

console.log("Full Diarized Response:", transcript.responses); // this is now useful for this model
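To show why a diarized response is useful, here is a minimal sketch that merges consecutive segments from the same speaker into turns. The `DiarizedSegment` shape is an assumed subset of the fields in the diarized response (speaker, start, end, text), and `toSpeakerTurns` is a hypothetical helper. How the segments are extracted from the result depends on what the SDK exposes and is not shown.

```typescript
// Assumed minimal segment shape; the real diarized response may carry more fields.
interface DiarizedSegment {
  speaker: string;
  start: number; // seconds
  end: number; // seconds
  text: string;
}

interface SpeakerTurn {
  speaker: string;
  start: number;
  end: number;
  text: string;
}

// Merge consecutive segments spoken by the same speaker into a single turn.
function toSpeakerTurns(segments: DiarizedSegment[]): SpeakerTurn[] {
  const turns: SpeakerTurn[] = [];
  for (const segment of segments) {
    const last = turns[turns.length - 1];
    if (last !== undefined && last.speaker === segment.speaker) {
      // Same speaker as the previous turn: extend it.
      last.end = segment.end;
      last.text = `${last.text} ${segment.text.trim()}`;
    } else {
      // Speaker changed (or first segment): start a new turn.
      turns.push({
        speaker: segment.speaker,
        start: segment.start,
        end: segment.end,
        text: segment.text.trim(),
      });
    }
  }
  return turns;
}
```

Each resulting turn can then be rendered as one line per speaker, for example `${turn.speaker}: ${turn.text}`.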

Checklist

Tests have been added / updated (for bug fixes / features)
Documentation has been added / updated (for bug fixes / features)
A patch changeset for relevant packages has been added (for bug fixes / features - run pnpm changeset in the project root)
I have reviewed this pull request (self-review)

Future Work

The examples could be much better if body were added to TranscriptionModelResponseMetaData.
In general, transcription models have less feature parity across providers than regular text generation models.

Related Issues

Fixes #11679

Related to #12409

vercel

Additional Suggestion:

The response_format is only appended to form data when providerOptions.openai is provided, so transcription requests without provider options send no response_format to the API.

psinha40898 · 2026-02-10T21:43:19Z

Additional Suggestion:

The response_format is only appended to form data when providerOptions.openai is provided, so transcription requests without provider options send no response_format to the API.

This is because the scope of this PR is not to change the opinions established in the codebase, but to extend them to improve DX and increase feature coverage.
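The behavior discussed in this thread can be sketched as follows. This is a simplified, hypothetical model of the request building, not the provider's actual code: `buildFormFields` and `OpenAITranscriptionOptions` are illustrative names, and a plain record stands in for the multipart form data.

```typescript
// Plain record standing in for multipart form fields (illustration only).
type FormFields = Record<string, string>;

// Hypothetical subset of the OpenAI transcription provider options.
interface OpenAITranscriptionOptions {
  responseFormat?: string;
  chunkingStrategy?: string;
}

// Mirrors the behavior described in the review comment: response_format is
// only added when provider options for openai are present.
function buildFormFields(
  modelId: string,
  defaultFormat: string,
  openaiOptions?: OpenAITranscriptionOptions,
): FormFields {
  const fields: FormFields = { model: modelId };
  if (openaiOptions !== undefined) {
    fields['response_format'] = openaiOptions.responseFormat ?? defaultFormat;
    if (openaiOptions.chunkingStrategy !== undefined) {
      // camelCase option mapped to the snake_case API field.
      fields['chunking_strategy'] = openaiOptions.chunkingStrategy;
    }
  }
  return fields;
}
```

With no options, the request carries only the model field and the API applies its own default format. Passing an empty options object is enough to trigger the SDK-side default.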

psinha40898 added 6 commits February 5, 2026 04:42

feat(openai): full support for gpt-4o-transcribe-diarize model

2236e74

Merge branch 'main' into pyush/openai/fix/gpt-4o-transcribe-diarize

b188bfc

docs: inaccurate table and fix accidental regression for includes parameter

eca2791

chore: parse the chunking_strategy object to camelCase inside the transcriptionModelOptions just like the other providerOptions

b335db9

chore:changeset

9b478d9

chore: better example given current ai sdk limitations

bfbc680

vercel-ai-sdk bot added the ai/provider label Feb 10, 2026

psinha40898 mentioned this pull request Feb 10, 2026

Feature Request: add body to TranscriptionModelResponseMetaData #12409

Open

1 task

psinha40898 marked this pull request as ready for review February 10, 2026 21:36

psinha40898 changed the title ~~feat(openai): support for gpt-4o-transcribe~~ feat(openai): support for gpt-4o-transcribe-diarize Feb 10, 2026

vercel bot reviewed Feb 10, 2026
