Shubhra/ajs 32 add support for realtime model #530

Shubhrakanti · 2025-07-11T20:11:44Z

Adds support to realtime model. Voice only - function calling is not yet enabled. There is still a bug with interrupts. The audio immediately after an interrupt gets cut off.

changeset-bot · 2025-07-11T20:11:49Z

⚠️ No Changeset found

Latest commit: c57eb66

Merging this PR will not cause a version bump for any packages. If these changes should not result in a new version, you're good to go. If these changes should result in a version bump, you need to add a changeset.

This PR includes no changesets

When changesets are added to this PR, you'll see the packages that this PR includes changesets for and the associated semver types

Click here to learn what changesets are, and how to add one.

Click here if you're a maintainer who wants to add a changeset to this PR

Shubhrakanti · 2025-07-14T21:06:46Z

plugins/openai/src/realtime/realtime_model.ts

+  constructor(
+    options: {
+      model?: string;
+      voice?: string;
+      temperature?: number;
+      toolChoice?: llm.ToolChoice;
+      baseURL?: string;
+      inputAudioTranscription?: api_proto.InputAudioTranscription | null;
+      // TODO(shubhra): add inputAudioNoiseReduction
+      turnDetection?: api_proto.TurnDetectionType | null;
+      speed?: number;
+      // TODO(shubhra): add openai tracing options
+      azureDeployment?: string;
+      apiKey?: string;
+      entraToken?: string;
+      apiVersion?: string;
+      maxSessionDuration?: number;
+      // TODO(shubhra): add connOptions
+    } = {},
+  ) {


just adds a default value

Shubhrakanti · 2025-07-14T21:06:54Z

plugins/openai/src/realtime/realtime_model.ts

+      for (const nf of this.bstream.write(f.data.buffer)) {
        this.sendEvent({
          type: 'input_audio_buffer.append',
-          audio: Buffer.from(nf.data).toString('base64'),
+          audio: Buffer.from(nf.data.buffer).toString('base64'),


really nasty bug

Shubhrakanti · 2025-07-14T21:08:11Z

plugins/openai/src/realtime/realtime_model.ts

+        textStream: new ReadableStream<string>({
+          async start(controller) {
+            for await (const chunk of itemGeneration.textChannel) {
+              controller.enqueue(chunk);
+            }
+          },
+          cancel() {
+            itemGeneration.textChannel.close();
+          },
+        }),
+        audioStream: new ReadableStream<AudioFrame>({
+          async start(controller) {
+            for await (const chunk of itemGeneration.audioChannel) {
+              controller.enqueue(chunk);
+            }
+          },
+          cancel() {
+            itemGeneration.audioChannel.close();
+          },
+        }),


might need to reconsider how we do this in the future but fine for now.

Shubhrakanti · 2025-07-14T21:08:41Z

plugins/openai/src/realtime/api_proto.ts

@@ -416,6 +416,7 @@ export interface InputAudioBufferSpeechStoppedEvent extends BaseServerEvent {

 export interface ConversationItemCreatedEvent extends BaseServerEvent {
  type: 'conversation.item.created';
+  previous_item_id: string;


we should try to pull these types from the open ai agents sdk.

Shubhrakanti · 2025-07-14T21:11:05Z

agents/src/voice/agent_activity.ts

  }

  updateAudioInput(audioStream: ReadableStream<AudioFrame>): void {
-    this.audioRecognition?.setInputAudioStream(audioStream);
+    // TODO(shubhra): might need to tee the streams here.


Add a ticket

Shubhrakanti · 2025-07-14T21:11:31Z

agents/src/voice/agent_activity.ts

-      this.realtimeSession.on(
-        'input_audio_transcription_completed',
-        this.onInputAudioTranscriptionCompleted,
+      this.realtimeSession.on('generation_created', (ev) => this.onGenerationCreated(ev));


arrow functions to preserve this context

Shubhrakanti · 2025-07-14T21:12:14Z

agents/src/llm/realtime.ts

+  private async _mainTaskImpl(signal: AbortSignal): Promise<void> {
+    const reader = this.deferredInputStream.stream.getReader();
+    while (true) {
+      const { done, value } = await reader.read();
+      if (done || signal.aborted) {
+        break;
+      }
+      this.pushAudio(value);
+    }
+  }


Again might need to reconsider this before launch but it's fine for now.

…odel

Shubhrakanti · 2025-07-15T17:50:28Z

agents/src/voice/generation.ts

+export function removeInstructions(chatCtx: ChatContext) {
+  // loop in case there are items with the same id (shouldn't happen!)
+  while (true) {
+    const idx = chatCtx.indexById(INSTRUCTIONS_MESSAGE_ID);
+    if (idx !== undefined) {
+      chatCtx.items.splice(idx, 1);
+    } else {
+      break;
+    }
+  }
+}


copied directly from python

Shubhrakanti added 2 commits July 11, 2025 12:36

init

be9d3cf

save progress

4390da5

Shubhrakanti changed the base branch from main to dev-1.0 July 11, 2025 20:11

Shubhrakanti added 7 commits July 11, 2025 13:22

save progress

73e9058

save progress

75422cd

forwarding audio

47583b5

audio working

92b6043

bugfixes

f6e1866

clean up logs

c19635c

clean logs

7631bff

Shubhrakanti commented Jul 14, 2025

View reviewed changes

Shubhrakanti added 2 commits July 14, 2025 17:33

clean up

a7cfbeb

update TODO

2df62e1

Shubhrakanti requested a review from toubatbrian July 15, 2025 17:46

Shubhrakanti added 2 commits July 15, 2025 10:49

undo changes

bbb867b

Merge branch 'dev-1.0' into shubhra/ajs-32-add-support-for-realtime-m…

c57eb66

…odel

Shubhrakanti commented Jul 15, 2025

View reviewed changes

Shubhrakanti merged commit c2f547c into dev-1.0 Jul 15, 2025
8 checks passed

Shubhrakanti deleted the shubhra/ajs-32-add-support-for-realtime-model branch July 15, 2025 17:51

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Shubhra/ajs 32 add support for realtime model #530

Shubhra/ajs 32 add support for realtime model #530

Uh oh!

Shubhrakanti commented Jul 11, 2025 •

edited

Loading

Uh oh!

changeset-bot bot commented Jul 11, 2025 •

edited

Loading

Uh oh!

Shubhrakanti Jul 14, 2025

Uh oh!

Shubhrakanti Jul 14, 2025

Uh oh!

Shubhrakanti Jul 14, 2025

Uh oh!

Shubhrakanti Jul 14, 2025

Uh oh!

Shubhrakanti Jul 14, 2025

Uh oh!

Shubhrakanti Jul 14, 2025

Uh oh!

Shubhrakanti Jul 14, 2025

Uh oh!

Shubhrakanti Jul 15, 2025

Uh oh!

Uh oh!

Uh oh!

Shubhra/ajs 32 add support for realtime model #530

Shubhra/ajs 32 add support for realtime model #530

Uh oh!

Conversation

Shubhrakanti commented Jul 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

changeset-bot bot commented Jul 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

⚠️ No Changeset found

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Shubhrakanti commented Jul 11, 2025 •

edited

Loading

changeset-bot bot commented Jul 11, 2025 •

edited

Loading