@StephenHodgson StephenHodgson commented Apr 27, 2025

  • fix mp3 and wav playback from SpeechRequests
  • added SemanticVAD options to realtime
  • added new audio models to static model list

@StephenHodgson StephenHodgson requested a review from Copilot April 27, 2025 17:04

@Copilot Copilot AI left a comment

Pull Request Overview

This PR fixes MP3 and WAV playback for SpeechRequests and adds SemanticVAD options for realtime sessions. Key changes include updating the realtime test to use the new DisabledVAD, refactoring VoiceActivityDetectionSettings into an interface (IVoiceActivityDetectionSettings) with three implementations (DisabledVAD, SemanticVAD, ServerVAD), and adjusting the JSON converter and SessionConfiguration to work with the new types.
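
For readers unfamiliar with the pattern, the interface refactor described above can be sketched roughly as follows. This is a minimal illustration, not the package's actual code: the type names come from this PR, but every property, the `Disabled` enum member, the `TurnDetectionJson` helper, and the use of `System.Text.Json` are assumptions made to keep the example self-contained. The JSON field names (`type`, `threshold`, `silence_duration_ms`, `eagerness`) and the use of `null` to disable turn detection follow the Realtime API as I understand it, so check them against the current API reference.

```csharp
using System;
using System.Collections.Generic;
using System.Text.Json;

// Hypothetical stand-in for the package's enum; only Semantic_VAD is confirmed by this PR.
public enum TurnDetectionType { Disabled, Server_VAD, Semantic_VAD }

// One interface, several implementations, so callers can pass any VAD mode.
public interface IVoiceActivityDetectionSettings
{
    TurnDetectionType Type { get; }
}

// Turns server-side turn detection off entirely.
public sealed class DisabledVAD : IVoiceActivityDetectionSettings
{
    public TurnDetectionType Type => TurnDetectionType.Disabled;
}

// Volume/silence based detection (properties are illustrative assumptions).
public sealed class ServerVAD : IVoiceActivityDetectionSettings
{
    public TurnDetectionType Type => TurnDetectionType.Server_VAD;
    public double Threshold { get; set; } = 0.5;
    public int SilenceDurationMs { get; set; } = 500;
}

// Model-based end-of-turn detection (property is an illustrative assumption).
public sealed class SemanticVAD : IVoiceActivityDetectionSettings
{
    public TurnDetectionType Type => TurnDetectionType.Semantic_VAD;
    public string Eagerness { get; set; } = "auto";
}

// Hypothetical helper showing how a converter might branch on the concrete type.
public static class TurnDetectionJson
{
    public static string Serialize(IVoiceActivityDetectionSettings settings)
    {
        switch (settings)
        {
            case ServerVAD server:
                return JsonSerializer.Serialize(new Dictionary<string, object>
                {
                    ["type"] = "server_vad",
                    ["threshold"] = server.Threshold,
                    ["silence_duration_ms"] = server.SilenceDurationMs,
                });
            case SemanticVAD semantic:
                return JsonSerializer.Serialize(new Dictionary<string, object>
                {
                    ["type"] = "semantic_vad",
                    ["eagerness"] = semantic.Eagerness,
                });
            default:
                // DisabledVAD (or a null argument) is written as a JSON null.
                return "null";
        }
    }
}
```

In this sketch, `TurnDetectionJson.Serialize(new DisabledVAD())` returns the literal string `null`, while the other two implementations produce an object tagged with their `type`. Branching on the concrete type in one place keeps the settings classes free of serialization logic, which is presumably why the PR routes this through a dedicated converter.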

Reviewed Changes

Copilot reviewed 8 out of 9 changed files in this pull request and generated 1 comment.

| File | Description |
| --- | --- |
| Tests/TestFixture_13_Realtime.cs | Updated test to instantiate the new DisabledVAD |
| Runtime/Realtime/VoiceActivityDetectionSettings.cs | Refactored settings into an interface and added new VAD classes |
| Runtime/Realtime/TurnDetectionType.cs | Added new enum member for Semantic_VAD |
| Runtime/Realtime/SessionConfiguration.cs | Changed parameter type and default instantiation to ServerVAD |
| Runtime/OpenAIClient.cs | Minor update in JSON converter initialization |
| Runtime/Extensions/VoiceActivityDetectionSettingsConverter.cs | Updated converter to handle new IVoiceActivityDetectionSettings types |
| Runtime/Audio/SpeechClip.cs | New SpeechClip constructor for AudioClip integration |
| Runtime/Audio/AudioEndpoint.cs | Adjustments for PCM handling and improved audio response routing |

Files not reviewed (1)
  • OpenAI/Packages/com.openai.unity/package.json: Language not supported

@StephenHodgson StephenHodgson linked an issue Apr 27, 2025 that may be closed by this pull request
StephenHodgson and others added 3 commits April 27, 2025 13:05
…ityDetectionSettingsConverter.cs

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
@StephenHodgson StephenHodgson linked an issue Apr 27, 2025 that may be closed by this pull request
@StephenHodgson StephenHodgson marked this pull request as ready for review April 27, 2025 18:43
@StephenHodgson StephenHodgson merged commit 0da27a4 into main Apr 27, 2025
17 checks passed
@StephenHodgson StephenHodgson deleted the development branch April 27, 2025 18:45
github-actions bot pushed a commit that referenced this pull request Apr 27, 2025
- fix mp3 and wav playback from SpeechRequests
- added SemanticVAD options to realtime
- added new audio models to static model list

---------

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Development

Successfully merging this pull request may close these issues.

  • SpeechRequest MP3 and WAV responses do not playback correctly
  • Realtime API Interruption
  • Add Semantic VAD to Realtime
  • Add new audio models