feat: Add audio parameter support to gemini tts models #11287

AyrennC · 2025-05-31T09:35:24Z

Title

Add 'audio' params support to all gemini tts models

Relevant issues

Fixes #11250
Fixes #11118

Pre-Submission checklist

Please complete all items before asking a LiteLLM maintainer to review your PR

I have Added testing in the tests/litellm/ directory, Adding at least 1 test is a hard requirement - see details
I have added a screenshot of my new test passing locally
My PR passes all unit tests on make test-unit
My PR's scope is as isolated as possible, it only solves 1 specific problem

Type

🆕 New Feature
🐛 Bug Fix
✅ Test

Changes

Add is_model_gemini_audio_model() method to detect TTS models
Include 'audio' parameter in supported params for TTS models
Map OpenAI audio parameter to Gemini speechConfig format
Add assistant message transformation for Gemini audio output

vercel · 2025-05-31T09:35:29Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name	Status	Preview	Comments	Updated (UTC)
litellm	✅ Ready (Inspect)	Visit Preview	💬 Add feedback	May 31, 2025 9:02pm

- Add is_model_gemini_audio_model() method to detect TTS models - Include 'audio' parameter in supported params for TTS models - Map OpenAI audio parameter to Gemini speechConfig format - Add _extract_audio_response_from_parts() method to transform audio output to openai format

AyrennC · 2025-05-31T13:52:05Z

Squashed commits for a cleaner history
Tested gemini tts models locally to be working, gemini currently only support pcm16 audio format:

AyrennC · 2025-05-31T13:58:24Z

LiteLLM Mock Tests timed out after 8 minutes, all test were successful until time out.

tests/llm_translation/test_gemini_tts.py

litellm/llms/gemini/chat/transformation.py

krrishdholakia · 2025-05-31T17:57:54Z

litellm/llms/gemini/chat/transformation.py

+                        )
+
+                    # Map OpenAI audio parameter to Gemini speech config
+                    speech_config = {}


can we have these be typed dict's inside types/llms/vertex_ai.py - so any future updates are also tracked correctly

added typed dict for SpeechConfig and its child in types/llms/vertex_ai.py

krrishdholakia

migrate test to test_litellm, and simplify tts model check

rest looks great. thank you for your work on this

- simplified gemini tts model detection - moved gemini_tts test to test_litellm

AyrennC · 2025-05-31T21:01:49Z

migrate test to test_litellm, and simplify tts model check

rest looks great. thank you for your work on this

Thanks! I went ahead and made the suggested changes:

created typedict for speechconfig
simplified tts model detection
moved test to test_litellm

krrishdholakia · 2025-05-31T23:21:40Z

Thanks @AyrennC would you mind contributing docs for the change, so people know how to use this?

For VertexAI - here
For Google AI Studio - here

Contributing guide - https://docs.litellm.ai/docs/extras/contributing (although it's just an .md change, so i'm sure you can just do it on github as well)

vercel bot deployed to Preview May 31, 2025 09:36 View deployment

AyrennC mentioned this pull request May 31, 2025

[Bug]: Gemini TTS model returning 400 with 'audio' parameters #11250

Closed

AyrennC changed the title ~~feat: Add Gemini TTS audio parameter support~~ feat: Add audio parameter support to gemini tts models May 31, 2025

vercel bot deployed to Preview May 31, 2025 12:44 View deployment

AyrennC marked this pull request as draft May 31, 2025 12:45

vercel bot deployed to Preview May 31, 2025 12:48 View deployment

AyrennC force-pushed the gemini-tts-audio branch from b801117 to eb0b111 Compare May 31, 2025 13:18

vercel bot deployed to Preview May 31, 2025 13:19 View deployment

AyrennC marked this pull request as ready for review May 31, 2025 13:24

AyrennC marked this pull request as draft May 31, 2025 13:36

updated unit-test to use pcm16

bf9e3c9

vercel bot deployed to Preview May 31, 2025 13:49 View deployment

AyrennC marked this pull request as ready for review May 31, 2025 13:58

krrishdholakia reviewed May 31, 2025

View reviewed changes

tests/llm_translation/test_gemini_tts.py Outdated Show resolved Hide resolved

krrishdholakia reviewed May 31, 2025

View reviewed changes

litellm/llms/gemini/chat/transformation.py Outdated Show resolved Hide resolved

krrishdholakia reviewed May 31, 2025

View reviewed changes

krrishdholakia requested changes May 31, 2025

View reviewed changes

- created typedict for speechconfig

746de78

- simplified gemini tts model detection - moved gemini_tts test to test_litellm

vercel bot deployed to Preview May 31, 2025 20:58 View deployment

simplified is_model_gemini_audio_model more

b656acf

vercel bot deployed to Preview May 31, 2025 21:02 View deployment

AyrennC requested a review from krrishdholakia May 31, 2025 22:36

krrishdholakia merged commit 8ae7917 into BerriAI:main May 31, 2025
6 checks passed

AyrennC deleted the gemini-tts-audio branch June 1, 2025 05:37

AyrennC mentioned this pull request Jun 1, 2025

[Docs] Add audio / tts section for gemini and vertex #11306

Open

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

feat: Add audio parameter support to gemini tts models #11287

feat: Add audio parameter support to gemini tts models #11287

Uh oh!

AyrennC commented May 31, 2025 •

edited

Loading

Uh oh!

vercel bot commented May 31, 2025 •

edited

Loading

Uh oh!

AyrennC commented May 31, 2025

Uh oh!

AyrennC commented May 31, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

krrishdholakia May 31, 2025

Uh oh!

AyrennC May 31, 2025

Uh oh!

krrishdholakia left a comment

Uh oh!

AyrennC commented May 31, 2025

Uh oh!

Uh oh!

krrishdholakia commented May 31, 2025

Uh oh!

Uh oh!

Uh oh!

feat: Add audio parameter support to gemini tts models #11287

feat: Add audio parameter support to gemini tts models #11287

Uh oh!

Conversation

AyrennC commented May 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Title

Relevant issues

Pre-Submission checklist

Type

Changes

Uh oh!

vercel bot commented May 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

AyrennC commented May 31, 2025

Uh oh!

AyrennC commented May 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

krrishdholakia May 31, 2025

Choose a reason for hiding this comment

Uh oh!

AyrennC May 31, 2025

Choose a reason for hiding this comment

Uh oh!

krrishdholakia left a comment

Choose a reason for hiding this comment

Uh oh!

AyrennC commented May 31, 2025

Uh oh!

Uh oh!

krrishdholakia commented May 31, 2025

Uh oh!

Uh oh!

AyrennC commented May 31, 2025 •

edited

Loading

vercel bot commented May 31, 2025 •

edited

Loading

AyrennC commented May 31, 2025 •

edited

Loading