-
Notifications
You must be signed in to change notification settings - Fork 2.3k
Simplismart Integration in Livekit #4349
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Simplismart Integration in Livekit #4349
Conversation
|
Hello and thanks for the PR! How can we test it? I checked your web app and can't find any TTS models or endpoints to test with. |
|
sure @Hormold , we are releasing it by today, and let you know |
|
Hey @Hormold, Happy to help if you have any more questions. Example code snippet: from livekit.agents import (
Agent,
AgentSession,
JobContext,
RunContext,
WorkerOptions,
cli,
function_tool,
)
from livekit.plugins import silero
from livekit.plugins import simplismart, openai
from dotenv import load_dotenv
import os
load_dotenv()
SIMPLISMART_API_KEY = os.getenv("SIMPLISMART_API_KEY")
@function_tool
async def lookup_weather(
context: RunContext,
location: str,
):
"""Used to look up weather information."""
return {"weather": "sunny", "temperature": 70}
async def entrypoint(ctx: JobContext):
await ctx.connect()
agent = Agent(
instructions="You are a friendly voice assistant built by LiveKit.",
)
session = AgentSession(
vad=silero.VAD.load(),
# any combination of STT, LLM, TTS, or realtime API can be used
stt=simplismart.STT(base_url="https://api.simplismart.live/predict", api_key=SIMPLISMART_API_KEY, model="openai/whisper-large-v3-turbo"),
llm=openai.LLM(model="google/gemma-3-4b-it", api_key=SIMPLISMART_API_KEY, base_url="https://api.simplismart.live"),
tts=simplismart.TTS(base_url="https://api.simplismart.live/tts", api_key=SIMPLISMART_API_KEY, model="Simplismart/orpheus-3b-0.1-ft"),
)
await session.start(agent=agent, room=ctx.room)
if __name__ == "__main__":
cli.run_app(WorkerOptions(entrypoint_fnc=entrypoint)) |
|
I tested it, and the voice sounds pretty smooth! The LLM isn't working for me; it's stuck with Gemma (7b and 28b) and Llama (70b). It works, but it's very, very slow. |
|
@Hormold Thanks for testing this out, glad to hear you liked the voice quality! For the LLM latency, I’d recommend trying a smaller model like Gemma 3 4B ( In the meantime, if the STT and TTS components look good, could we go ahead and merge this PR? Merry Christmas in advance 🎄 |
This is an integration PR of Simplismart's open source stt, tts and LLM models