-
Notifications
You must be signed in to change notification settings - Fork 284
Open
Description
In the demo section of the readme we have:
_Cartesia
Using Cartesia's Sonic 3 model to visually look at what's in the frame and tell a story with emotion.
• Real-time visual understanding
• Emotional storytelling
• Frame-by-frame analysis_
This sounds very interesting to me, but when I click the link, it does not seem to agree with what was said:
Cartesia is a service that provides Speech-to-Text (STT) and Text-to-Speech (TTS) capabilities. It's designed for real-time voice applications, making it ideal for voice AI agents, transcription pipelines, and conversational interfaces.
coderabbitai
Metadata
Metadata
Assignees
Labels
No labels