Closed as not planned
Description
I've built a simple voice agent using the latest Agent SDK and VoicePipeline with the default configurations. However, the voice conversation feels noticeably slow. Can you explain how the speech-to-text (STT), workflow, and text-to-speech (TTS) processes work behind the scenes, and offer suggestions on how to improve the speed or reduce latency in the pipeline?
Our use case runs over a phone line (Twilio).
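To help narrow down where the delay comes from, here is a minimal sketch of how each stage of one conversational turn could be timed separately. It does not call the SDK: `fake_stt`, `fake_workflow`, `fake_tts`, `timed`, and `run_turn` are hypothetical stand-ins, and the sleep durations are arbitrary placeholders rather than measurements. The stages are also assumed to run strictly one after another, which may not match what VoicePipeline actually does internally.

```python
import asyncio
import time


# Hypothetical stand-ins for the three stages; replace with real calls.
async def fake_stt(audio: bytes) -> str:
    await asyncio.sleep(0.02)  # placeholder delay, not a real measurement
    return "hello"


async def fake_workflow(text: str) -> str:
    await asyncio.sleep(0.03)  # placeholder delay, not a real measurement
    return f"You said: {text}"


async def fake_tts(text: str) -> bytes:
    await asyncio.sleep(0.01)  # placeholder delay, not a real measurement
    return text.encode("utf-8")


async def timed(name, awaitable, timings):
    """Await one stage and record its wall-clock duration in seconds."""
    start = time.perf_counter()
    result = await awaitable
    timings[name] = time.perf_counter() - start
    return result


async def run_turn(audio: bytes):
    """Run STT, workflow, and TTS in sequence, timing each stage."""
    timings = {}
    text = await timed("stt", fake_stt(audio), timings)
    reply = await timed("workflow", fake_workflow(text), timings)
    speech = await timed("tts", fake_tts(reply), timings)
    return speech, timings


if __name__ == "__main__":
    _, stage_timings = asyncio.run(run_turn(b"\x00" * 320))
    for stage, seconds in stage_timings.items():
        print(f"{stage}: {seconds * 1000:.1f} ms")
```

With real calls swapped in, the per-stage numbers should show whether most of the wait is in transcription, the agent workflow, or speech synthesis. Network time on the Twilio leg would have to be measured separately.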