LiveKit NVIDIA NeMo STT Plugin

A high-performance speech-to-text plugin for LiveKit agents using NVIDIA NeMo Parakeet and Canary models for accurate and fast speech recognition.

Features

Dual Model Support: Compatible with both NVIDIA NeMo Parakeet and Canary STT models
Ultra-Low Latency: Sub-100ms latency with Parakeet on RTX 4090
High Accuracy: Enterprise-grade speech recognition using NVIDIA's state-of-the-art models
Optimized Performance: Connection pooling and keep-alive optimizations for production use
LiveKit Integration: Seamless integration with LiveKit agents framework

Requirements

LiveKit Agents v1.2 or higher
NVIDIA NeMo STT server instance

Performance Benchmarks

Model	Hardware	Latency	Use Case
Parakeet	RTX 4090	<100ms	Real-time applications
Canary	RTX 4090	<150ms	High-accuracy transcription

Installation

Clone or download this plugin into your LiveKit-based agents project root directory
Set up the NVIDIA NeMo STT server using the FastAPI implementation
Ensure your server is running and accessible

Server Setup

Use the FastAPI server implementation for both Parakeet and Canary models:

Repository: taresh18/parakeet-FastAPI

This server provides unified support for both NVIDIA NeMo models with optimized inference endpoints.

Usage

Initialize your agent session with the CanarySTT plugin:

from parakeet import CanarySTT

session = AgentSession(
    # ... other configuration
    stt=CanarySTT(
        server_url="<your_server_url>",  # e.g., "http://localhost:8989"
    )
)

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
README.md		README.md
parakeet.py		parakeet.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

LiveKit NVIDIA NeMo STT Plugin

Features

Requirements

Performance Benchmarks

Installation

Server Setup

Usage

About

Uh oh!

Uh oh!

Languages

taresh18/livekit-parakeet

Folders and files

Latest commit

History

Repository files navigation

LiveKit NVIDIA NeMo STT Plugin

Features

Requirements

Performance Benchmarks

Installation

Server Setup

Usage

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Uh oh!

Languages