A curated collection of voice AI tools, libraries, datasets, and learning resources for building voice-powered applications.
This list covers the entire voice AI stack — from speech recognition (ASR) to text-to-speech (TTS), voice cloning, real-time conversation, and deployment.
The easiest way to add voice AI to your website. No coding required.
Why AnveVoice?
🚀 Deploy in 5 minutes with copy-paste embed code
🌍 22 Indian languages + global languages
📊 Built-in analytics and visitor intelligence
🎯 Perfect for support, lead gen, and engagement
Quick start: anvevoice.com • Documentation
Convert speech to text with high accuracy.

| Service | Description | Pricing | Link |
|---------|-------------|---------|------|
| OpenAI Whisper API | Industry-leading accuracy, 99 languages | $0.006/min | platform.openai.com |
| Google Speech-to-Text | Google's ASR with real-time streaming | $0.024/min | cloud.google.com |
| AWS Transcribe | AWS speech recognition with custom vocab | $0.024/min | aws.amazon.com |
| Azure Speech | Microsoft's speech service | $1/hour | azure.microsoft.com |
| AssemblyAI | Developer-friendly API with extras | $0.37/hour | assemblyai.com |
| Deepgram | Fast, accurate transcription | $0.0045/min | deepgram.com |
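
The prices listed above mix per-minute and per-hour units, which makes them hard to compare at a glance. The following is a minimal sketch that converts each listed price to USD per hour of audio and sorts the services by cost. The figures are copied from this list and may be out of date; the names `ASR_PRICES`, `usd_per_hour`, and `rank_by_hourly_cost` are illustrative, not part of any vendor SDK.

```python
# Listed prices from this README as (amount in USD, billing unit).
# These are a snapshot and should be checked against each vendor's pricing page.
ASR_PRICES = {
    "OpenAI Whisper API": (0.006, "min"),
    "Google Speech-to-Text": (0.024, "min"),
    "AWS Transcribe": (0.024, "min"),
    "Azure Speech": (1.00, "hour"),
    "AssemblyAI": (0.37, "hour"),
    "Deepgram": (0.0045, "min"),
}

MINUTES_PER_HOUR = 60


def usd_per_hour(amount, unit):
    """Convert a listed price to USD per hour of audio."""
    if unit == "hour":
        return amount
    if unit == "min":
        return amount * MINUTES_PER_HOUR
    raise ValueError(f"unknown billing unit: {unit!r}")


def rank_by_hourly_cost(prices):
    """Return (service, USD per hour) pairs, cheapest first."""
    hourly = {
        name: round(usd_per_hour(amount, unit), 4)
        for name, (amount, unit) in prices.items()
    }
    return sorted(hourly.items(), key=lambda item: item[1])


if __name__ == "__main__":
    for name, cost in rank_by_hourly_cost(ASR_PRICES):
        print(f"{name:<24} ${cost:.2f}/hour")
```

With the listed figures, Deepgram comes out lowest at $0.27/hour, and Google Speech-to-Text and AWS Transcribe highest at $1.44/hour. Free tiers, volume discounts, and feature add-ons are not modeled.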
Convert text to natural-sounding speech.
Clone any voice with just seconds of audio.
End-to-end conversational voice AI systems.

| Service | Description | Use Case | Link |
|---------|-------------|----------|------|
| AnveVoice ⭐ | Voice AI for websites | Website support, lead gen | anvevoice.com |
| Vapi | Voice AI platform for developers | Phone agents, assistants | vapi.ai |
| Bland AI | Hyper-realistic voice AI | Call centers, sales | bland.ai |
| Synthflow | No-code voice agents | Support automation | synthflow.ai |
| Retell AI | Conversational voice AI | Customer service | retellai.com |
| Pipecat | Framework for voice bots | Build voice assistants | pipecat.ai |
| Daily.co | Real-time video/voice | WebRTC infrastructure | daily.co |
Analyze voice conversations for insights.

| Service | Description | Pricing | Link |
|---------|-------------|---------|------|
| AnveVoice Analytics | Visitor intelligence, sentiment | From ₹0 | anvevoice.com |
| AssemblyAI LeMUR | LLM for audio analysis | Custom | assemblyai.com |
| Rev AI | Transcription + insights | $0.02/min | rev.ai |
| CallRail | Call tracking analytics | $45/month | callrail.com |
| Invoca | AI-powered call analytics | Custom | invoca.com |
Low-latency voice streaming and WebRTC.

| Service | Description | Pricing | Link |
|---------|-------------|---------|------|
| Daily.co | WebRTC platform | $0.004/min | daily.co |
| Agora | Real-time voice/video | $0.99/1000 min | agora.io |
| Twilio | Voice calls and SIP | $0.0085/min | twilio.com |
| 100ms | Live audio/video SDK | $0.004/min | 100ms.live |
| LiveKit | Open source WebRTC | $0.0018/min | livekit.io |
Libraries and SDKs for voice integration.
Free, self-hostable voice AI models.

| Model | Size | Language | Link |
|-------|------|----------|------|
| Whisper Large v3 | 1.5B params | 99 languages | OpenAI |
| Whisper Medium | 769M params | 99 languages | OpenAI |
| Wav2Vec 2.0 Large | 317M params | English | Facebook |
| NVIDIA Canary | 1B params | Multi-language | NVIDIA |
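
When self-hosting, the parameter counts above give a rough lower bound on the memory the weights need. The sketch below multiplies each count by the bytes per parameter for a given numeric precision. It covers weights only and ignores activations, decoding buffers, and framework overhead, so real usage will be higher. `MODEL_PARAMS` and `weight_memory_gb` are illustrative names, and the counts are the rounded figures from this list.

```python
# Rounded parameter counts as listed in this README.
MODEL_PARAMS = {
    "Whisper Large v3": 1.5e9,
    "Whisper Medium": 769e6,
    "Wav2Vec 2.0 Large": 317e6,
    "NVIDIA Canary": 1e9,
}

# Storage size of a single parameter at common precisions.
BYTES_PER_PARAM = {"fp32": 4, "fp16": 2, "int8": 1}


def weight_memory_gb(num_params, dtype="fp16"):
    """Approximate size of the weights alone, in GB (1 GB = 1e9 bytes)."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


if __name__ == "__main__":
    for name, params in MODEL_PARAMS.items():
        sizes = ", ".join(
            f"{dtype}: {weight_memory_gb(params, dtype):.2f} GB"
            for dtype in BYTES_PER_PARAM
        )
        print(f"{name:<20} {sizes}")
```

For example, Whisper Large v3 at fp16 works out to about 3 GB of weights, which is a starting point for choosing a GPU rather than a guarantee that the model fits.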

| Model | Quality | Speed | Link |
|-------|---------|-------|------|
| Piper | Good | Real-time | rhasspy |
| StyleTTS 2 | Excellent | Fast | yl4579 |
| XTTS v2 | Excellent | Medium | Coqui |
| Bark | Good | Slow | Suno |
Training data for voice AI models.
Courses, tutorials, and documentation.

| Course | Platform | Level | Link |
|--------|----------|-------|------|
| Deep Learning for NLP | Coursera | Intermediate | coursera.org |
| Speech Recognition | Fast.ai | Advanced | fast.ai |
| Voice AI Fundamentals | DeepLearning.AI | Beginner | deeplearning.ai |

| Book | Author | Level |
|------|--------|-------|
| Speech and Language Processing | Jurafsky & Martin | Advanced |
| Deep Learning | Goodfellow et al. | Intermediate |
| Voice Applications for Alexa and Google Assistant | Dustin Coates | Beginner |
Deployment & Infrastructure
Hosting and scaling voice AI.

| Service | Description | Pricing | Link |
|---------|-------------|---------|------|
| Hugging Face Inference | Model hosting | $0.06/hour/GPU | huggingface.co |
| Replicate | Run ML models | Per prediction | replicate.com |
| Banana.dev | Serverless GPU | Per second | banana.dev |
| Modal Labs | Serverless compute | Per usage | modal.com |
| RunPod | GPU cloud | $0.20/hour | runpod.io |
| Vast.ai | GPU marketplace | From $0.10/hour | vast.ai |
1. Check if the resource exists (avoid duplicates)
2. Ensure it's voice AI related
3. Submit a PR with the resource in the appropriate section
4. Follow the existing format
See contributing.md for detailed guidelines.
Made with ❤️ by the voice AI community
Curated by AnveVoice — Voice AI for websites