#

audio-ai

Here are 9 public repositories matching this topic...

narcotic-sh / senko

Very fast, accurate speaker diarization

speaker-diarization rapids diarization fbank silero-vad zanshin pyannote audio-ai

Updated Sep 23, 2025
Python

kyegomez / AudioFlamingo

Implementation of the model "AudioFlamingo" from the paper: "Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities"

audio machine-learning ai ml artificial-intelligence transformer deeplearning attention-mechanism attention-model attention-is-all-you-need llm audio-ai

Updated Jan 27, 2025
Python

serp-ai / ai-text-to-audio-latent-diffusion

text-to-audio-latent-diffusion

text-to-audio latent-diffusion audio-diffusion text-to-audio-ai latent-audio-diffusion audio-ai ai-audio-generation

Updated Aug 25, 2023
Python

ksasso1028 / audio-reverb-removal

Code to train a custom time-domain autoencoder to dereverb audio

audio dsp pytorch autoencoder convolutional-neural-networks time-domain denoising-autoencoders denoising multi-task-learning dereverberation autoencoder-neural-network demucs audio-denoising audio-machine-learning audio-ml audio-ai convtasnet

Updated Nov 30, 2023
Python

aaivu / KuralNet

A deep learning-based Speech Emotion Recognition (SER) model trained primarily on Indian languages. Designed for applications in call centers, sentiment analysis, and accessibility tools.

nlp deep-learning sentiment-analysis speech-processing ser emotion-detection indian-languages speech-emotion-recognition audio-ai multilingual-ser accessibility-ai ai-for-speech

Updated Jul 25, 2025
Python

SoheilGtex / Voice-Cloning-SV2TTS-

Safe, production-ready starter for voice cloning via SV2TTS (RTVC wrapper). CLI, tests, Docker, CI, pre-commit. No model weights included.

nlp machine-learning text-to-speech deep-learning speech-synthesis neural-networks voice-cloning audio-ai

Updated Aug 18, 2025
Python

saoud30 / Audio-AI

🗣️ Audio AI: Your Audio & Video Transcription Powerhouse!

transcribe-audio-files streamlit-webapp assembly-ai audio-ai

Updated Oct 27, 2024
Python

hari7261 / AgentPodcast-AI

PodcastAgent uses advanced text-to-speech technology to create natural-sounding multi-speaker podcasts from any written content.

api text-to-speech ai podcast gemini podcast-generator audio-ai hari7261 podcast-ai

Updated Sep 2, 2025
Python

engasd999 / senko

⚡ Accelerate speaker diarization with Senko, processing 1 hour of audio in just 5 seconds on powerful hardware—boost your audio analysis efficiency.

theme esp8266 minecraft-skin telegram micropython powershell esp32 windows-theme welcome oh pyrogram windows-terminal rapids diarization senko figura pyannote audio-ai

Updated Sep 23, 2025
Python

Improve this page

Add a description, image, and links to the audio-ai topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the audio-ai topic, visit your repo's landing page and select "manage topics."