HOMI POP - Sydney (UTC +11:00) - blog.369500111.xyz
Audio
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
Instant voice cloning by MIT and MyShell. Audio foundation model.
MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting
🎧 Open source Spotify client that requires neither Premium nor Electron! Available for both desktop & mobile!
A generative speech model for daily dialogue.
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
✨ AsrTools: Smart Voice-to-Text Tool | Efficient Batch Processing | User-Friendly Interface | No GPU Required | Supports SRT/TXT Output | Turn your audio into accurate text in an instant!
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.
Docker for multiple TTS engines with a Gradio interface
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
Offline text-to-speech synthesis for Python
One minute of voice data is enough to train a good TTS model! (few-shot voice cloning)
🚀 AI voice mimicry: clone a voice in 5 seconds to generate arbitrary speech in real time
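End-to-end Chinese speech recognition built on PaddlePaddle, from the basics to hands-on practice, with very simple introductory examples and practical enterprise projects. Supports the currently most popular models: DeepSpeech2, Conformer, and Squeezeformer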