End-to-End Speech Processing Toolkit
-
Updated
Feb 5, 2025 - Python
End-to-End Speech Processing Toolkit
End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
[ICLR'23 Spotlight & IJCV'24] MapTR: Structured Modeling and Learning for Online Vectorized HD Map Construction
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
[ICCV 2023] VAD: Vectorized Scene Representation for Efficient Autonomous Driving
NVIDIA Merlin is an open source library providing end-to-end GPU-accelerated recommender systems, from feature engineering and preprocessing to training deep learning models and running inference in production.
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.
A neural network for end-to-end speech denoising
实时语音交互数字人,支持端到端语音方案(GLM-4-Voice - THG)和级联方案(ASR-LLM-TTS-THG)。可自定义形象与音色,无须训练,支持音色克隆,首包延迟低至3s。Real-time voice interactive digital human, supporting end-to-end voice solutions (GLM-4-Voice - THG) and cascaded solutions (ASR-LLM-TTS-THG). Customizable appearance and voice, supporting voice cloning, with initial package delay as low as 3s.
[ECCV2022] MOTR: End-to-End Multiple-Object Tracking with TRansformer
End-to-end Lane Detection for Self-Driving Cars (ICCV 2019 Workshop)
Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.
国内首个占据栅格网络全栈课程《从BEV到Occupancy Network,算法原理与工程实践》,包含端侧部署。Surrounding Semantic Occupancy Perception Course for Autonomous Driving (docs, ppt and source code) 在线课程主页:http://111.229.117.200:8100/ (作者独立搭建)
Code for the paper STN-OCR: A single Neural Network for Text Detection and Text Recognition
End-to-end Generative Optimization for AI Agents
End-to-End Neural Diarization
[CVPR2023] MOTRv2: Bootstrapping End-to-End Multi-Object Tracking by Pretrained Object Detectors
VISinger 2: High-Fidelity End-to-End Singing Voice Synthesis Enhanced by Digital Signal Processing Synthesizer
Add a description, image, and links to the end-to-end topic page so that developers can more easily learn about it.
To associate your repository with the end-to-end topic, visit your repo's landing page and select "manage topics."