end-to-end

Here are 137 public repositories matching this topic...

espnet / espnet

End-to-End Speech Processing Toolkit

text-to-speech deep-learning chainer end-to-end machine-translation pytorch speech-synthesis speech-recognition kaldi voice-conversion speaker-diarization speech-separation speech-enhancement spoken-language-understanding speech-translation singing-voice-synthesis

Updated Feb 5, 2025
Python

zzw922cn / Automatic_Speech_Recognition

Star

End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow

audio deep-learning tensorflow paper end-to-end evaluation cnn lstm speech-recognition rnn automatic-speech-recognition feature-vector data-preprocessing phonemes timit-dataset layer-normalization rnn-encoder-decoder chinese-speech-recognition

Updated Mar 24, 2023
Python

r9y9 / deepvoice3_pytorch

Sponsor

Star

PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models

python machine-learning end-to-end pytorch tts speech-synthesis speech-processing multi-speaker

Updated Dec 19, 2023
Python

hustvl / MapTR

Star

[ICLR'23 Spotlight & IJCV'24] MapTR: Structured Modeling and Learning for Online Vectorized HD Map Construction

real-time end-to-end transformer autonomous-driving bev online-hdmap-construction vectorized-hdmap shape-representation iclr2023

Updated Oct 28, 2024
Python

freewym / espresso

Star

Espresso: A Fast End-to-End Neural Speech Recognition Toolkit

python end-to-end pytorch speech-recognition kaldi asr fairseq

Updated Sep 4, 2024
Python

hustvl / VAD

Star

[ICCV 2023] VAD: Vectorized Scene Representation for Efficient Autonomous Driving

end-to-end autonomous-driving

Updated Oct 31, 2024
Python

NVIDIA Merlin is an open source library providing end-to-end GPU-accelerated recommender systems, from feature engineering and preprocessing to training deep learning models and running inference in production.

machine-learning deep-learning end-to-end recommendation-system gpu-acceleration recommender-system

Updated Dec 5, 2024
Python

kaituoxu / Speech-Transformer

Star

A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.

end-to-end pytorch transformer attention asr attention-is-all-you-need self-attention

Updated Apr 6, 2023
Python

openspeech-team / openspeech

Star

Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.

recognition open end-to-end speech speech-recognition e2e asr

Updated Oct 23, 2023
Python

drethage / speech-denoising-wavenet

Star

A neural network for end-to-end speech denoising

machine-learning deep-learning end-to-end speech neural-networks wavenet speech-processing speech-denoising

Updated Jul 6, 2023
Python

Henry-23 / VideoChat

Star

实时语音交互数字人，支持端到端语音方案（GLM-4-Voice - THG）和级联方案（ASR-LLM-TTS-THG）。可自定义形象与音色，无须训练，支持音色克隆，首包延迟低至3s。Real-time voice interactive digital human, supporting end-to-end voice solutions (GLM-4-Voice - THG) and cascaded solutions (ASR-LLM-TTS-THG). Customizable appearance and voice, supporting voice cloning, with initial package delay as low as 3s.

streaming real-time end-to-end tts lip-sync dialogue-systems asr talking-head digital-human multimodal-large-language-models musetalk gradio-python-app

Updated Nov 15, 2024
Python

megvii-research / MOTR

Star

[ECCV2022] MOTR: End-to-End Multiple-Object Tracking with TRansformer

end-to-end pytorch transformer multi-object-tracking

Updated Jan 15, 2024
Python

wvangansbeke / LaneDetection_End2End

Star

End-to-end Lane Detection for Self-Driving Cars (ICCV 2019 Workshop)

computer-vision deep-learning end-to-end pytorch least-squares lane-detection self-driving-cars

Updated May 14, 2020
Python

sooftware / kospeech

Sponsor

Star

Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.

end-to-end pytorch transformer speech-recognition las seq2seq jasper asr conformer attention-is-all-you-need korean-speech e2e-asr las-models ksponspeech

Updated May 27, 2023
Python

Charmve / OccNet-Course

Sponsor

Star

国内首个占据栅格网络全栈课程《从BEV到Occupancy Network，算法原理与工程实践》，包含端侧部署。Surrounding Semantic Occupancy Perception Course for Autonomous Driving (docs, ppt and source code) 在线课程主页：http://111.229.117.200:8100/ (作者独立搭建)

end-to-end tesla self-driving-car autonomous-driving autonomous-vehicles bev occupancy occupancy-prediction