Skip to content
View gcambara's full-sized avatar

Block or report gcambara

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

torchview: visualize pytorch models

Python 908 43 Updated Apr 23, 2025

Collection of audio-focused loss functions in PyTorch

Python 773 72 Updated Jul 30, 2024

speacher is a speech teacher framework that enables research in curriculum learning for speech recognition and quality assessment for speech synthesis models.

Python 7 Updated Apr 2, 2022

A multi-voice TTS system trained with an emphasis on quality

Jupyter Notebook 14,032 1,958 Updated Nov 19, 2024

PyTorch reimplementation of "Keyword Transformer: A Self-Attention Model for Keyword Spotting"

Python 15 3 Updated Jul 23, 2021

Implementation of Perceiver, General Perception with Iterative Attention, in Pytorch

Python 1,137 136 Updated Aug 22, 2023

Public domain corpus of Catalan text

16 2 Updated Dec 20, 2021

Pytorch!!!Pytorch!!!Pytorch!!! Dynamic Convolution: Attention over Convolution Kernels (CVPR-2020)

Python 574 90 Updated May 22, 2022

A library for efficient similarity search and clustering of dense vectors.

C++ 34,504 3,845 Updated Apr 23, 2025

Vector (and Scalar) Quantization, in Pytorch

Python 3,168 257 Updated Apr 14, 2025

Continuous Augmented Positional Embeddings (CAPE) implementation for PyTorch

Python 40 3 Updated Dec 28, 2022

"Deep Generative Modeling": Introductory Examples

Jupyter Notebook 1,169 186 Updated Sep 22, 2024

😎 A curated list of awesome GitHub Profile which updates in real time

26,333 3,946 Updated Aug 19, 2024

A PyTorch-based Speech Toolkit

Python 9,730 1,478 Updated Apr 16, 2025

Collecting research materials on EBM/EBL (Energy Based Models, Energy Based Learning)

291 28 Updated Nov 24, 2023

Tacotron 2 - PyTorch implementation with faster-than-realtime inference

Jupyter Notebook 1 Updated May 6, 2018

A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis

Python 1 Updated Jan 4, 2019

A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Wavenet On Mel Spectrogram Predictions".

Python 1 Updated Jan 30, 2019

A pytorch implementation of a Text-to-Speech system based on NVIDIA's Tacotron2 text2mel plus a neural vocoder

Jupyter Notebook 2 Updated Dec 17, 2020

This is our work on learning speaking style in speech synthesis but only using the pitch frequency sub-band as a speaker reference. We trained a modified version of the NVIDIA's Tacotron2 model but…

Python 2 Updated May 21, 2021

A NVIDIA's Pytorch Tacotron2 adaptation with unsupervised Global Style Tokens. Instead of using the whole mel-scale spectrogram representation in the GST input, we extracted and used only the pitch…

Python 7 4 Updated Jul 6, 2023

A NVIDIA's Pytorch Tacotron2 adaptation with unsupervised Global Style Tokens. The model has been trained with the English read-speech LJSpeech Dataset.

Python 9 5 Updated Sep 4, 2023

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Python 2,377 499 Updated Mar 11, 2025

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 143,398 28,740 Updated Apr 24, 2025

Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.

Python 29,351 3,482 Updated Apr 23, 2025