warisqr007

Waris Quamer warisqr007

10 followers · 0 following

College Station, Texas
https://www.linkedin.com/in/warisquamer

Achievements

Highlights

DST Public
Forked from ictnlp/DST

DST is a Decoder-only simultaneous machine translation model, which can conduct policy decision and translation concurrently

Python MIT License Updated Jan 26, 2025
speech-trident Public
Forked from ga642381/speech-trident

Awesome speech/audio LLMs, representation learning, and codec models

Updated Nov 27, 2024
warisqr007.github.io Public

Waris Quamer's personal website.

HTML Updated Nov 9, 2024
moshi Public
Forked from kyutai-labs/moshi

Python Apache License 2.0 Updated Oct 11, 2024
itsp Public
Forked from Speech-Interaction-Technology-Aalto-U/itsp

Introduction to Speech Processing

Jupyter Notebook Creative Commons Attribution Share Alike 4.0 International Updated Sep 13, 2024
SummaryMixing Public
Forked from SamsungLabs/SummaryMixing

This repository implements SummaryMixing, a simpler, faster and much cheaper replacement to self-attention for automatic speech recognition (see: https://arxiv.org/abs/2307.07421). The code is read…

Python Other Updated Aug 30, 2024
NaturalVoices Public
Forked from 3loi/NaturalVoices

Jupyter Notebook MIT License Updated Aug 27, 2024
alloy-voice-assistant Public
Forked from svpino/alloy-voice-assistant

Python Updated Jun 14, 2024
audiomentations Public
Forked from iver56/audiomentations

A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.

Python MIT License Updated Jun 10, 2024
ppg2ppg Public

Zero-Shot Foreign Accent Conversion without a Native Reference

speech-synthesis voice-conversion accent-conversion phonetic-posteriorgram

Python 28 6 Apache License 2.0 Updated May 1, 2024
Sense_glucose Public
Forked from kathanvyas/Zephyr-BioHArness-Data_Preprocess

The repository contains code for the Sense Project. Contains code from reading teh ECG-Sumamry file from Zephyr folder and then processing it with reading glucose file. the code contains all necess…

Python Updated Jan 12, 2024
DNS-Challenge Public
Forked from microsoft/DNS-Challenge

This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.

Python Creative Commons Attribution 4.0 International Updated Dec 13, 2023
LLVC Public
Forked from KoeAI/LLVC

Python MIT License Updated Nov 6, 2023
vq-ppg-vc Public

Vector Quantized PPGs based Voice conversion

transformer prosody voice-conversion acoustic-model speaker-embedding phonetic-posteriorgram prosody-transfer

Jupyter Notebook 7 1 Apache License 2.0 Updated Aug 27, 2023
vq-bnf-translator Public

Pronunciation correction in vector quantized PPG representation space

voice-conversion accent-conversion espnetv2 transformer-pytorch phonetic-posteriorgram

Python 7 1 Apache License 2.0 Updated May 8, 2023
primehilltop Public
Forked from creativetimofficial/paper-kit-react

Landing page for primehilltop

SCSS MIT License Updated May 5, 2023
warisqr007 Public

Config files for my GitHub profile.

config github-config

Updated Apr 28, 2023
vq-bnf Public

Vector Quantizing speech representations

speech-synthesis speech-recognition voice-conversion speech-processing vector-quantization

Python 4 1 Apache License 2.0 Updated Apr 6, 2023
mellotron Public
Forked from NVIDIA/mellotron

Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data

Jupyter Notebook BSD 3-Clause "New" or "Revised" License Updated Mar 24, 2023
ConvSANN Public

Implementation of Self-Attentive Convolulation Neural Network for text semantic similarity task.

Python MIT License Updated Mar 24, 2023
StyleFlow Public
Forked from RameenAbdal/StyleFlow

StyleFlow: Attribute-conditioned Exploration of StyleGAN-generated Images using Conditional Continuous Normalizing Flows (ACM TOG 2021)

Python Updated Mar 24, 2023
watson-stt-wer-python Public
Forked from IBM/watson-stt-wer-python

Utilities for transcribing a set of audio files with IBM Watson Speech to Text (STT), then analyzing the error rate of the STT transcription against a known-good transcription

Python Apache License 2.0 Updated Feb 28, 2023
vc-vq-prosody Public

Jupyter Notebook 1 Apache License 2.0 Updated Feb 11, 2023
transformer-prosody-vc Public

Jupyter Notebook 1 Apache License 2.0 Updated Jan 27, 2023
FastSpeech2 Public
Forked from ming024/FastSpeech2

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

Python MIT License Updated Jan 10, 2023
silero-models Public
Forked from snakers4/silero-models

Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

Jupyter Notebook Other Updated Dec 26, 2022
Conditional-Normalizing-Flow Public
Forked from 5yearsKim/Conditional-Normalizing-Flow

Conditional Generative model (Normalizing Flow) and experimenting style transfer using this model

Python Updated Nov 23, 2022
vc-spk-loss Public

Voice Conversion with additional speaker loss

Jupyter Notebook Apache License 2.0 Updated Nov 21, 2022
accent-conversion Public

Updated Nov 2, 2022
SRD-VC Public
Forked from YoungSeng/SRD-VC

Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)

Python Updated Nov 1, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Waris Quamer warisqr007

Achievements

Achievements

Highlights

Block or report warisqr007

DST Public

speech-trident Public

warisqr007.github.io Public

moshi Public

itsp Public

SummaryMixing Public

NaturalVoices Public

alloy-voice-assistant Public

audiomentations Public

ppg2ppg Public

Sense_glucose Public

DNS-Challenge Public

LLVC Public

vq-ppg-vc Public

vq-bnf-translator Public

primehilltop Public

warisqr007 Public

vq-bnf Public

mellotron Public

ConvSANN Public

StyleFlow Public

watson-stt-wer-python Public

vc-vq-prosody Public

transformer-prosody-vc Public

FastSpeech2 Public

silero-models Public

Conditional-Normalizing-Flow Public

vc-spk-loss Public

accent-conversion Public

SRD-VC Public