Hongji Wang JiJiJiang

Speech Algorithm Engineer

52 followers · 35 following

Tencent Meeting, Tencent
Shenzhen, China

Stars

ga642381 / speech-trident

Awesome speech/audio LLMs, representation learning, and codec models

795 48 Updated Dec 21, 2024

kamilakesbi / DiarizersLM

Python 9 2 Updated Jul 16, 2024

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 32,662 4,971 Updated Dec 27, 2024

nttcslab-sp / mamba-diarization

Official repository for Mamba-based Segmentation Model for Speaker Diarization

Python 27 3 Updated Oct 10, 2024

gengxuelong / wenet_LLM_from_ASLP

wenet_LLM_from_ASLP

Python 4 Updated Nov 26, 2024

meta-llama / llama3

The official Meta Llama 3 GitHub site

Python 27,652 3,157 Updated Aug 12, 2024

meta-llama / llama

Inference code for Llama models

Python 56,975 9,632 Updated Aug 18, 2024

ictnlp / LLaMA-Omni

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

Python 2,688 185 Updated Nov 14, 2024

hacksider / Deep-Live-Cam

Real-time face swap and one-click video deepfake with only a single image

Python 41,895 6,152 Updated Dec 26, 2024

VITA-MLLM / VITA

✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM

Python 1,079 64 Updated Dec 27, 2024

huggingface / peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 16,802 1,664 Updated Dec 19, 2024

microsoft / LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 11,010 697 Updated Dec 17, 2024

declare-lab / MELD

MELD: A Multimodal Multi-Party Dataset for Emotion Recognition in Conversation

Python 844 207 Updated Mar 10, 2024

kyutai-labs / moshi

Python 7,051 550 Updated Dec 20, 2024

gpt-omni / mini-omni

Open-source multimodal large language model that can hear and talk while thinking, featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

Python 3,227 290 Updated Nov 5, 2024