@FunAudioLLM

FunAudioLLM

Popular repositories

  1. CosyVoice Public

    Multilingual large voice generation model with full-stack support for inference, training, and deployment.

    Python · 19.3k stars · 2.2k forks

  2. SenseVoice Public

    Multilingual Voice Understanding Model

    Python · 7.4k stars · 690 forks

  3. FunMusic Public

    A fundamental toolkit designed for music, song, and audio generation

    Python · 1.3k stars · 131 forks

  4. ThinkSound Public

    [NeurIPS 2025] PyTorch implementation of [ThinkSound], a unified framework for generating audio from any modality, guided by Chain-of-Thought (CoT) reasoning.

    Python · 1.1k stars · 67 forks

  5. Fun-ASR Public

    Fun-ASR is a large end-to-end speech recognition model launched by Tongyi Lab.

    Python · 801 stars · 61 forks

  6. Fun-Audio-Chat Public

    Fun-Audio-Chat is a Large Audio Language Model built for natural, low-latency voice interactions.

    Python · 712 stars · 70 forks

Repositories

Showing 10 of 12 repositories
  • FunResearch Public

    This repository is maintained by the Speech Team at Alibaba’s Tongyi Lab and serves as an open-source platform for our cutting-edge research in speech, audio, and NLP technologies. We believe in accelerating scientific progress through transparent collaboration, and we invite the global research community to explore, reproduce, and build upon our work.

    Python · 16 stars · Apache-2.0 · 1 fork · 0 open issues · 0 open pull requests · Updated Jan 26, 2026
  • Fun-ASR Public

    Fun-ASR is a large end-to-end speech recognition model launched by Tongyi Lab.

    Python · 801 stars · Apache-2.0 · 61 forks · 42 open issues · 0 open pull requests · Updated Jan 26, 2026
  • Fun-Audio-Chat Public

    Fun-Audio-Chat is a Large Audio Language Model built for natural, low-latency voice interactions.

    Python · 712 stars · Apache-2.0 · 70 forks · 9 open issues · 2 open pull requests · Updated Jan 22, 2026
  • FunAudioLLM.github.io Public
    HTML · 56 stars · MIT · 10 forks · 0 open issues · 1 open pull request · Updated Jan 21, 2026
  • CosyVoice Public

    Multilingual large voice generation model with full-stack support for inference, training, and deployment.

    Python · 19,334 stars · Apache-2.0 · 2,168 forks · 858 open issues · 16 open pull requests · Updated Jan 19, 2026
  • MME-Emotion Public

    Official repository for the paper “MME-Emotion: A Holistic Evaluation Benchmark for Emotional Intelligence in Multimodal Large Language Models”

    Python · 19 stars · MIT · 2 forks · 1 open issue · 0 open pull requests · Updated Jan 17, 2026
  • SenseVoice Public

    Multilingual Voice Understanding Model

    Python · 7,429 stars · 690 forks · 169 open issues · 4 open pull requests · Updated Dec 30, 2025
  • ThinkSound Public

    [NeurIPS 2025] PyTorch implementation of [ThinkSound], a unified framework for generating audio from any modality, guided by Chain-of-Thought (CoT) reasoning.

    Python · 1,142 stars · 67 forks · 32 open issues · 1 open pull request · Updated Nov 25, 2025
  • CV3-Eval Public
    Python · 171 stars · Apache-2.0 · 14 forks · 7 open issues · 0 open pull requests · Updated Aug 25, 2025
  • OmniAudio Public
    Python · 8 stars · 3 forks · 0 open issues · 0 open pull requests · Updated May 21, 2025
