Repositories of shiyuzh2007
  • e2m Public

    Forked from wisupai/e2m

    E2M converts various file types (doc, docx, epub, html, htm, url, pdf, ppt, pptx, mp3, m4a) into Markdown. It’s easy to install, with dedicated parsers and converters, supporting custom configs. E2…

    Jupyter Notebook MIT License Updated Aug 27, 2024
  • A curated list of awesome ChatGPT related projects.

    Updated Jul 23, 2024
  • wukong-robot Public

    Forked from wzpan/wukong-robot

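    🤖 wukong-robot is a simple, flexible, and elegant Chinese voice-dialogue robot / smart speaker project. It supports ChatGPT multi-turn conversation and may be the first open-source smart speaker project to support brain-computer interaction.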

    Python MIT License Updated Jul 19, 2024
  • facechain Public

    Forked from modelscope/facechain

    FaceChain is a deep-learning toolchain for generating your Digital-Twin.

    Jupyter Notebook Apache License 2.0 Updated Jul 8, 2024
  • GPT-SoVITS Public

    Forked from RVC-Boss/GPT-SoVITS

    One minute of voice data is enough to train a good TTS model (few-shot voice cloning).

    Python 1 MIT License Updated Mar 19, 2024
  • Open-Sora Public

    Forked from hpcaitech/Open-Sora

    Open-Sora: Democratizing Efficient Video Production for All

    Python Apache License 2.0 Updated Mar 18, 2024
  • FunASR Public

    Forked from modelscope/FunASR

    A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models.

    Python Other Updated Mar 15, 2024
  • Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

    HTML Apache License 2.0 Updated Mar 8, 2024
  • A repository for single- and multi-modal speaker verification, speaker recognition and speaker diarization.

    Python Apache License 2.0 Updated Feb 28, 2024
  • ✨✨Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.

    Updated Feb 27, 2024
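  • "A Guide to Using Open-Source Large Models": a tutorial for quickly deploying open-source large models on AutoDL, tailored to users in China.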

    Jupyter Notebook 1 Apache License 2.0 Updated Feb 26, 2024
  • Official implementation of OOTDiffusion

    Python Other Updated Feb 22, 2024
  • Stable Diffusion web UI

    Python GNU Affero General Public License v3.0 Updated Feb 21, 2024
  • Python Updated Feb 7, 2024
  • PALM-E Public

    Forked from kyegomez/PALM-E

    Implementation of "PaLM-E: An Embodied Multimodal Language Model"

    Python Apache License 2.0 Updated Jan 29, 2024
  • magvit Public

    Forked from google-research/magvit

    Official JAX implementation of MAGVIT: Masked Generative Video Transformer

    Python Apache License 2.0 Updated Jan 17, 2024
  • AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

    JavaScript MIT License Updated Jan 10, 2024
  • ⚡ Building applications with LLMs through composability ⚡

    Python MIT License Updated Jan 8, 2024
  • langflow Public

    Forked from langflow-ai/langflow

    ⛓️ Langflow is a UI for LangChain, designed with react-flow to provide an effortless way to experiment and prototype flows.

    Python MIT License Updated Jan 8, 2024
  • This toolbox aims to unify audio generation model evaluation for easier comparison.

    Python MIT License Updated Jan 7, 2024
  • i-Code Public

    Forked from microsoft/i-Code

    Jupyter Notebook MIT License Updated Dec 20, 2023
  • Qwen Public

    Forked from QwenLM/Qwen

    The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

    Python Apache License 2.0 Updated Dec 6, 2023
  • AutoGPTQ Public

    Forked from AutoGPTQ/AutoGPTQ

    An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.

    Python MIT License Updated Dec 5, 2023
  • llama Public

    Forked from meta-llama/llama

    Inference code for LLaMA models

    Python Other Updated Dec 2, 2023
  • Foundational Models for State-of-the-Art Speech and Text Translation

    C Other Updated Nov 30, 2023
  • Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

    Python Apache License 2.0 Updated Nov 30, 2023
  • Amphion Public

    Forked from open-mmlab/Amphion

    Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

    Updated Nov 28, 2023
  • Safe-RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

    Python Apache License 2.0 Updated Nov 24, 2023
  • Qwen-Audio Public

    Forked from QwenLM/Qwen-Audio

    The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.

    Other Updated Nov 16, 2023
  • Bert-VITS2 Public

    Forked from fishaudio/Bert-VITS2

    VITS2 backbone with BERT

    Python GNU Affero General Public License v3.0 Updated Nov 9, 2023