shiyuzh2007

shiyuzh2007

8 followers · 3 following

syzhou.github.io

Achievements

e2m Public
Forked from wisupai/e2m

E2M converts various file types (doc, docx, epub, html, htm, url, pdf, ppt, pptx, mp3, m4a) into Markdown. It’s easy to install, with dedicated parsers and converters, supporting custom configs. E2…

Jupyter Notebook MIT License Updated Aug 27, 2024
awesome-chatgpt Public
Forked from uhub/awesome-chatgpt

A curated list of awesome ChatGPT related projects.

Updated Jul 23, 2024
wukong-robot Public
Forked from wzpan/wukong-robot

🤖 wukong-robot 是一个简单、灵活、优雅的中文语音对话机器人/智能音箱项目，支持ChatGPT多轮对话能力，还可能是首个支持脑机交互的开源智能音箱项目。

Python MIT License Updated Jul 19, 2024
facechain Public
Forked from modelscope/facechain

FaceChain is a deep-learning toolchain for generating your Digital-Twin.

Jupyter Notebook Apache License 2.0 Updated Jul 8, 2024
GPT-SoVITS Public
Forked from RVC-Boss/GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 1 MIT License Updated Mar 19, 2024
Open-Sora Public
Forked from hpcaitech/Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Python Apache License 2.0 Updated Mar 18, 2024
FunASR Public
Forked from modelscope/FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models.

Python Other Updated Mar 15, 2024
unstructured Public
Forked from Unstructured-IO/unstructured

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

HTML Apache License 2.0 Updated Mar 8, 2024
3D-Speaker Public
Forked from modelscope/3D-Speaker

A repository for single- and multi-modal speaker verification, speaker recognition and speaker diarization.

Python Apache License 2.0 Updated Feb 28, 2024
Awesome-Multimodal-Large-Language-Models Public
Forked from BradyFU/Awesome-Multimodal-Large-Language-Models

✨✨Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.

Updated Feb 27, 2024
self-llm Public
Forked from datawhalechina/self-llm

《开源大模型食用指南》基于AutoDL快速部署开源大模型，更适合中国宝宝的部署教程

Jupyter Notebook 1 Apache License 2.0 Updated Feb 26, 2024
OOTDiffusion Public
Forked from levihsu/OOTDiffusion

Official implementation of OOTDiffusion

Python Other Updated Feb 22, 2024
stable-diffusion-webui Public
Forked from AUTOMATIC1111/stable-diffusion-webui

Stable Diffusion web UI

Python GNU Affero General Public License v3.0 Updated Feb 21, 2024
Make-An-Audio Public
Forked from Text-to-Audio/Make-An-Audio

Python Updated Feb 7, 2024
PALM-E Public
Forked from kyegomez/PALM-E

Implementation of "PaLM-E: An Embodied Multimodal Language Model"

Python Apache License 2.0 Updated Jan 29, 2024
magvit Public
Forked from google-research/magvit

Official JAX implementation of MAGVIT: Masked Generative Video Transformer

Python Apache License 2.0 Updated Jan 17, 2024
AutoGPT Public
Forked from Significant-Gravitas/AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

JavaScript MIT License Updated Jan 10, 2024
langchain Public
Forked from langchain-ai/langchain

⚡ Building applications with LLMs through composability ⚡

Python MIT License Updated Jan 8, 2024
langflow Public
Forked from langflow-ai/langflow

⛓️ Langflow is a UI for LangChain, designed with react-flow to provide an effortless way to experiment and prototype flows.

Python MIT License Updated Jan 8, 2024
audioldm_eval Public
Forked from haoheliu/audioldm_eval

This toolbox aims to unify audio generation model evaluation for easier comparison.

Python MIT License Updated Jan 7, 2024
i-Code Public
Forked from microsoft/i-Code

Jupyter Notebook MIT License Updated Dec 20, 2023
Qwen Public
Forked from QwenLM/Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python Apache License 2.0 Updated Dec 6, 2023
AutoGPTQ Public
Forked from AutoGPTQ/AutoGPTQ

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

Python MIT License Updated Dec 5, 2023
llama Public
Forked from meta-llama/llama

Inference code for LLaMA models

Python Other Updated Dec 2, 2023
seamless_communication Public
Forked from facebookresearch/seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

C Other Updated Nov 30, 2023
Video-LLaVA Public
Forked from PKU-YuanGroup/Video-LLaVA

Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Python Apache License 2.0 Updated Nov 30, 2023
Amphion Public
Forked from open-mmlab/Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Updated Nov 28, 2023
safe-rlhf Public
Forked from PKU-Alignment/safe-rlhf

Safe-RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Python Apache License 2.0 Updated Nov 24, 2023
Qwen-Audio Public
Forked from QwenLM/Qwen-Audio

The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.

Other Updated Nov 16, 2023
Bert-VITS2 Public
Forked from fishaudio/Bert-VITS2

vits2 backbone with bert

Python GNU Affero General Public License v3.0 Updated Nov 9, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

shiyuzh2007

Achievements

Achievements

Block or report shiyuzh2007

e2m Public

awesome-chatgpt Public

wukong-robot Public

facechain Public

GPT-SoVITS Public

Open-Sora Public

FunASR Public

unstructured Public

3D-Speaker Public

Awesome-Multimodal-Large-Language-Models Public

self-llm Public

OOTDiffusion Public

stable-diffusion-webui Public

Make-An-Audio Public

PALM-E Public

magvit Public

AutoGPT Public

langchain Public

langflow Public

audioldm_eval Public

i-Code Public

Qwen Public

AutoGPTQ Public

llama Public

seamless_communication Public

Video-LLaVA Public

Amphion Public

safe-rlhf Public

Qwen-Audio Public

Bert-VITS2 Public