-
e2m Public
Forked from wisupai/e2mE2M converts various file types (doc, docx, epub, html, htm, url, pdf, ppt, pptx, mp3, m4a) into Markdown. It’s easy to install, with dedicated parsers and converters, supporting custom configs. E2…
Jupyter Notebook MIT License UpdatedAug 27, 2024 -
awesome-chatgpt Public
Forked from uhub/awesome-chatgptA curated list of awesome ChatGPT related projects.
UpdatedJul 23, 2024 -
wukong-robot Public
Forked from wzpan/wukong-robot🤖 wukong-robot 是一个简单、灵活、优雅的中文语音对话机器人/智能音箱项目,支持ChatGPT多轮对话能力,还可能是首个支持脑机交互的开源智能音箱项目。
Python MIT License UpdatedJul 19, 2024 -
facechain Public
Forked from modelscope/facechainFaceChain is a deep-learning toolchain for generating your Digital-Twin.
Jupyter Notebook Apache License 2.0 UpdatedJul 8, 2024 -
GPT-SoVITS Public
Forked from RVC-Boss/GPT-SoVITS1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
-
Open-Sora Public
Forked from hpcaitech/Open-SoraOpen-Sora: Democratizing Efficient Video Production for All
Python Apache License 2.0 UpdatedMar 18, 2024 -
FunASR Public
Forked from modelscope/FunASRA Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models.
Python Other UpdatedMar 15, 2024 -
unstructured Public
Forked from Unstructured-IO/unstructuredOpen source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
HTML Apache License 2.0 UpdatedMar 8, 2024 -
3D-Speaker Public
Forked from modelscope/3D-SpeakerA repository for single- and multi-modal speaker verification, speaker recognition and speaker diarization.
Python Apache License 2.0 UpdatedFeb 28, 2024 -
Awesome-Multimodal-Large-Language-Models Public
Forked from BradyFU/Awesome-Multimodal-Large-Language-Models✨✨Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.
UpdatedFeb 27, 2024 -
self-llm Public
Forked from datawhalechina/self-llm《开源大模型食用指南》基于AutoDL快速部署开源大模型,更适合中国宝宝的部署教程
-
OOTDiffusion Public
Forked from levihsu/OOTDiffusionOfficial implementation of OOTDiffusion
Python Other UpdatedFeb 22, 2024 -
stable-diffusion-webui Public
Forked from AUTOMATIC1111/stable-diffusion-webuiStable Diffusion web UI
Python GNU Affero General Public License v3.0 UpdatedFeb 21, 2024 -
-
PALM-E Public
Forked from kyegomez/PALM-EImplementation of "PaLM-E: An Embodied Multimodal Language Model"
Python Apache License 2.0 UpdatedJan 29, 2024 -
magvit Public
Forked from google-research/magvitOfficial JAX implementation of MAGVIT: Masked Generative Video Transformer
Python Apache License 2.0 UpdatedJan 17, 2024 -
AutoGPT Public
Forked from Significant-Gravitas/AutoGPTAutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
JavaScript MIT License UpdatedJan 10, 2024 -
langchain Public
Forked from langchain-ai/langchain⚡ Building applications with LLMs through composability ⚡
Python MIT License UpdatedJan 8, 2024 -
langflow Public
Forked from langflow-ai/langflow⛓️ Langflow is a UI for LangChain, designed with react-flow to provide an effortless way to experiment and prototype flows.
Python MIT License UpdatedJan 8, 2024 -
audioldm_eval Public
Forked from haoheliu/audioldm_evalThis toolbox aims to unify audio generation model evaluation for easier comparison.
Python MIT License UpdatedJan 7, 2024 -
-
Qwen Public
Forked from QwenLM/QwenThe official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Python Apache License 2.0 UpdatedDec 6, 2023 -
AutoGPTQ Public
Forked from AutoGPTQ/AutoGPTQAn easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
Python MIT License UpdatedDec 5, 2023 -
llama Public
Forked from meta-llama/llamaInference code for LLaMA models
Python Other UpdatedDec 2, 2023 -
seamless_communication Public
Forked from facebookresearch/seamless_communicationFoundational Models for State-of-the-Art Speech and Text Translation
C Other UpdatedNov 30, 2023 -
Video-LLaVA Public
Forked from PKU-YuanGroup/Video-LLaVAVideo-LLaVA: Learning United Visual Representation by Alignment Before Projection
Python Apache License 2.0 UpdatedNov 30, 2023 -
Amphion Public
Forked from open-mmlab/AmphionAmphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…
UpdatedNov 28, 2023 -
safe-rlhf Public
Forked from PKU-Alignment/safe-rlhfSafe-RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
Python Apache License 2.0 UpdatedNov 24, 2023 -
Qwen-Audio Public
Forked from QwenLM/Qwen-AudioThe official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.
Other UpdatedNov 16, 2023 -
Bert-VITS2 Public
Forked from fishaudio/Bert-VITS2vits2 backbone with bert
Python GNU Affero General Public License v3.0 UpdatedNov 9, 2023