- Shenzhen, China
🤖 AI
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
Generative Agents: Interactive Simulacra of Human Behavior
🔉 YouTube video transcription with OpenAI's Whisper
TypeChat is a library that makes it easy to build natural language interfaces using types.
An attempt to build a working, locally-running cheap version of Generative Agents: Interactive Simulacra of Human Behavior
Ecoute is a live transcription tool that provides real-time transcripts for both the user's microphone input (You) and the user's speakers output (Speaker) in a textbox. It also generates a suggest…
Whisper based Japanese subtitle generator
ImageBind: One Embedding Space to Bind Them All
Overview and tutorial of the LangChain Library
Generate 3D objects conditioned on text or images
Code for the paper "Jukebox: A Generative Model for Music"
Instruct-tune LLaMA on consumer hardware
A natural language interface for computers
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.
Stable Diffusion web UI
A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour
A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.
Bark Voice Cloning and Voice Cloning for Chinese Speech
IDE-style command-line autocomplete
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Foundational Models for State-of-the-Art Speech and Text Translation
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
A voice cloning tool with a web interface that uses your own voice or any other sound to record audio
The code for the bark-voicecloning model. Training and inference.
An introductory PyTorch tutorial. Read online: https://datawhalechina.github.io/thorough-pytorch/
A simple and open-source analogue of the HeyGen system
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For the HD commercial model, please try out Sync Labs
Industry leading face manipulation platform