Starred repositories
🚀🚀 「大模型」3小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 3 hours!
🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
A generative speech model for daily dialogue.
Effortless data labeling with AI support from Segment Anything and other awesome models.
yolov8 hub,cpp with onnxruntime and opencv
✨✨Latest Advances on Multimodal Large Language Models
Making large AI models cheaper, faster and more accessible
Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言,同时支持语音识别转录、语音合成、字幕翻译。
Character Animation (AnimateAnyone, Face Reenactment)
torch_musa is an open source repository based on PyTorch, which can make full use of the super computing power of MooreThreads graphics cards.
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型
21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
YoloV8 for a bare Raspberry Pi 4 or 5
NumPy aware dynamic Python compiler using LLVM
Real-time face swap for PC streaming or video calls
Python library for working with HEIF images and plugin for Pillow.
[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
The goal of CLAIMED is to enable low-code/no-code rapid prototyping style programming to seamlessly CI/CD into production.
A cross-platform video structuring (video analysis) framework. If you find it helpful, please give it a star: ) 跨平台的视频结构化(视频分析)框架,觉得有帮助的请给个星星 : )
Code release for ActionFormer (ECCV 2022)
We write your reusable computer vision tools. 💜