Stars
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Models and examples built with TensorFlow
Robust Speech Recognition via Large-Scale Weak Supervision
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
A high-throughput and memory-efficient inference and serving engine for LLMs
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Official Code for DragGAN (SIGGRAPH 2023)
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
Instant voice cloning by MIT and MyShell. Audio foundation model.
OpenMMLab Detection Toolbox and Benchmark
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
We write your reusable computer vision tools. 💜
DSPy: The framework for programming—not prompting—language models
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Universal LLM Deployment Engine with ML Compilation
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Agno is a lightweight framework for building multi-modal Agents
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone