Stars
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Code and documentation to train Stanford's Alpaca models, and generate the data.
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
Open-source code for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Generative Models by Stability AI
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in PyTorch
ModelScope: bring the notion of Model-as-a-Service to life.
A Collection of Variational Autoencoders (VAE) in PyTorch.
Google AI 2018 BERT PyTorch implementation
Firefly: a training tool for large models, supporting training of Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
Nightly release of ControlNet 1.1
Official implementation code of the paper "AnyText: Multilingual Visual Text Generation And Editing"
A tool for extracting plain text from Wikipedia dumps
mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
Code for generating synthetic text images as described in "Synthetic Data for Text Localisation in Natural Images", Ankush Gupta, Andrea Vedaldi, Andrew Zisserman, CVPR 2016.
CCNet: Criss-Cross Attention for Semantic Segmentation (TPAMI 2020 & ICCV 2019).
Generate text images for training deep learning OCR models
Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, arXiv 2022 / ICCV 2023
PyTorch implementation of Spatial Transformer Network (STN) with Thin Plate Spline (TPS)
Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs (CVPR 2022)
Natural Language Toolkit for Indic Languages aims to provide out-of-the-box support for various NLP tasks that an application developer might need