Stars
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
get-convex / ai-town
Forked from a16z-infra/ai-town
An MIT-licensed, deployable starter kit for building and customizing your own version of AI town - a virtual town where AI characters live, chat and socialize.
Chinese version implementation of Generative Agents: Interactive Simulacra of Human Behavior
Generative Agents: Interactive Simulacra of Human Behavior
A Simple, Distributed and Asynchronous Multi-Agent Reinforcement Learning Framework for Google Research Football AI.
Learning-based agent for Google Research Football (football game agent)
[NeurIPS 2023] Generating Mario Levels with GPT2. Code for the paper "MarioGPT: Open-Ended Text2Level Generation through Large Language Models" https://arxiv.org/abs/2302.05981
ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
GPT2 for Chinese chitchat (implements the MMI idea from DialoGPT)
Python Fan calculator for Chinese Standard Mahjong
Reinforcement learning (RL) implementation of the imperfect-information game Mahjong, using Markov decision processes to predict future game states
Simple and easily configurable 3D FPS-game-like environments for reinforcement learning
Java and Python protobuf RPC implementation using TCP/IP sockets.
A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models
Massively Parallel Deep Reinforcement Learning. 🔥
Chinese Transformer Generative Pre-Training Model
Chinese version of GPT2 training code, using BERT tokenizer.
Proximal Policy Optimization with TensorFlow 2.0