Skip to content
View AIDefender's full-sized avatar
  • Nanyang Technological University, Singapore
  • Singapore

Highlights

  • Pro

Block or report AIDefender

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

SGLang is a fast serving framework for large language models and vision language models.

Python 8,236 800 Updated Jan 31, 2025

Memory-Guided Diffusion for Expressive Talking Video Generation

Python 694 72 Updated Jan 24, 2025

g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains

Python 4,167 378 Updated Jan 27, 2025

Recipes to train reward model for RLHF.

Python 1,125 79 Updated Jan 22, 2025

A series of math-specific large language models of our Qwen2 series.

Python 738 83 Updated Jan 11, 2025

Python wrapper and simple addons for sioyek PDF viewer

Python 180 8 Updated Jun 6, 2023

Reasoning in LLMs: Papers and Resources, including Chain-of-Thought, OpenAI o1, and DeepSeek-R1 🍓

2,371 134 Updated Jan 31, 2025

800,000 step-level correctness labels on LLM solutions to MATH problems

Python 1,845 111 Updated Jun 1, 2023

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 4,161 403 Updated Jan 30, 2025

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Python 45,867 5,475 Updated Dec 18, 2024

LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models

Python 73 4 Updated Oct 16, 2024
Swift 775 38 Updated Jan 14, 2025

Really Fast End-to-End Jax RL Implementations

Python 801 68 Updated Sep 9, 2024

Assetto Corsa OpenAI Gym Environment

Python 97 6 Updated Dec 4, 2024

Official release for the code used in paper: Learning from Active Human Involvement through Proxy Value Propagation (NeurIPS 2023 Spotlight)

Python 28 6 Updated Jan 16, 2025

Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)

Python 157 17 Updated Oct 15, 2023

Implementation of Robust Imitation Learning against Variations in Environment Dynamics

Python 83 2 Updated Jan 30, 2023

Use ChatGPT for academic writing

597 73 Updated Nov 14, 2024

This list of writing prompts covers a range of topics and tasks, including brainstorming research ideas, improving language and style, conducting literature reviews, and developing research plans.

3,546 314 Updated Jan 25, 2024

The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvment, and skill curatio…

Python 1,976 173 Updated Nov 7, 2024

Code for Adapting Environment Sudden Changes by Learning Context Sensitive Policy

Python 20 6 Updated Jun 1, 2022

[ICLR 2025] A trinity of environments, tools, and benchmarks for general virtual agents

Python 191 17 Updated Oct 23, 2024

[NeurIPS 2023] The official code for paper "State Regularized Policy Optimization on Data with Dynamics Shift"

Python 4 1 Updated Dec 13, 2023

OMG-LLaVA and OMG-Seg codebase [CVPR-24 and NeurIPS-24]

Python 1,214 44 Updated Dec 11, 2024

RL starter files in order to immediately train, visualize and evaluate an agent without writing any line of code

Python 666 187 Updated May 12, 2024

Based on the learnware paradigm, the learnware package supports the entire process including the submission, usability testing, organization, identification, deployment, and reuse of learnwares. Si…

Python 97 2 Updated Dec 20, 2024

✯ 可直连访问的电视/广播图标库与相关工具项目 ✯ 🔕 永久免费 直连访问 完整开源 不断完善的台标 支持IPv4/IPv6双栈访问 🔕

JavaScript 24,531 3,749 Updated Jan 31, 2025

[IJCAI'24] An index of algorithms, approaches, and systems on cross-domain policy transfer for embodied agents

39 Updated Oct 30, 2024

A natural language interface for computers

Python 58,075 4,981 Updated Jan 24, 2025
Next