-
BSc Artificial Intelligence @ Beijing Institute of Technology -> MSc Computing (AIML) @ Imperial College London
- London
-
13:35
(UTC +01:00) - leo9344.github.io
- https://scholar.google.com/citations?user=vErB2ZAAAAAJ&hl=zh-CN
Highlights
- Pro
Stars
awesome-prompt-for-academic, welcome to contribute
A Survey of Reinforcement Learning for Large Reasoning Models
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
Official Code of Memento: Fine-tuning LLM Agents without Fine-tuning LLMs
An Awesome List of Agentic Model trained with Reinforcement Learning
This repository serves as a comprehensive knowledge hub, curating cutting-edge research papers and developments across 25+ specialized domains
Awesome Reasoning LLM Tutorial/Survey/Guide
The LLM's practical guide: From the fundamentals to deploying advanced LLM and RAG apps to AWS using LLMOps best practices
🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation
Official implementation of “MeshHeart: A Geometric Transformer for Conditional 3D+t Cardiac Mesh Generation“ (Nature Machine Intelligence 2025)
GenAI Agent Framework, the Pydantic way
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
Awesome Jailbreak, red teaming arxiv papers (Automatically Update Every 12th hours)
[Up-to-date] Large Language Model Agent: A Survey on Methodology, Applications and Challenges
Hierarchical Reasoning Model Official Release
An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
Python-based LLM query and chat history visualization library.
[CVPR2025] SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories