Python 48 1 Updated Jul 10, 2025
Jupyter Notebook 45 2 Updated Sep 11, 2025

awesome-prompt-for-academic, welcome to contribute

Shell 58 2 Updated Aug 25, 2025

A Survey of Reinforcement Learning for Large Reasoning Models

1,093 66 Updated Sep 12, 2025

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

TypeScript 64,283 6,691 Updated Sep 15, 2025

Official Code of Memento: Fine-tuning LLM Agents without Fine-tuning LLMs

Python 1,454 165 Updated Sep 15, 2025

An Awesome List of Agentic Models Trained with Reinforcement Learning

466 13 Updated Sep 11, 2025

HydraProbe: The LLM Vulnerability Hunter

Python 3 Updated Aug 27, 2025

This repository serves as a comprehensive knowledge hub, curating cutting-edge research papers and developments across 25+ specialized domains

60 2 Updated Sep 14, 2025

Awesome Reasoning LLM Tutorial/Survey/Guide

Python 2,060 142 Updated Jul 11, 2025

The LLM's practical guide: From the fundamentals to deploying advanced LLM and RAG apps to AWS using LLMOps best practices

Python 4,120 939 Updated Mar 8, 2025

🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation

Python 18,048 2,093 Updated Sep 6, 2025

Official implementation of “MeshHeart: A Geometric Transformer for Conditional 3D+t Cardiac Mesh Generation” (Nature Machine Intelligence 2025)

Python 15 4 Updated Jun 19, 2025
Python 1 Updated Jun 19, 2025

✨ Agentic Reinforced Policy Optimization

Python 590 27 Updated Sep 7, 2025

GenAI Agent Framework, the Pydantic way

Python 12,464 1,229 Updated Sep 12, 2025
Python 54 2 Updated Aug 28, 2025
6 Updated Aug 14, 2025

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

MATLAB 11,791 1,130 Updated Jul 29, 2025

Awesome Jailbreak, red teaming arXiv papers (automatically updated every 12 hours)

Python 62 8 Updated Sep 15, 2025

[Up-to-date] Large Language Model Agent: A Survey on Methodology, Applications and Challenges

1,663 48 Updated Sep 2, 2025

[ICML 2025] Official Implementation of GLIDER

Python 54 5 Updated May 27, 2025

Hierarchical Reasoning Model Official Release

Python 10,571 1,555 Updated Sep 9, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 7,925 769 Updated Sep 15, 2025

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 18,397 1,739 Updated Sep 11, 2025

Python-based LLM query and chat history visualization library.

Python 1 Updated Jul 14, 2025

[CVPR2025] SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories

Python 70 1 Updated Aug 8, 2025