GLM-4.5: An open-source large language model designed for intelligent agents by Z.ai
-
Updated
Aug 1, 2025 - Python
GLM-4.5: An open-source large language model designed for intelligent agents by Z.ai
✨✨Latest Papers and Benchmarks in Reasoning with Foundation Models
TxAgent: An AI Agent for Therapeutic Reasoning Across a Universe of Tools
Project Page For "Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement"
Latest Advances on Long Chain-of-Thought Reasoning
The official implement of "VisionReasoner: Unified Visual Perception and Reasoning via Reinforcement Learning"
[Up-to-date] Awesome RAG Reasoning Resources
Deep Reasoning Translation (DRT) Project
ToolUniverse is a collection of biomedical tools designed for AI agents
a-m-team's exploration in large language modeling
📖 This is a repository for organizing papers, codes, and other resources related to Latent Reasoning.
Pivotal Token Search
OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement
Designing Multi-Agent Systems with Zero Supervision
Official Implementation of "Reasoning Language Models: A Blueprint"
A Comprehensive Survey on Evaluating Reasoning Capabilities in Multimodal Large Language Models.
CUREBench @ NeurIPS 2025: Benchmarking AI reasoning for therapeutic decision-making at scale
This is the repo of developing reasoning models in the specific domain of financial, aim to enhance models capabilities in handling financial reasoning tasks.
Lightweight replication study of DeepSeek-R1-Zero. Interesting findings include "No Aha Moment", "Longer CoT ≠ Accuracy", and "Language Mixing in Instruct Models".
Pure RL to post-train base models for social reasoning capabilities. Lightweight replication of DeepSeek-R1-Zero with Social IQa dataset.
Add a description, image, and links to the reasoning-language-models topic page so that developers can more easily learn about it.
To associate your repository with the reasoning-language-models topic, visit your repo's landing page and select "manage topics."