reasoning-models

Here are 30 public repositories matching this topic...

zilliztech / deep-searcher

Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.

agent openai grok claude rag milvus vector-database llm zilliz deepseek agentic-rag grok3 reasoning-models deepseek-r1 deep-research qwen3 llama4

Updated Nov 19, 2025
Python

MiniMax-AI / MiniMax-M1

MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model.

large-language-models llm reasoning-models minimax-m1

Updated Jul 7, 2025
Python

Zefan-Cai / R-KV

[NeurIPS 2025] R-KV: Redundancy-aware KV Cache Compression for Reasoning Models

llm kvcache reasoning-models

Updated Oct 16, 2025
Python

HKUDS / LightReasoner

"LightReasoner: Can Small Language Models Teach Large Language Models Reasoning?"

post-training large-language-models reasoning-models token-efficiency

Updated Nov 1, 2025
Python

WeiboAI / VibeThinker

Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

ai transformer language-model huggingface llm sllm reasoning-language-models reasoning-models livecodebench aime2025

Updated Nov 19, 2025
Python

eric-ai-lab / Soft-Thinking

Official implementation of the NeurIPS 2025 paper "Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space"

soft-reasoning chain-of-thought-reasoning reasoning-models soft-thinking continous-space-reasoning soft-token concept-token

Updated Nov 14, 2025
Python

UCSC-VLAA / MedReason

MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs

reasoning medical-dataset medical-large-language-models reasoning-models

Updated Jun 19, 2025
Python

hao-ai-lab / Dynasor

[NeurIPS 2025] A simple vLLM extension that speeds up reasoning models without training.

llm reasoning-models deepseek-r1

Updated May 31, 2025
Python

Alpha-Innovator / OmniCaptioner

Official Repository of OmniCaptioner

multi-modal captioning-images caption-generation vlms reasoning-models deepseek-r1 multi-modal-deepseek-r1

Updated Apr 23, 2025
Python

codelion / pts

Pivotal Token Search

Updated Jul 15, 2025
Python

OpenSPG / KAG-Thinker

An interactive thinking and deep reasoning model. It provides a cognitive reasoning paradigm for complex multi-hop problems.

kag deepthinking deepsearchalgorithmus reasoning-models deepresearch

Updated Nov 14, 2025
Python

fscdc / ReasonMap

[arXiv 2025] Can MLLMs Guide Me Home? A Benchmark Study on Fine-Grained Visual Reasoning from Transit Maps

reasoning multimodal-large-language-models reasoning-models efficient-reasoning

Updated Nov 8, 2025
Python

czg1225 / VeriThinker

[NeurIPS 2025] VeriThinker: Learning to Verify Makes Reasoning Model Efficient

efficiency fine-tuning large-language-models reasoning-models deepseek-r1-distill-llama deepseek-r1-distill-qwen

Updated Sep 27, 2025
Python

DolbyUUU / Logic-RL-Lite

Lightweight replication study of DeepSeek-R1-Zero. Interesting findings include "No Aha Moment", "Longer CoT ≠ Accuracy", and "Language Mixing in Instruct Models".

reinforcement-learning fine-tuning post-training llm deepseek gpt-o1 reasoning-language-models reasoning-models deepseek-r1

Updated Apr 1, 2025
Python

DolbyUUU / DeepEnlighten

Pure RL to post-train base models for social reasoning capabilities. Lightweight replication of DeepSeek-R1-Zero with the Social IQa dataset.

reinforcement-learning fine-tuning post-training llm deepseek gpt-o1 reasoning-language-models reasoning-models deepseek-r1

Updated Mar 16, 2025
Python

UKPLab / acl2025-diverse-cot

Code for the 2025 ACL publication "Fine-Tuning on Diverse Reasoning Chains Drives Within-Inference CoT Refinement in LLMs"

cot lrm chain-of-thought large-reasoning-models reasoning-models

Updated Jun 25, 2025
Python

AbhaySingh71 / AI-Lawyer-RAG-with-Deepseek

AI Lawyer is an intelligent reasoning legal assistant powered by DeepSeek, Ollama RAG, and LangChain, designed to streamline legal research and document analysis. By leveraging retrieval-augmented generation (RAG), it provides precise legal insights and contract summarization. An intuitive Streamlit-based UI lets users analyze legal documents.