math-reasoning

Star

Here are 16 public repositories matching this topic...

YutingLi0606 / Vision-Matters

Star

(ArXiv25) Vision Matters: Simple Visual Perturbations Can Boost Multimodal Math Reasoning

mllm mllm-reasoning math-reasoning

Updated Sep 30, 2025
Python

lupantech / ineqmath

Star

Solving Inequality Proofs with Large Language Models.

theorem-proving inequality olympiad llms llm-as-a-judge math-reasoning

Updated Dec 15, 2025
Python

InternLM / Spark

Star

An official implementation of "SPARK: Synergistic Policy And Reward Co-Evolving Framework"

self-improvement multi-modal large-language-models vision-language-model reward-model large-vision-language-models self-rewarding math-reasoning

Updated Oct 23, 2025
Python

goblinasaddy / nanoJEPA

Star

A minimal JEPA-based language model demonstrating latent-space reasoning on GSM8K using a single decoder-only Transformer.

deep-learning pytorch transformer research-project representation-learning language-model latent-space gsm8k jepa math-reasoning

Updated Feb 28, 2026
Python

Seanaaa0 / QT-R1

Star

STaR × S1 math pipeline on Qwen2.5-1.5B. LoRA, strict Final: format, ~20–30% acc (OpenR1-Math split).

transformers star dataset-pipeline qlora peft-fine-tuning-llm qwen2-5 math-reasoning openr1-math

Updated Sep 6, 2025
Python

wsdjzlh / math-process-supervision-qwen

Star

A controlled LoRA finetuning study on process supervision for mathematical reasoning with Qwen2.5-Math-7B-Instruct.

lora process-supervision llm gsm8k qwen math-reasoning

Updated Apr 23, 2026
Python

mianhua157 / math-data-cleaning-qwen

Star

Data cleaning and structuring pipeline for math reasoning tasks using Qwen3-0.6B for LLM post-training.

nlp machine-learning transformers pytorch data-processing data-cleaning post-training llm qwen math-reasoning

Updated Apr 13, 2026
Python

fikreab-s / aimo3-math-reasoning-pipeline

Star

Tool-Integrated Reasoning for competition math — weighted voting, difficulty-aware allocation

kaggle llm math-reasoning tool-integrated-reasoning

Updated May 9, 2026
Python

KaiP-598 / grpo-from-scratch

Star

GRPO (Group Relative Policy Optimization) implemented from scratch in PyTorch. 10 ablation experiments.

training reinforcement-learning pytorch from-scratch llm rlhf vllm deepseek-r1 grpo math-reasoning

Updated Apr 26, 2026
Python

hoadm-net / MathCoRL

Star

Comprehensive framework for mathematical reasoning research with dual research capabilities

nlp prompt-engineering math-reasoning

Updated Mar 18, 2026
Python

huyxdang / RLVR-Decomposed-Implementation

Star

Small-scale Implementation and Extension of “The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning” (NeurIPS '25)

reinforcement-learning llm math-reasoning

Updated Oct 23, 2025
Python

antonisbaro / promptimus-prime

Star

Transforming weak prompts into reasoning machines using Textual Gradients and AdalFlow. Runs on Colab.

transformers pytorch google-colab large-language-models llm prompt-engineering chain-of-thought generative-ai gsm8k prompt-optimization textual-gradients automated-prompt-engineering math-reasoning llm-autodiff adalflow

Updated Jan 28, 2026
Python

fikreab-s / small-model-rl-verifier-loop

Star

GRPO reinforcement learning with verifiable rewards for sub-2B models

reinforcement-learning verifier llm grpo math-reasoning

Updated May 9, 2026
Python

goldbar123467 / GSM8K-BenchMark-Qwen-3-

Star

GSM8K benchmark results for Qwen3-4B and Qwen3-8B fine-tuned

synthetic-data fine-tuning gsm8k qlora qwen math-reasoning

Updated Feb 28, 2026
Python

dipta007 / GanitLLM

Star

A Bengali Math LLM

math rl reasoning grpo math-reasoning mathllm

Updated May 19, 2026
Python

Math-llm-lab / llm-math-econ-evaluation

Star

NDA-safe excerpts of math & economics modeling tasks for LLM reasoning evaluation and numerical verification.

python monte-carlo root-finding numerical-methods model-validation economic-modeling quantitative-research llm-evaluation math-reasoning

Updated Dec 23, 2025
Python

Improve this page

Add a description, image, and links to the math-reasoning topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the math-reasoning topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

math-reasoning

Here are 16 public repositories matching this topic...

YutingLi0606 / Vision-Matters

lupantech / ineqmath

InternLM / Spark

goblinasaddy / nanoJEPA

Seanaaa0 / QT-R1

wsdjzlh / math-process-supervision-qwen

mianhua157 / math-data-cleaning-qwen

fikreab-s / aimo3-math-reasoning-pipeline

KaiP-598 / grpo-from-scratch

hoadm-net / MathCoRL

huyxdang / RLVR-Decomposed-Implementation

antonisbaro / promptimus-prime

fikreab-s / small-model-rl-verifier-loop

goldbar123467 / GSM8K-BenchMark-Qwen-3-

dipta007 / GanitLLM

Math-llm-lab / llm-math-econ-evaluation

Improve this page

Add this topic to your repo