Skip to content

sampler: Disable backend sampling if reasoning budget is enabled#21209

Merged
ggerganov merged 1 commit into
ggml-org:masterfrom
Galunid:sampler-fix
Mar 31, 2026
Merged

sampler: Disable backend sampling if reasoning budget is enabled#21209
ggerganov merged 1 commit into
ggml-org:masterfrom
Galunid:sampler-fix

Conversation

@Galunid

@Galunid Galunid commented Mar 31, 2026

Copy link
Copy Markdown
Contributor

Overview

Fixes #21208, turns out backend sampling is enabled by default, despite docs saying otherwise. Similar check exists for grammar, so it should be fine.

Requirements

@Galunid Galunid requested a review from a team as a code owner March 31, 2026 06:17
@aldehir

aldehir commented Mar 31, 2026

Copy link
Copy Markdown
Contributor

I had a feeling I was missing another check somewhere... Thanks for the PR!

@aldehir aldehir requested a review from ggerganov March 31, 2026 07:10
@ggerganov ggerganov merged commit 5ce013c into ggml-org:master Mar 31, 2026
1 check passed
slartibardfast pushed a commit to slartibardfast/llama.cpp that referenced this pull request Apr 12, 2026
Seunghhon pushed a commit to Seunghhon/llama.cpp that referenced this pull request Apr 26, 2026
rsenthilkumar6 pushed a commit to rsenthilkumar6/llama.cpp that referenced this pull request May 1, 2026
ljubomirj pushed a commit to ljubomirj/llama.cpp that referenced this pull request May 6, 2026
my-other-github-account pushed a commit to my-other-github-account/llama.cpp that referenced this pull request May 15, 2026
my-other-github-account pushed a commit to my-other-github-account/llama.cpp that referenced this pull request May 15, 2026
fewtarius pushed a commit to fewtarius/llama.cpp that referenced this pull request May 30, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Eval bug: using reasoning budget in combination with backend sampling is not supported despite backend-sampling not being enabled

3 participants