add dry sampler #513
Conversation
This already looks so much better than #504, just from how much more similar it is to the reference implementation. Testing that one was taking time because it looked like it had a lot of edge cases that would lead to issues, or at least some incorrect behavior.
    add_executable(${TARGET} rpc-server.cpp)
    target_link_libraries(${TARGET} PRIVATE ggml)
    target_compile_features(${TARGET} PRIVATE cxx_std_17)
Why do we need this?
It's in the mainline file.
    SERVER_VERBOSE=$<BOOL:${LLAMA_SERVER_VERBOSE}>
    )

    if (MSVC)
Why is this needed?
It's for the stack size code: the add_tensor function in ggml-rpc.cpp uses recursion to serialize the graph, and Windows has a very small stack size by default, so it is easy to hit a stack overflow if the graph is too complex. This is not needed for the DRY sampler; it is a bug fix for RPC.
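For reference, a minimal sketch of what such an MSVC guard can look like in CMake; the target_link_options call and the 8 MB value are assumptions for illustration, not necessarily what this PR uses:

    # Sketch: raise the ~1 MB default MSVC stack reservation so the recursive
    # graph serialization in ggml-rpc.cpp does not overflow on complex graphs.
    if (MSVC)
        target_link_options(${TARGET} PRIVATE /STACK:8388608)  # assumed 8 MB
    endif()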
src/llama.cpp (outdated)
    }

    struct llama_sampler_dry * llama_sampler_init_dry(const struct llama_model * model, float dry_multiplier, float dry_base, int32_t dry_allowed_length, int32_t dry_penalty_last_n, const char ** seq_breakers, size_t num_breakers) {
        return llama_sampler_init_dry_impl(model->vocab, llama_n_ctx_train(model), dry_multiplier, dry_base, dry_allowed_length, dry_penalty_last_n, seq_breakers, num_breakers);
The DRY sampler only depends on the vocabulary, not the entire model. Wouldn't it have been better to define the interface that way, taking a pointer to the vocabulary instead of the model?
I can change it.
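For reference, a sketch of what the vocabulary-based interface suggested above could look like; the parameter names are carried over from the diff, and n_ctx_train stands in for the value previously read via llama_n_ctx_train(model). The signature actually merged may differ:

    struct llama_sampler_dry * llama_sampler_init_dry(
            const struct llama_vocab * vocab,        // only the vocab is needed
            int32_t                    n_ctx_train,  // was llama_n_ctx_train(model)
            float                      dry_multiplier,
            float                      dry_base,
            int32_t                    dry_allowed_length,
            int32_t                    dry_penalty_last_n,
            const char              ** seq_breakers,
            size_t                     num_breakers);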
@saood06 Any other comments?
Tried to build this to test and got this:

    /ik_llama.cpp/src/../include/llama.h:1240:54: error: unknown type name ‘llama_sampler_dry’
     1240 | void llama_sample_dry(struct llama_context* ctx, llama_sampler_dry* smpl, llama_token_data_array* candidates_p);
          |                                                   ^~~~~~~~~~~~~~~~~
Can you clean the build folder and try again? It compiles fine for me.
This was with a clean build folder.
Maybe it is because you set …
Should be good this time.
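For reference, one plausible cause of the earlier "unknown type name" error is that llama.h used the type before declaring it (the header is also consumed as C, where the struct keyword is required). A minimal sketch of that kind of fix; the actual change in the PR may differ:

    // Forward-declare the opaque sampler type before any prototype uses it:
    struct llama_sampler_dry;

    void llama_sample_dry(
            struct llama_context     * ctx,
            struct llama_sampler_dry * smpl,
            llama_token_data_array   * candidates_p);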
This reverts commit 3f111ad.
I tested this using the example in vllm-project/vllm#11368 and it looks OK.