Skip to content

Commit 658451b

Browse files
committed
ci: server: fix server tests after ggml-org#6638
1 parent d06d354 commit 658451b

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

examples/server/server.cpp

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1208,7 +1208,7 @@ struct server_context {
12081208
}
12091209

12101210
auto n_ctx_train = llama_n_ctx_train(model);
1211-
if (slot.params.n_predict < 1 && slot.ga_n == 1
1211+
if (slot.params.n_predict < 1 && slot.n_predict < 1 && slot.ga_n == 1
12121212
&& slot.n_prompt_tokens + slot.n_decoded >= n_ctx_train) {
12131213
LOG_WARNING("n_predict is not set and self-context extend is disabled."
12141214
" Limiting generated tokens to n_ctx_train to avoid EOS-less generation infinite loop", {

0 commit comments

Comments
 (0)