Name and Version
llama-server f30f099
Operating systems
Linux
GGML backends
CUDA
Hardware
RTX 4090, CUDA
Models
E.g. Code Qwen 2.5 7B-Chat (Q8)
Problem description & steps to reproduce
llama-server stopped generating any tokens for me, regardless of model, starting with commit f30f099 from #11285.
Simply reverting that commit, e.g. on top of today's master (6171c9d), fixes the issue for me.
To reproduce: go to http://localhost:8080, enter a question, and hit return; nothing happens. An equivalent API-level reproduction is sketched below.
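For reference, the same failure can be triggered without the web UI by posting directly to the OpenAI-compatible chat endpoint that appears in the log below. A minimal sketch (the exact payload is an assumption; any valid chat request should behave the same):

```python
# Minimal reproduction sketch against a locally running llama-server.
# Assumptions: server listens on localhost:8080; the "model" field is a
# placeholder since llama-server serves whatever model it was started with.
import json
import urllib.error
import urllib.request

req = urllib.request.Request(
    "http://localhost:8080/v1/chat/completions",
    data=json.dumps({
        "model": "default",  # placeholder model name (assumption)
        "messages": [{"role": "user", "content": "Hello"}],
    }).encode("utf-8"),
    headers={"Content-Type": "application/json"},
    method="POST",
)

try:
    with urllib.request.urlopen(req) as resp:
        print(resp.status, resp.read().decode("utf-8"))
except urllib.error.HTTPError as e:
    # On the affected commit the server answers 400 instead of generating tokens.
    print(e.code, e.read().decode("utf-8"))
```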
First Bad Commit
f30f099 (#11285)
Relevant log output
main: server is listening on http://0.0.0.0:8080 - starting the main loop
srv  update_slots: all slots are idle
request: GET / 127.0.0.1 200
request: GET /favicon.ico 400
request: POST /v1/chat/completions 400