### Your current environment

(The output of `python collect_env.py` was not included.)
### 🐛 Describe the bug

#### Description
- In vLLM 0.7 and earlier, using a high temperature (10) with a random input string always returns `max_tokens` tokens, i.e. random output of the requested length (see the sketch below for why this is expected).
- With a temperature of 0, the same versions return something similar to "It seems like you've entered a string of characters that doesn't appear to be a meaningful word, phrase, or question."
- With the Docker images 0.8.0 or 0.8.1, no matter the temperature, it always answers something like "It seems like you've entered a string of characters that doesn't appear to be a meaningful word, phrase, or question."
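For reference, here is a toy illustration (made-up logits, nothing taken from vLLM internals) of why a high temperature should give random output: sampling divides the logits by the temperature before the softmax, so a temperature of 10 flattens the next-token distribution toward uniform.

```python
import math

# Toy logits, invented for illustration only
logits = [5.0, 2.0, 0.5]

def softmax_with_temperature(zs, temperature):
    # Divide logits by the temperature, then apply softmax
    exps = [math.exp(z / temperature) for z in zs]
    total = sum(exps)
    return [e / total for e in exps]

print(softmax_with_temperature(logits, 1.0))   # peaked: ~[0.94, 0.05, 0.01]
print(softmax_with_temperature(logits, 10.0))  # flat:   ~[0.42, 0.31, 0.27]
```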
#### Details

I tried multiple models and the temperature seems to be ignored for all of them.
#### Reproduction
Starting a Docker container with:

```bash
docker run --gpus all \
  --entrypoint bash \
  -v ~/.cache/huggingface:/root/.cache/huggingface \
  --ipc=host \
  -p 8000:8000 \
  -it \
  vllm/vllm-openai:v0.7.3
```
and running

```bash
python3 -m vllm.entrypoints.openai.api_server \
  --model Qwen/Qwen2.5-VL-7B-Instruct \
  --trust-remote-code \
  --max-model-len 32768 \
  --tensor-parallel-size 2 \
  --gpu-memory-utilization 0.95
```

inside the container on the server side, then running the following on the client side:
```python
import random
import string

from openai import OpenAI

model_name = "Qwen/Qwen2.5-VL-7B-Instruct"

client = OpenAI(
    api_key="EMPTY",
    base_url="http://localhost:8000/v1",
)

response = client.chat.completions.create(
    model=model_name,
    max_tokens=1000,
    temperature=10,
    messages=[
        {"role": "system", "content": "You are Qwen."},
        {
            "role": "user",
            # Random 10-character alphanumeric string as a nonsense prompt
            "content": "".join(random.choices(string.ascii_letters + string.digits, k=10)),
        },
    ],
)
print(response.choices[0].message.content)
```
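To make the comparison explicit, here is a minimal sketch (the temperature values, `max_tokens`, and the 120-character truncation are arbitrary choices of mine) that sends the same nonsense prompt at several temperatures. With correct sampling, temperature 10 should give near-random tokens; if the bug reproduces, every temperature yields essentially the same "meaningless string" style reply.

```python
import random
import string

from openai import OpenAI

client = OpenAI(api_key="EMPTY", base_url="http://localhost:8000/v1")
model_name = "Qwen/Qwen2.5-VL-7B-Instruct"

# One nonsense prompt, asked at several temperatures
prompt = "".join(random.choices(string.ascii_letters + string.digits, k=10))

for temp in (0.0, 1.0, 10.0):
    response = client.chat.completions.create(
        model=model_name,
        max_tokens=100,
        temperature=temp,
        messages=[{"role": "user", "content": prompt}],
    )
    print(f"temperature={temp}: {response.choices[0].message.content[:120]!r}")
```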
### Before submitting a new issue...
- [x] Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the [documentation page](https://docs.vllm.ai/en/latest/), which can answer lots of frequently asked questions.