[Bug]: Reward model usage

### Your current environment


I am recently trying to use the new feature to serve reward model with vLLM, but I note that the sequential classifier type RM is not supported well (I am using `0.7.1`), 

I checked #8976 #10444 seems it is been resolved already, I am not sure if it is a bug or not




### 🐛 Describe the bug

for example (adapted from [this](https://docs.vllm.ai/en/latest/models/pooling_models.html#llm-encode) )

```
from vllm import LLM

llm = LLM(model="Skywork/Skywork-Reward-Llama-3.1-8B-v0.2", task="reward")
(output,) = llm.encode("Hello, my name is")

data = output.outputs.data
print(f"Data: {data!r}")
```

And I got `ValueError: Model architectures ['LlamaForSequenceClassification'] are not supported for now. ` Similarly, `Gemma2ForSequenceClassification` is not supported. I wonder if there will be support for these models

### Before submitting a new issue...

- [x] Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the [documentation page](https://docs.vllm.ai/en/latest/), which can answer lots of frequently asked questions.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[Bug]: Reward model usage #12791

Your current environment

🐛 Describe the bug

Before submitting a new issue...

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

[Bug]: Reward model usage #12791

Description

Your current environment

🐛 Describe the bug

Before submitting a new issue...

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions