[Performance]: The performance of bge-rerank model on vllm and huggingface is inconsistent

### Proposal to improve performance

_No response_

### Report of performance regression

_No response_

### Misc discussion on performance

The script loaded using vllm is:
```
from vllm import LLM
model_path = "/data/reranker/bge-reranker-large"
# Create an LLM.
# You should pass task="score" for cross-encoder models
model = LLM(
    model=model_path,
    task="score",
    enforce_eager=True,
    dtype="float32"
)
```
huggingface scripts mainly use default parameters for，float32
```
 CrossEncoder(
            model, max_length=512, device=device
        )
```
After running 200 sets of data, in which 100 paragraphs were rearranged, 19 sets were found to be in inconsistent order after rearrangement
I also looked up some information online and there is the following information：
https://zhuanlan.zhihu.com/p/658780653

I do not know whether the parameter of my vllm is wrong, or what causes it?


### Your current environment (if you think it is necessary)

Current environment
linux: ubuntu 18  
NVIDIA-SMI 525.105.17   Driver Version: 525.105.17   CUDA Version: 12.0  Tesla T4 
vllm==0.6.6

### Before submitting a new issue...

- [X] Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the [documentation page](https://docs.vllm.ai/en/latest/), which can answer lots of frequently asked questions.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[Performance]: The performance of bge-rerank model on vllm and huggingface is inconsistent #11568

Proposal to improve performance

Report of performance regression

Misc discussion on performance

Your current environment (if you think it is necessary)

Before submitting a new issue...

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

[Performance]: The performance of bge-rerank model on vllm and huggingface is inconsistent #11568

Description

Proposal to improve performance

Report of performance regression

Misc discussion on performance

Your current environment (if you think it is necessary)

Before submitting a new issue...

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions