Skip to content

[Bug] baichuan-13b-chat Service exception after long run #677

Closed as not planned
@Tomorrowxxy

Description

@Tomorrowxxy

Start command

python -m vllm.entrypoints.openai.api_server --model baichuan-inc/Baichuan-13B-Chat --host 0.0.0.0 --port 8777 --trust-remote-code --dtype half

After about 12 hours of operation, the inference service stopped working

GPU:V100
CUDA:11.4

Screenshot of the problem:
Xnip2023-08-05_12-03-19

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions