Error: `resource already mapped` in `custom_all_reduce.cuh`

This is a issue relared to https://github.com/vllm-project/vllm/issues/2619#issuecomment-1913490065

I have tried `ray=2.9.1` with dev code in commit #2636 

vllm.entrypoints.openai.api_server --model ./Mistral-7B-Instruct-v0.2-AWQ --quantization awq --dtype auto --host 0.0.0.0 --port 8081 --tensor-parallel-size 2
but I meet another error
Failed: Cuda error /home/my/vllm/csrc/custom_all_reduce.cuh:417 'resource already mapped' Segmentation fault (core dumped)

I am running with `python=3.11`, `CUDA 12.1`, `driver 530` with 2x RTX 3090 NVLink.
When I rollback to commit #2622, the program works well. So it seems it is caused by #2192

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Error: `resource already mapped` in `custom_all_reduce.cuh` #2641

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

Error: resource already mapped in custom_all_reduce.cuh #2641

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions

Error: `resource already mapped` in `custom_all_reduce.cuh` #2641