Skip to content

[Bug]: Failed to load Qwen3-Embedding model. #19250

Closed
@Xu-Wenqing

Description

@Xu-Wenqing

Your current environment

Qwen3-Embedding-0.6B: https://www.modelscope.cn/models/Qwen/Qwen3-Embedding-0.6B/summary
Failed to load

🐛 Describe the bug

Loading safetensors checkpoint shards:   0% Completed | 0/1 [00:00<?, ?it/s]
ERROR 06-06 11:53:56 [core.py:515] EngineCore failed to start.
ERROR 06-06 11:53:56 [core.py:515] Traceback (most recent call last):
ERROR 06-06 11:53:56 [core.py:515]   File "/mnt/workspace/vllm/vllm/v1/engine/core.py", line 506, in run_engine_core
ERROR 06-06 11:53:56 [core.py:515]     engine_core = EngineCoreProc(*args, **kwargs)
ERROR 06-06 11:53:56 [core.py:515]                   ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
ERROR 06-06 11:53:56 [core.py:515]   File "/mnt/workspace/vllm/vllm/v1/engine/core.py", line 390, in __init__
ERROR 06-06 11:53:56 [core.py:515]     super().__init__(vllm_config, executor_class, log_stats,
ERROR 06-06 11:53:56 [core.py:515]   File "/mnt/workspace/vllm/vllm/v1/engine/core.py", line 76, in __init__
ERROR 06-06 11:53:56 [core.py:515]     self.model_executor = executor_class(vllm_config)
ERROR 06-06 11:53:56 [core.py:515]                           ^^^^^^^^^^^^^^^^^^^^^^^^^^^
ERROR 06-06 11:53:56 [core.py:515]   File "/mnt/workspace/vllm/vllm/executor/executor_base.py", line 53, in __init__
ERROR 06-06 11:53:56 [core.py:515]     self._init_executor()
ERROR 06-06 11:53:56 [core.py:515]   File "/mnt/workspace/vllm/vllm/executor/uniproc_executor.py", line 48, in _init_executor
ERROR 06-06 11:53:56 [core.py:515]     self.collective_rpc("load_model")
ERROR 06-06 11:53:56 [core.py:515]   File "/mnt/workspace/vllm/vllm/executor/uniproc_executor.py", line 57, in collective_rpc
ERROR 06-06 11:53:56 [core.py:515]     answer = run_method(self.driver_worker, method, args, kwargs)
ERROR 06-06 11:53:56 [core.py:515]              ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
ERROR 06-06 11:53:56 [core.py:515]   File "/mnt/workspace/vllm/vllm/utils.py", line 2656, in run_method
ERROR 06-06 11:53:56 [core.py:515]     return func(*args, **kwargs)
ERROR 06-06 11:53:56 [core.py:515]            ^^^^^^^^^^^^^^^^^^^^^
ERROR 06-06 11:53:56 [core.py:515]   File "/mnt/workspace/vllm/vllm/v1/worker/gpu_worker.py", line 165, in load_model
ERROR 06-06 11:53:56 [core.py:515]     self.model_runner.load_model()
ERROR 06-06 11:53:56 [core.py:515]   File "/mnt/workspace/vllm/vllm/v1/worker/gpu_model_runner.py", line 1592, in load_model
ERROR 06-06 11:53:56 [core.py:515]     self.model = model_loader.load_model(
ERROR 06-06 11:53:56 [core.py:515]                  ^^^^^^^^^^^^^^^^^^^^^^^^
ERROR 06-06 11:53:56 [core.py:515]   File "/mnt/workspace/vllm/vllm/model_executor/model_loader/base_loader.py", line 41, in load_model
ERROR 06-06 11:53:56 [core.py:515]     self.load_weights(model, model_config)
ERROR 06-06 11:53:56 [core.py:515]   File "/mnt/workspace/vllm/vllm/model_executor/model_loader/default_loader.py", line 269, in load_weights
ERROR 06-06 11:53:56 [core.py:515]     loaded_weights = model.load_weights(
ERROR 06-06 11:53:56 [core.py:515]                      ^^^^^^^^^^^^^^^^^^^
ERROR 06-06 11:53:56 [core.py:515]   File "/mnt/workspace/vllm/vllm/model_executor/models/qwen3.py", line 321, in load_weights
ERROR 06-06 11:53:56 [core.py:515]     return loader.load_weights(weights)
ERROR 06-06 11:53:56 [core.py:515]            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^
ERROR 06-06 11:53:56 [core.py:515]   File "/mnt/workspace/vllm/vllm/model_executor/models/utils.py", line 278, in load_weights
ERROR 06-06 11:53:56 [core.py:515]     autoloaded_weights = set(self._load_module("", self.module, weights))
ERROR 06-06 11:53:56 [core.py:515]                          ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
ERROR 06-06 11:53:56 [core.py:515]   File "/mnt/workspace/vllm/vllm/model_executor/models/utils.py", line 264, in _load_module
ERROR 06-06 11:53:56 [core.py:515]     raise ValueError(msg)
ERROR 06-06 11:53:56 [core.py:515] ValueError: There is no module or parameter named 'embed_tokens' in Qwen3ForCausalLM

Before submitting a new issue...

  • Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the documentation page, which can answer lots of frequently asked questions.

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions