[3/N][CI/UT] add spec decode e2e UT #487

mengwei805 · 2025-04-09T01:42:05Z

What this PR does / why we need it?

add test_eagle_correctness.py; use local model weights;
add test_mtp_correctness.py; use bf16 model weights;

Does this PR introduce any user-facing change?

None

How was this patch tested?

Local verification passed

Co-authored-by: mengwei805 <mengwei25@huawei.com> Co-authored-by: MengqingCao <cmq0113@163.com> Signed-off-by: mengwei805 <mengwei25@huawei.com>

MengqingCao · 2025-04-11T09:06:16Z

.github/workflows/vllm_ascend_test.yaml

@@ -148,7 +148,7 @@ jobs:
      - name: Run vllm-project/vllm-ascend key feature test
        if: steps.filter.outputs.speculative_tests_changed
        run: |
-          pytest -sv tests/spec_decode
+          pytest -sv tests/spec_decode/e2e/test_eagle_correctness.py::test_llama2_eagle_e2e_greedy_correctness


todo: Revert this when successfully download llama model

OK, the modification here is just for debugging. I will restore it after debugging is successful.

Signed-off-by: mengwei805 <mengwei25@huawei.com>

github-actions bot added the module:tests label Apr 9, 2025

mengwei805 force-pushed the v0.7.3-sd-ut-part3 branch 7 times, most recently from 63f049c to 7c03c68 Compare April 11, 2025 06:09

[3/N][CI/UT] add spec decode e2e UT

21b49d7

Co-authored-by: mengwei805 <mengwei25@huawei.com> Co-authored-by: MengqingCao <cmq0113@163.com> Signed-off-by: mengwei805 <mengwei25@huawei.com>

mengwei805 force-pushed the v0.7.3-sd-ut-part3 branch from 7c03c68 to 21b49d7 Compare April 11, 2025 08:24

MengqingCao mentioned this pull request Apr 11, 2025

[CI] add hf-token for llama model download #507

Open

MengqingCao self-assigned this Apr 11, 2025

MengqingCao reviewed Apr 11, 2025

View reviewed changes

mengwei805 added 3 commits April 14, 2025 12:01

fix

6185356

Signed-off-by: mengwei805 <mengwei25@huawei.com>

fix mtp

eb10882

Signed-off-by: mengwei805 <mengwei25@huawei.com>

test mtp

e07137d

Signed-off-by: mengwei805 <mengwei25@huawei.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[3/N][CI/UT] add spec decode e2e UT #487

[3/N][CI/UT] add spec decode e2e UT #487

mengwei805 commented Apr 9, 2025 •

edited

Loading

MengqingCao Apr 11, 2025

mengwei805 Apr 14, 2025

[3/N][CI/UT] add spec decode e2e UT #487

Are you sure you want to change the base?

[3/N][CI/UT] add spec decode e2e UT #487

Conversation

mengwei805 commented Apr 9, 2025 • edited Loading

What this PR does / why we need it?

Does this PR introduce any user-facing change?

How was this patch tested?

MengqingCao Apr 11, 2025

Choose a reason for hiding this comment

mengwei805 Apr 14, 2025

Choose a reason for hiding this comment

mengwei805 commented Apr 9, 2025 •

edited

Loading