[CI][gpt-oss] Enable python tool tests in CI #24315

wuhang2014 · 2025-09-05T10:56:38Z

Purpose

Fix #24199

Test Plan

Test Result

Essential Elements of an Effective PR Description Checklist

The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
The test plan, such as providing test command.
The test results, such as pasting the results comparison before and after, or e2e results
(Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
(Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

gemini-code-assist

Code Review

This pull request enables Python tool tests in the CI, which is a great step forward. The changes involve un-skipping existing tests and adding necessary logic to validate the output of the code interpreter tool. My review focuses on improving the robustness of the new test logic. Specifically, I've identified a potential fragility in how the numerical result is extracted from the model's output string. The suggestions aim to make the tests less likely to fail due to minor variations in the model's response format, ensuring a more reliable CI pipeline.

tests/entrypoints/openai/test_response_api_with_harmony.py

heheda12345 · 2025-09-09T06:34:45Z

tests/entrypoints/openai/test_response_api_with_harmony.py

+                index = output_string.rfind('9', 0, last_index)
+            else:
+                index = -1
+            assert parse_integer_from_string(


What about assert "977966748" in output_string?
Also, can you use a harder question that the model can't answer unless it uses the python tool properly? E.g., multiply two very large numbers.

What about assert "977966748" in output_string?

It may not work as expected, because the output text probably contains comma separators and/or Markdown formatting for the mathematical expression.

Also, can you use a harder question that the model can't answer unless it uses the python tool properly? E.g., multiply two very large numbers.

I will try calculating the product of two large prime numbers.

I added a restriction statement to the prompt to simplify the output text and make the assertion easier.
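To illustrate the concern about comma separators and Markdown in the output text, here is a minimal sketch of a separator-tolerant check. The helper name `contains_number` is hypothetical and is not the `parse_integer_from_string` helper from the diff; it only assumes the model may emit `,`, `_`, or LaTeX `{,}` between digits.

```python
import re


def contains_number(output_text: str, expected: str) -> bool:
    """Return True if `expected` appears as a complete digit run in the text.

    Hypothetical helper: grouping separators between digits are removed
    first, so "977,966,748" and "977{,}966{,}748" both match "977966748".
    """
    # Remove a separator only when it sits directly between two digits.
    normalized = re.sub(r"(?<=\d)(?:\{,\}|[,_])(?=\d)", "", output_text)
    # Compare against whole digit runs so a longer number does not match.
    return expected in re.findall(r"\d+", normalized)
```

Comparing against whole digit runs rather than using a plain substring test avoids accepting a longer number that merely contains the expected digits.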

heheda12345 · 2025-09-09T18:22:48Z

tests/entrypoints/openai/test_response_api_with_harmony.py

+               "`var_a=9999999967*9999999769` "
+               "`var_b=9999999943*9999999781` "
+               "`var_c=9999999929*9999999787`. "
+               "Show only the sorted variable names with `<` using ascii."),


Are you sure the test will fail without "--tool-server", "demo"?

No, you are right; the model may guess the correct answer.

I made a new prompt that asks the model to infer the decimal digits of the cube root of a large number, and I tried it with and without the tool server. As expected, it returns the correct decimal digits with the tool server; without the tool server it returns incorrect numbers and sometimes spends a very long time reasoning.
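For reference, the expected digits for such a prompt can be computed exactly with integer arithmetic. This is a minimal sketch, not the code used in the test; the function names are hypothetical, and the actual number used in the prompt is not shown in this thread.

```python
def integer_cube_root(n: int) -> int:
    """Return floor(n ** (1/3)) for a non-negative integer, without floats."""
    if n < 0:
        raise ValueError("n must be non-negative")
    # Find an upper bound by doubling, then binary-search the root.
    hi = 1
    while hi ** 3 <= n:
        hi *= 2
    lo = hi // 2
    # Invariant: lo**3 <= n < hi**3
    while hi - lo > 1:
        mid = (lo + hi) // 2
        if mid ** 3 <= n:
            lo = mid
        else:
            hi = mid
    return lo


def cube_root_decimals(n: int, places: int) -> str:
    """Return the first `places` decimal digits of the cube root of n."""
    if places < 1:
        raise ValueError("places must be at least 1")
    # Scaling n by 10**(3*places) scales its cube root by 10**places.
    scaled = integer_cube_root(n * 10 ** (3 * places))
    return str(scaled).rjust(places + 1, "0")[-places:]
```

Because the result is truncated rather than rounded, an assertion built on it should check the leading digits only.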

wuhang2014 · 2025-09-10T08:42:13Z

Depends on #182

heheda12345 · 2025-09-10T17:35:59Z

Do you need to install gpt-oss to run this test?

heheda12345 · 2025-09-10T18:12:51Z

Also CC @yeqcharlotte

wuhang2014 · 2025-09-11T01:25:41Z

Do you need to install gpt-oss to run this test?

I think so. It seems that gpt-oss is not listed in any requirements txt file.
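Until a dependency is declared, a test can check whether it is importable before running. The sketch below uses only the standard library; the helper `has_module` is hypothetical, and the import name `gpt_oss` is an assumption about how the gpt-oss package is exposed.

```python
import importlib.util


def has_module(name: str) -> bool:
    """Return True if a top-level module with this name can be found."""
    return importlib.util.find_spec(name) is not None


# Assumed import name for the gpt-oss package; adjust if it differs.
GPT_OSS_AVAILABLE = has_module("gpt_oss")
```

A test module could use such a flag to skip the python tool tests when the package is missing instead of failing at import time.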

heheda12345 · 2025-09-11T04:55:24Z

vllm/entrypoints/tool.py

@@ -13,6 +13,28 @@
 logger = init_logger(__name__)


+def patch_call_python_script_with_uv():


Can you do this in unit test?

No, because the python tool is called in a separate process (namely a RemoteOpenAIServer). I don't think we have a way to patch across processes.
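The limitation described here can be reproduced with the standard library alone. In this minimal sketch (the function name is hypothetical and unrelated to vLLM), a patch applied with `unittest.mock` in the test process is not visible to a freshly started interpreter, which is the same situation as a server launched in its own process.

```python
import string
import subprocess
import sys
from unittest import mock


def child_view_while_parent_is_patched() -> str:
    """Patch `string.digits` in this process, then read it from a child."""
    with mock.patch.object(string, "digits", "patched"):
        # The patch is active here, in the parent process only.
        assert string.digits == "patched"
        result = subprocess.run(
            [sys.executable, "-c", "import string; print(string.digits)"],
            capture_output=True,
            text=True,
            check=True,
        )
    # The child imported its own unpatched copy of the module.
    return result.stdout.strip()
```

This is why the patch has to be applied by code that runs inside the server process rather than by the unit test itself.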

heheda12345

Thanks for your contribution!

heheda12345 · 2025-09-16T02:09:29Z

@wuhang2014 CI failure is related. Can you take a look?

FAILED entrypoints/openai/test_response_api_with_harmony.py::test_code_interpreter[openai/gpt-oss-20b] - AssertionError: assert '5846' in '5847'

wuhang2014 · 2025-09-16T03:10:54Z

@wuhang2014 CI failure is related. Can you take a look?

FAILED entrypoints/openai/test_response_api_with_harmony.py::test_code_interpreter[openai/gpt-oss-20b] - AssertionError: assert '5846' in '5847'

Sure. I think gpt-oss does not handle empty text returned from the python tool well. Maybe we can return a hint such as "Empty result from python tool" instead; I will try it later.
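The idea of substituting a hint for an empty tool result could look roughly like the sketch below. The function and constant names are hypothetical, and this is not the change that was eventually merged; only the hint wording comes from the comment above.

```python
from typing import Optional

# Wording taken from the suggestion in this thread.
EMPTY_RESULT_HINT = "Empty result from python tool"


def tool_text_or_hint(tool_output: Optional[str]) -> str:
    """Return the tool output, or an explicit hint when it is empty."""
    if tool_output is None or tool_output.strip() == "":
        return EMPTY_RESULT_HINT
    return tool_output
```

An explicit message gives the model something concrete to react to, for example a script that computed a value but never printed it.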

mergify · 2025-10-05T14:20:49Z

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @wuhang2014.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

heheda12345

LGTM! Thank you.

heheda12345 · 2025-10-09T07:12:56Z

@wangxiyuan sorry for that. It is fixed in #26392

wangxiyuan · 2025-10-09T07:48:56Z

@heheda12345 thanks for the quick fix. It makes sense.

wuhang2014 requested review from DarkLight1337, aarnphm, robertgshaw2-redhat and simon-mo as code owners September 5, 2025 10:56

gemini-code-assist bot reviewed Sep 5, 2025

heheda12345 reviewed Sep 9, 2025

wuhang2014 requested a review from heheda12345 September 10, 2025 03:34

wuhang2014 force-pushed the pythontooltests branch from b2d94cd to 4435a7a Compare September 11, 2025 03:28

mergify bot added the frontend label Sep 11, 2025

heheda12345 reviewed Sep 11, 2025

heheda12345 approved these changes Sep 12, 2025

heheda12345 enabled auto-merge (squash) September 12, 2025 04:23

github-actions bot added the ready label (ONLY add when PR is ready to merge/full CI is needed) Sep 12, 2025

heheda12345 changed the title ~~[Test]Enable python tool tests in CI~~ [gpt-oss] Enable python tool tests in CI Sep 12, 2025

heheda12345 disabled auto-merge September 12, 2025 04:25

mergify bot added the gpt-oss label (Related to GPT-OSS models) Sep 12, 2025

heheda12345 changed the title ~~[gpt-oss] Enable python tool tests in CI~~ [CI][gpt-oss] Enable python tool tests in CI Sep 12, 2025

heheda12345 enabled auto-merge (squash) September 12, 2025 04:25

yeqcharlotte added this to gpt-oss Issues & Enhancements Sep 14, 2025

github-project-automation bot moved this to To Triage in gpt-oss Issues & Enhancements Sep 14, 2025

yeqcharlotte moved this from To Triage to In progress in gpt-oss Issues & Enhancements Sep 14, 2025

heheda12345 mentioned this pull request Sep 15, 2025

[Frontend] Responses API MCP tools for built in tools and to pass through headers #24628

Merged

auto-merge was automatically disabled September 22, 2025 03:04
Head branch was pushed to by a user without write access

wuhang2014 added 4 commits October 3, 2025 22:26

enable python tool tests in CI

a9b77fb

Signed-off-by: wuhang <wuhang6@huawei.com>

enable python tool tests in CI

11eed8d

Signed-off-by: wuhang <wuhang6@huawei.com>

enable python tool tests in CI

dd66083

Signed-off-by: wuhang <wuhang6@huawei.com>

verify python tool in CI

1ea4ad4

Signed-off-by: wuhang <wuhang6@huawei.com>

wuhang2014 force-pushed the pythontooltests branch from 7c2b30e to 1ea4ad4 Compare October 3, 2025 14:26

mergify bot added the ci/build label Oct 5, 2025

wuhang2014 force-pushed the pythontooltests branch from bf6c410 to 5f91521 Compare October 5, 2025 04:59

add gpt-oss as a common requirement

fb93699

Signed-off-by: wuhang <wuhang6@huawei.com>

wuhang2014 force-pushed the pythontooltests branch from 5f91521 to fb93699 Compare October 5, 2025 11:57

mergify bot added the needs-rebase label Oct 5, 2025

Merge branch 'main' into pythontooltests

1523b06

Signed-off-by: wuhang <wuhang6@huawei.com>

mergify bot removed the needs-rebase label Oct 5, 2025

wuhang2014 added 2 commits October 5, 2025 15:18

fix pre-commit errors

7828899

Signed-off-by: wuhang <wuhang6@huawei.com>

Merge branch 'main' into pythontooltests

04d6d57

heheda12345 approved these changes Oct 6, 2025

github-project-automation bot moved this from In progress to Ready in gpt-oss Issues & Enhancements Oct 6, 2025

heheda12345 enabled auto-merge (squash) October 6, 2025 03:47

heheda12345 merged commit 91ac7f7 into vllm-project:main Oct 6, 2025
85 checks passed

github-project-automation bot moved this from Ready to Done in gpt-oss Issues & Enhancements Oct 6, 2025

karan pushed a commit to karan/vllm that referenced this pull request Oct 6, 2025

[CI][gpt-oss] Enable python tool tests in CI (vllm-project#24315)

2e44529

Signed-off-by: wuhang <wuhang6@huawei.com> Signed-off-by: Karan Goel <3261985+karan@users.noreply.github.com>

southfreebird pushed a commit to southfreebird/vllm that referenced this pull request Oct 7, 2025

[CI][gpt-oss] Enable python tool tests in CI (vllm-project#24315)

fb2ecc9

Signed-off-by: wuhang <wuhang6@huawei.com>

kouroshHakha mentioned this pull request Oct 8, 2025

[dependency] Add conditional dependency for gpt-oss to maintain Python 3.10 minimum version requirement. #26444

Closed

wuhang2014 deleted the pythontooltests branch October 9, 2025 06:24

xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request Oct 10, 2025

[CI][gpt-oss] Enable python tool tests in CI (vllm-project#24315)

3a885e1

Signed-off-by: wuhang <wuhang6@huawei.com> Signed-off-by: xuebwang-amd <xuebwang@amd.com>

lywa1998 pushed a commit to lywa1998/vllm that referenced this pull request Oct 20, 2025

[CI][gpt-oss] Enable python tool tests in CI (vllm-project#24315)

85d36be

Signed-off-by: wuhang <wuhang6@huawei.com>

alhridoy pushed a commit to alhridoy/vllm that referenced this pull request Oct 24, 2025

[CI][gpt-oss] Enable python tool tests in CI (vllm-project#24315)

c6d6504

Signed-off-by: wuhang <wuhang6@huawei.com>

xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request Oct 24, 2025

[CI][gpt-oss] Enable python tool tests in CI (vllm-project#24315)

bfc87d9

Signed-off-by: wuhang <wuhang6@huawei.com> Signed-off-by: xuebwang-amd <xuebwang@amd.com>
