Skip to content

Conversation

@BoyuanFeng
Copy link
Contributor

@BoyuanFeng BoyuanFeng commented Oct 16, 2025

This PR fixes a inductor graph partition test.

The error happened since silly attention is NOT partitioned. Why? Because #25845 lets inductor partition respect splitting_ops and the test splitting_ops=[] says don't partition on any ops.

Signed-off-by: Boyuan Feng <boyuan@meta.com>
Copy link
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request correctly fixes a failing test for inductor graph partitioning by ensuring that splitting_ops is properly configured. The test was failing because a recent change made inductor respect splitting_ops, and the test was passing an empty list, which disabled partitioning. My main feedback is to remove an outdated and misleading comment in the test code that contradicts the new behavior.

monkeypatch.setenv("VLLM_DISABLE_COMPILE_CACHE", "1")

_run_simple_model(
# Inductor graph partition automatically resets splitting_ops to an empty list
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

high

This comment appears to be outdated and incorrect. The pull request description explains that inductor partitioning now respects splitting_ops, and the test was failing with splitting_ops=[] because it prevented partitioning. This comment claims the opposite, that splitting_ops is automatically reset to an empty list. To avoid confusion for future developers, this comment should be removed.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can you remove the comment?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

yes cleaned

Signed-off-by: Boyuan Feng <boyuan@meta.com>
ProExpertProg added a commit to neuralmagic/vllm that referenced this pull request Oct 16, 2025
commit 41590e7
Author: Boyuan Feng <boyuan@meta.com>
Date:   Thu Oct 16 13:25:18 2025 -0700

    nit

    Signed-off-by: Boyuan Feng <boyuan@meta.com>

commit 8393288
Author: Boyuan Feng <boyuan@meta.com>
Date:   Thu Oct 16 13:04:14 2025 -0700

    fix simple inductor graph partition test

    Signed-off-by: Boyuan Feng <boyuan@meta.com>

Signed-off-by: ProExpertProg <lgovedic@redhat.com>
Copy link
Collaborator

@ProExpertProg ProExpertProg left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Added to #27050

@ProExpertProg ProExpertProg added the ready ONLY add when PR is ready to merge/full CI is needed label Oct 16, 2025
@ProExpertProg ProExpertProg merged commit 17c540a into vllm-project:main Oct 17, 2025
21 checks passed
@ProExpertProg
Copy link
Collaborator

Passed tests in CI, merged!

@ProExpertProg ProExpertProg changed the title fix simple inductor graph partition test [torch.compile] fix simple inductor graph partition test Oct 17, 2025
Zhuul pushed a commit to Zhuul/vllm that referenced this pull request Oct 17, 2025
lywa1998 pushed a commit to lywa1998/vllm that referenced this pull request Oct 20, 2025
albertoperdomo2 pushed a commit to albertoperdomo2/vllm that referenced this pull request Oct 23, 2025
…t#27050)

Signed-off-by: Boyuan Feng <boyuan@meta.com>
Signed-off-by: Alberto Perdomo <aperdomo@redhat.com>
alhridoy pushed a commit to alhridoy/vllm that referenced this pull request Oct 24, 2025
xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request Oct 24, 2025
…t#27050)

Signed-off-by: Boyuan Feng <boyuan@meta.com>
Signed-off-by: xuebwang-amd <xuebwang@amd.com>
xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request Oct 24, 2025
…t#27050)

Signed-off-by: Boyuan Feng <boyuan@meta.com>
Signed-off-by: xuebwang-amd <xuebwang@amd.com>
0xrushi pushed a commit to 0xrushi/vllm that referenced this pull request Oct 26, 2025
…t#27050)

Signed-off-by: Boyuan Feng <boyuan@meta.com>
Signed-off-by: 0xrushi <6279035+0xrushi@users.noreply.github.com>
0xrushi pushed a commit to 0xrushi/vllm that referenced this pull request Oct 26, 2025
…t#27050)

Signed-off-by: Boyuan Feng <boyuan@meta.com>
Signed-off-by: 0xrushi <6279035+0xrushi@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ready ONLY add when PR is ready to merge/full CI is needed

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants