You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
commit ad717d4
Author: Richard Zou <zou3519@gmail.com>
Date: Wed Oct 15 16:29:49 2025 -0700
[BugFix] Work around graph partition x torch.compile cache issue
In PyTorch 2.9, torch.compile has a bug where the graph
partition is not taken into account during caching.
Because vLLM's Mode.VLLM_COMPILE is the only mode that uses
Inductor graph partition, and VLLM_COMPILE implies there
is a PostGradPassManager, we put the list of operators to graph
partition into the PostGradPassManager's uuid (which
then gets incorporated into Inductor's FX graph cache key).
Remove this hack whenever torch.compile fixes it.
Signed-off-by: Richard Zou <zou3519@gmail.com>
Signed-off-by: ProExpertProg <lgovedic@redhat.com>
0 commit comments