Skip to content

Conversation

BoyuanFeng
Copy link
Contributor

@BoyuanFeng BoyuanFeng commented Oct 18, 2025

A pytorch 2.9 monkey patch to fix test_attn_quant. See pytorch/pytorch#165815

Copy link
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request introduces a monkey patch for get_graph_partition_signature in PyTorch 2.9 to address a bug related to graph partitioning, which fixes the test_attn_quant test. The changes in the test file reflect an update to the logging infrastructure. The core change in vllm/env_override.py seems correct in its intent to backport a fix. However, there is a misleading comment for the new monkey patch that should be corrected to improve maintainability.

)

with caplog_mp_spawn(logging.DEBUG) as log_holder:
with caplog_vllm.at_level(logging.DEBUG):
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This test does not use mp. caplog_mp_spawn would also fail for dynamo graph partition.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I wonder if that's just on your machine - for me even tp=1 tests used MP. I'll enable CI and we can see. But I think we should probably just do both, and we can investigate later

Signed-off-by: Boyuan Feng <[email protected]>
)

with caplog_mp_spawn(logging.DEBUG) as log_holder:
with caplog_vllm.at_level(logging.DEBUG):
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I wonder if that's just on your machine - for me even tp=1 tests used MP. I'll enable CI and we can see. But I think we should probably just do both, and we can investigate later

@ProExpertProg ProExpertProg added the ready ONLY add when PR is ready to merge/full CI is needed label Oct 18, 2025
Signed-off-by: Boyuan Feng <[email protected]>
@BoyuanFeng
Copy link
Contributor Author

@ProExpertProg yes we should use caplog_mp_spawn. I was using VLLM_ENABLE_V1_MULTIPROCESSING=0 for debugging. I guess that's the reason.

@ProExpertProg ProExpertProg merged commit e133d6d into vllm-project:main Oct 18, 2025
45 checks passed
lywa1998 pushed a commit to lywa1998/vllm that referenced this pull request Oct 20, 2025
adabeyta pushed a commit to adabeyta/vllm that referenced this pull request Oct 20, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ready ONLY add when PR is ready to merge/full CI is needed

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[Bug]: Another issue with Inductor partition codegen for attn+nvfp4 quant fusion

2 participants