Conversation

Copilot AI commented Oct 15, 2025

Plan to Fix DTensor isinstance Check Issue

  • Add use_dtensor_execute attribute to FusionDefinitionWrapper class
  • Determine use_dtensor_execute value at trace construction time in create_fusion_definition_wrapper
  • Update FusionDefinitionWrapper.__call__ to use use_dtensor_execute instead of runtime isinstance check
  • Fix potential NameError when torch.distributed is not available
  • Add tests for the changes
  • Fix test to only run for nvfuser executor

Summary

Successfully moved the DTensor type check from runtime to trace construction time in the nvFuser executor. The changes ensure that the dispatch to DTensor-based execution is determined by the proxy types in the trace rather than by checking runtime arguments.

Key Changes

  1. Added use_dtensor_execute field to FusionDefinitionWrapper (line 537):

    • Boolean field that determines at initialization whether DTensor execution path should be used
    • Defaults to False for backward compatibility
  2. Compute use_dtensor_execute at trace construction (line 623):

    • In create_fusion_definition_wrapper, check whether any of the sorted_unique_inputs is a DTensorProxy
    • This check happens once during fusion definition creation, not on every call
  3. Use pre-computed flag instead of runtime check (line 556):

    • Changed from: if dist.is_available() and any(isinstance(t, torch.distributed.tensor.DTensor) for t in args)
    • Changed to: if self.use_dtensor_execute
    • Eliminates the need to check every argument at runtime (see the sketch after this list)
  4. Fixed guard for distributed availability (line 548):

    • Changed from: if dist.is_available()
    • Changed to: if IS_TORCH_DISTRIBUTED_AVAILABLE
    • Prevents potential NameError when torch.distributed is not available
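
The following is a minimal, self-contained sketch of the shape of these changes, not the actual implementation in Thunder's nvFuser executor: the real FusionDefinitionWrapper and create_fusion_definition_wrapper carry many more fields and arguments, and the execute callables below are stand-ins.

    from dataclasses import dataclass
    from typing import Any, Callable, Sequence


    class DTensorProxy:
        """Stand-in for Thunder's DTensorProxy; only here to keep the sketch self-contained."""


    @dataclass
    class FusionDefinitionWrapper:
        # Greatly simplified: the real wrapper also holds the nvFuser fusion definition, caches, etc.
        execute: Callable[[Sequence[Any]], Any]
        execute_with_dtensors: Callable[[Sequence[Any]], Any]
        # Decided once at trace construction time; defaults to False for backward compatibility.
        use_dtensor_execute: bool = False

        def __call__(self, args: Sequence[Any]) -> Any:
            # Before: dist.is_available() and any(isinstance(t, DTensor) for t in args)
            # After: a single pre-computed flag, so no per-argument isinstance check at runtime
            if self.use_dtensor_execute:
                return self.execute_with_dtensors(args)
            return self.execute(args)


    def create_fusion_definition_wrapper(sorted_unique_inputs, execute, execute_with_dtensors):
        # The dispatch decision is made once, from the proxy types recorded in the trace
        use_dtensor_execute = any(isinstance(p, DTensorProxy) for p in sorted_unique_inputs)
        return FusionDefinitionWrapper(execute, execute_with_dtensors, use_dtensor_execute)

Computing the flag once keeps the hot path to a single attribute check, and because the decision comes from the trace's proxy types it matches what the trace was specialized for rather than whatever happens to be passed at call time.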

Tests Added

  1. test_use_dtensor_execute_flag in test_nvfuser.py:

    • Tests that use_dtensor_execute is False for regular tensor inputs
    • Verifies the flag is accessible from the FusionDefinitionWrapper
  2. test_use_dtensor_execute_flag_with_dtensor in test_dtensor.py:

    • Tests that use_dtensor_execute is True for DTensor inputs
    • Ensures the flag correctly reflects DTensor usage
    • Updated: now skips for non-nvFuser executors to avoid false assertion failures (see the simplified illustration after this list)
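
Continuing the simplified sketch after the Key Changes list above (an illustration of the flag's intended behavior, not the actual Thunder tests), the construction-time decision can be exercised like this:

    # Regular (non-DTensor) proxies leave the flag False, so the regular path runs.
    wrapper = create_fusion_definition_wrapper(
        sorted_unique_inputs=["t0", "t1"],  # plain placeholders, no DTensorProxy
        execute=lambda args: "regular path",
        execute_with_dtensors=lambda args: "dtensor path",
    )
    assert wrapper.use_dtensor_execute is False
    assert wrapper(["a", "b"]) == "regular path"

    # A DTensorProxy among the trace inputs flips the flag at construction time.
    wrapper = create_fusion_definition_wrapper(
        sorted_unique_inputs=[DTensorProxy()],
        execute=lambda args: "regular path",
        execute_with_dtensors=lambda args: "dtensor path",
    )
    assert wrapper.use_dtensor_execute is True
    assert wrapper([object()]) == "dtensor path"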

Benefits

  • Performance: Eliminates runtime isinstance checks on all fusion inputs
  • Correctness: Dispatch decision based on trace types, not runtime types
  • Maintainability: More explicit about when the dispatch decision is made
  • Robustness: Fixed potential NameError when distributed is not available
Original prompt

This section details the original issue you should resolve

<issue_title>Do not perform DTensor isinstance check for all nvFuser inputs at runtime</issue_title>
<issue_description>## 🐛 Bug

Current nvFuser executor in Thunder checks every argument at runtime if it's DTensor or not. This check should be performed at trace construction time because the dispatch to DTensor-based execution or not should be done solely based on the types recorded in the trace.

Place where DTensor check is performed:

if dist.is_available() and any(isinstance(t, torch.distributed.tensor.DTensor) for t in args):

This would involve updating the construction of FusionDefinitionWrapper to include a new attribute "use_dtensor_execute" (or some better name) to control the dispatch.</issue_description>

Comments on the Issue (you are @copilot in this section)

Fixes #2651



Copilot AI changed the title [WIP] Fix DTensor isinstance check for nvFuser inputs Move DTensor isinstance check from runtime to trace construction time in nvFuser executor Oct 15, 2025
Copilot AI requested a review from IvanYashchuk October 15, 2025 14:44
@IvanYashchuk

@copilot, the changes look good. Please add a test for this change.


Copilot AI commented Oct 15, 2025

@copilot, the changes look good. Please add a test for this change.

Added tests in commit 2c67caa:

  1. test_use_dtensor_execute_flag in test_nvfuser.py - verifies the flag is False for regular tensors
  2. test_use_dtensor_execute_flag_with_dtensor in test_dtensor.py - verifies the flag is True for DTensor inputs

Both tests check that the use_dtensor_execute attribute is correctly set at trace construction time based on the input proxy types.


@IvanYashchuk IvanYashchuk left a comment


That's exactly what I wanted.

@IvanYashchuk IvanYashchuk marked this pull request as ready for review October 15, 2025 15:18
@IvanYashchuk

@kshitij12345, do you agree with the change?



@common_utils.parametrize("executor", tuple(executors_map.keys()))
def test_use_dtensor_execute_flag_with_dtensor(self, executor):

@kshitij12345 kshitij12345 Oct 17, 2025


@copilot, this test should run only for the nvfuser executor, otherwise assert len(fusions) > 0 will be false

Collaborator


@copilot, apply the suggestion above.

Contributor Author


Applied the suggestion in commit 0ed6e23. The test now skips for non-nvfuser executors to avoid assertion failures when there are no fusions.

Copilot AI requested a review from IvanYashchuk October 21, 2025 14:58
@IvanYashchuk

@copilot test test are you active?
