Clean up traces for symbolic values caching #2662

beverlylytle · 2025-10-16T11:10:46Z

When the symbolic values caching option is enabled, many duplicated calls to prims.eq and prims.shape appear in any given bsym's subsymbols. DCE is currently applied before the descent into the subsymbols, so once that descent happens the resulting trace is very ugly and hard to read. This PR applies DCE to the bsym's subsymbols earlier on to tidy things up.
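As a rough illustration of what applying DCE to a list of subsymbols means, here is a minimal sketch. The `Op` class and `toy_dce` function are hypothetical stand-ins invented for this example; they are not Thunder's `BoundSymbol` or its `dce`, and the operation names are used only as labels.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Op:
    """Hypothetical stand-in for a bound symbol: output name, callee label, input names."""
    out: str
    fn: str
    args: tuple


def toy_dce(ops, needed):
    """Keep only ops whose output is (transitively) needed, preserving order."""
    live = set(needed)
    kept = []
    # Walk backwards so that consumers are seen before their producers.
    for op in reversed(ops):
        if op.out in live:
            kept.append(op)
            live.update(op.args)
    kept.reverse()
    return kept


# Duplicated shape/eq checks whose results nobody consumes, plus the real work.
subsymbols = [
    Op("i0", "prims.shape", ("a",)),
    Op("b0", "prims.eq", ("i0", "n")),
    Op("i1", "prims.shape", ("a",)),
    Op("b1", "prims.eq", ("i1", "n")),
    Op("t1", "prims.add", ("a", "b")),
]
cleaned = toy_dce(subsymbols, needed={"t1"})
```

In this toy setting only the `prims.add` entry survives, which is the kind of tidying the PR is after.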

Fixes #2728

beverlylytle · 2025-11-12T12:46:55Z

The initial draft in e7c8bc9 applied DCE to subsymbols within remove_duplicate_number_proxies, a function whose job is to clean up after symbolic values. In cb31f7f, DCE is instead applied within Symbol.__call__ when the subsymbols are created. Unfortunately, this requires a local import of dce because of dependency hell. I rather prefer the idea of all the symbolic values tidying up happening in one method.

Copilot

Pull Request Overview

This PR applies Dead Code Elimination (DCE) earlier in the process to clean up duplicated calls to prims.eq and prims.shape in subsymbols when symbolic values caching is enabled. This improves trace readability by removing redundant operations before they accumulate.

Extended the dce function to accept either a Trace or a list of BoundSymbolInterface objects
Applied DCE to a bsym's subsymbols immediately after execution to remove duplicates
Refactored output handling to support both trace-level and subsymbol-level DCE

Reviewed Changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.

File	Description
thunder/core/transform_common.py	Extended `dce` function to handle both traces and bound symbol lists, with conditional logic for each type
thunder/core/symbol.py	Added DCE call to clean up subsymbols immediately after execution, removed trailing whitespace

Comments suppressed due to low confidence (1)

thunder/core/transform_common.py:157

This assignment assigns a variable to itself.

        output = output

thunder/core/transform_common.py

IvanYashchuk

Moving DCE to symbol creation results in much tidier outputs right where most contributors and users will inspect them. This is a targeted and elegant change, thank you for addressing technical debt so precisely!

thunder/core/symbol.py

thunder/core/transform_common.py

beverlylytle · 2025-11-12T16:51:22Z

I'm not convinced that Symbol.__call__ is the right place for this DCE. The dynamo tests are failing non-trivially, and they weren't with the other version.

I don't have a minimal failing example yet, but I do see in more complex tests that fusions are missing required bound symbols.
I think this has something to do with the fact that results is the output of a symbol's meta, and not of the function itself. I am seeing things like check_len, whose meta should return None, having its subsymbol prims.check_len eliminated because of that. Of course, that doesn't explain why fusions are being gutted.

Things are still failing when I move the dce(subsymbols) line below to after the results are finalized.
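To make the check_len observation concrete, here is a minimal sketch of how a DCE pass driven only by outputs drops an operation that produces nothing, and how an explicit allow-list of side-effecting operations would keep it. The function `dce_with_effects`, the tuple layout, and the `side_effect_fns` parameter are hypothetical and are not Thunder's implementation.

```python
def dce_with_effects(ops, needed, side_effect_fns=frozenset()):
    """Toy DCE over (out, fn, args) tuples; ops listed in side_effect_fns are always kept."""
    live = set(needed)
    kept = []
    for out, fn, args in reversed(ops):
        if out in live or fn in side_effect_fns:
            kept.append((out, fn, args))
            live.update(args)
    return kept[::-1]


ops = [
    (None, "prims.check_len", ("xs", "n")),  # returns None, exists only for its check
    ("t0", "prims.add", ("a", "b")),
]

# Driven purely by outputs, the check disappears.
naive = dce_with_effects(ops, needed={"t0"})

# With the check marked as side-effecting, it survives.
guarded = dce_with_effects(ops, needed={"t0"}, side_effect_fns={"prims.check_len"})
```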

beverlylytle · 2025-11-13T14:08:16Z

I don't have it fully figured out yet, but I'm narrowing in on the problem. Consider

import torch
from thunder.dynamo import thunderfx

def foo(x):
    y = torch.cos(x) + torch.sin(x)
    return y

x = torch.randn(10, 10, requires_grad=True)

jfoo = thunderfx(foo)
jfoo(x)

No error is surfaced when running this code. However, it does erroneously result in a split graph. When the splitter is testing the node that has the add operation, a KeyError about 't1' is thrown and caught, resulting in the split. Looking more carefully at this node, one sees that the bound symbol it is attempting to execute is t1 = ltorch.add(t0, t1). Note the name collision between the output and the args. I can add a hack at the top of Symbol.__call__ that adds unseen arg names to the trace:

        flat_args, _ = tree_flatten((args, kwargs))
        for arg in flat_args:
            if not hasattr(arg, 'name') or trace.has_name(arg.name):
                continue
            trace.add_name(arg.name)

and the graph is no longer split. But obviously this isn't good. I can't figure out how dce'ing the subsymbols results in this name collision. The only time the dce'ing changes a list of subsymbols in the above example is for ltorch.add where it starts with [i0 = prims.ne(alpha, 1), t1 = prims.add(a, b)], but dce removes the prims.ne.

@IvanYashchuk What do you make of this? Is it worth pulling more on this thread? Or can we go back to having the DCE happen in remove_duplicate_number_proxies?

IvanYashchuk · 2025-11-13T14:42:38Z

I think it's worth looking into further. What part of the codebase is generating t1 = ltorch.add(t0, t1)? It shouldn't happen. For normal non-view non-inplace operations, the output name should be new. This might be surfacing a hidden bug that is worth fixing.

beverlylytle · 2025-11-14T14:39:28Z

thunder/dynamo/utils.py

        # We need to be under trace context to generate proxies.
-        with thunder.core.trace.tracectx(TraceCtx()):
+        with thunder.core.trace.tracectx(tracectx):
            try:
                function_to_run(*proxy_args, **proxy_kwargs)


This was the source of all of my woes. This pattern implicitly binds the provided proxy_args to the symbols being generated, as opposed to mapping the input args to the input args an established trace expects. But the proxy_args were being created in a distinct TraceCtx, and this resulted in name collisions. The problem was revealed by an application of DCE that sits way, way down in the call stack when executing function_to_run. DCE creates a producer map, and accessing this map raised a KeyError, which triggered a graph split.
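The collision can be sketched with a toy name registry. `ToyTrace`, `make_name`, and `add_name` below are hypothetical and only mimic the idea of a trace context handing out unique names; they are not Thunder's `TraceCtx`.

```python
class ToyTrace:
    """Hypothetical trace that hands out names t0, t1, ... and remembers the ones it knows."""

    def __init__(self):
        self.names = set()
        self._counter = 0

    def add_name(self, name):
        self.names.add(name)

    def make_name(self):
        # Skip over any name this trace already knows about.
        while f"t{self._counter}" in self.names:
            self._counter += 1
        name = f"t{self._counter}"
        self.names.add(name)
        return name


# Input proxies are named in one trace context...
args_trace = ToyTrace()
x0, x1 = args_trace.make_name(), args_trace.make_name()

# ...but outputs are named in a fresh one that has never seen those names.
fresh_trace = ToyTrace()
collided = fresh_trace.make_name()

# Reusing the same context (or registering the arg names first) avoids the clash.
shared_out = args_trace.make_name()
patched_trace = ToyTrace()
for name in (x0, x1):
    patched_trace.add_name(name)
patched_out = patched_trace.make_name()
```

Under these assumptions the fresh context reuses an input's name for an output, while the shared context and the patched context both produce a new one. The patched variant corresponds to the hack described earlier in the thread; the shared variant corresponds to the one-line fix in the diff above.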

thunder/executors/nvfuserex_impl.py

thunder/tests/test_core.py

shino16

This is a desired one! Thank you @beverlylytle

thunder/core/symbol.py

thunder/core/transform_common.py

thunder/tests/test_core.py

thunder/core/transform_common.py

shino16

Thank you!

beverlylytle · 2025-12-02T14:17:35Z

@KaelanDt This is ready for your review. Thanks!

Apply DCE to subsymbols

e7c8bc9

IvanYashchuk mentioned this pull request Nov 12, 2025

Simplify trace representation with shape symbolic values enabled #2728

Open

beverlylytle added 2 commits November 12, 2025 14:11

Merge branch 'main' into bl/dce_subsymbols

28408f7

try in symbol.__call__ instead

cb31f7f

beverlylytle changed the title ~~WIP~~ Clean up traces for symbolic values caching Nov 12, 2025

beverlylytle requested a review from IvanYashchuk November 12, 2025 12:52

IvanYashchuk requested a review from Copilot November 12, 2025 15:51

Copilot started reviewing on behalf of IvanYashchuk November 12, 2025 15:51

Copilot finished reviewing on behalf of IvanYashchuk November 12, 2025 15:52

Copilot AI reviewed Nov 12, 2025

IvanYashchuk approved these changes Nov 12, 2025

IvanYashchuk added the symbolic values label Nov 12, 2025

come on, ruff, it's a test

db83930

beverlylytle commented Nov 14, 2025

Merge branch 'main' into bl/dce_subsymbols

3986848

beverlylytle marked this pull request as ready for review November 14, 2025 14:43

beverlylytle requested review from KaelanDt, lantiga and mruberry as code owners November 14, 2025 14:43

beverlylytle commented Nov 18, 2025

beverlylytle commented Nov 18, 2025

beverlylytle added 3 commits November 18, 2025 14:41

Merge branch 'main' into bl/dce_subsymbols

94d9b81

remove print

5363067

Merge branch 'main' into bl/dce_subsymbols

3ace3b5

beverlylytle mentioned this pull request Nov 20, 2025

Handle aliasing of viewed input tensors of varying shapes #2760

Merged

shino16 approved these changes Nov 20, 2025

beverlylytle added 2 commits November 21, 2025 11:49

respond to comments

129e9d0

where's my coffee

06ca4df

shino16 approved these changes Nov 21, 2025

Merge branch 'main' into bl/dce_subsymbols

303fb68

IvanYashchuk enabled auto-merge (squash) December 2, 2025 19:03
