@xmfan xmfan commented Oct 13, 2025

Stacked PRs:


  1. There is a commitment to turn capture_scalar_outputs on by default, and I'm adding sample MoE tests to core, so we can flip this back.
  2. Using the compile(disable(compile(_forward, fullgraph=True)), fullgraph=False) trick, we can ensure the grouped GEMM is always captured.

logs: https://gist.github.com/xmfan/4fd7efc7718795d302304dab807fabd6

… grouped experts

stack-info: PR: #1861, branch: xmfan/stack/1
@meta-cla meta-cla bot added the CLA Signed label Oct 13, 2025