refactor(trainer): remove deprecated light-megatron backend path by WangLingxun · Pull Request #585 · AMD-AGI/Primus

WangLingxun · 2026-03-09T07:06:09Z

Remove the legacy light-megatron trainer implementation and clean up all related framework aliases. This change deletes the lightmegatron trainer modules and removes light-megatron routing from parser and hook dispatchers (train pretrain and projection performance), so framework resolution now follows megatron directly without a compatibility alias.

- Update third_party/Megatron-LM to core_v0.16.0. - Adapt muon optimizer patch signatures/call paths to Megatron 0.16 (config_overrides, optimizer arg forwarding). - Relax turbo TE spec provider gate to <0.17 for DeepSeek compatibility on v0.16. - Fix DDP/FSDP grad-sync compatibility by allowing _BaseDataParallel.{start,finish}_grad_sync to accept extra kwargs (e.g. force_all_reduce).

WangLingxun requested review from Xiaoming-AMD, limou102 and wenxie-amd as code owners March 9, 2026 07:06

WangLingxun force-pushed the refactor/remove-light-megatron-path branch from f8a4952 to ed4533e Compare March 9, 2026 07:09

WangLingxun and others added 2 commits March 16, 2026 14:06

Merge branch 'main' into refactor/remove-light-megatron-path

1894468

Merge branch 'main' into refactor/remove-light-megatron-path

4e7d06f

Xiaoming-AMD approved these changes Mar 23, 2026

View reviewed changes

Xiaoming-AMD merged commit 5ec7649 into main Mar 23, 2026
2 of 5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

refactor(trainer): remove deprecated light-megatron backend path#585

refactor(trainer): remove deprecated light-megatron backend path#585
Xiaoming-AMD merged 3 commits intomainfrom
refactor/remove-light-megatron-path

WangLingxun commented Mar 9, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

WangLingxun commented Mar 9, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants