Commit e3a7958
committed
Add XSA + EMA + TTT merged train_gpt.py
Combines PR openai#287 (XSA + EMA + Int6 QAT) with PR openai#254 TTT adaptation.
Changes: FA2 fallback import, TTT hyperparameters, ttt_adapt function,
TTT call before torch.compile in eval section.1 parent 9605e98 commit e3a7958
1 file changed
Lines changed: 1652 additions & 0 deletions
0 commit comments