Skip to content

Commit e3a7958

Browse files
committed
Add XSA + EMA + TTT merged train_gpt.py
Combines PR openai#287 (XSA + EMA + Int6 QAT) with PR openai#254 TTT adaptation. Changes: FA2 fallback import, TTT hyperparameters, ttt_adapt function, TTT call before torch.compile in eval section.
1 parent 9605e98 commit e3a7958

1 file changed

Lines changed: 1652 additions & 0 deletions

File tree

0 commit comments

Comments
 (0)