I'm implementing the papers BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding and Attention Is All You Need to apply what I have learned about transformer architectures.
billray0259/my_transformer
Folders and files
| Name | Name | Last commit date | ||
|---|---|---|---|---|