Add Gemma3-270M model configuration support #500 #501

chethanuk · 2025-10-04T22:38:13Z

Add ModelConfig.gemma3_270m() classmethod with architecture specs:

18 layers, 640 embed_dim, 2048 hidden_dim
4 heads (256 head_dim), 1 KV head (GQA)
512 sliding window, 262144 vocab size

Add checkpoint path constants (GEMMA3_270M_PT, GEMMA3_270M_IT)

Resolves #500

It's a good idea to open an issue first for discussion.

Reference

Colab Notebook

Checklist

I have added all the necessary unit tests for my change.
I have verified that my change does not break existing code and all unit tests pass.
I have added all appropriate doc-strings/documentation.
My PR is based on the latest changes of the main branch (if unsure, rebase the code).
I have signed the Contributor License Agreement.
I have followed Contribution Guidelines.

- Add ModelConfig.gemma3_270m() classmethod with architecture specs: * 18 layers, 640 embed_dim, 2048 hidden_dim * 4 heads (256 head_dim), 1 KV head (GQA) * 512 sliding window, 262144 vocab size - Add checkpoint path constants (GEMMA3_270M_PT, GEMMA3_270M_IT) - Add test coverage for gemma3-270m model routing - Verified against official HuggingFace config Resolves google#500

tianshub

Thanks for adding the support! Btw, you don't need to manually merge from github, it will be automatically merged once the internal test passed.

abheesht17

@chethanuk - do you have a notebook comparing outputs with the reference model?

tianshub self-requested a review October 5, 2025 03:58

chethanuk temporarily deployed to testing October 5, 2025 03:58 — with GitHub Actions Inactive

tianshub approved these changes Oct 5, 2025

View reviewed changes

abheesht17 reviewed Oct 5, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add Gemma3-270M model configuration support #500 #501

Add Gemma3-270M model configuration support #500 #501

chethanuk commented Oct 4, 2025

Uh oh!

tianshub left a comment

Uh oh!

abheesht17 left a comment

Uh oh!

Uh oh!

Add Gemma3-270M model configuration support #500 #501

Are you sure you want to change the base?

Add Gemma3-270M model configuration support #500 #501

Conversation

chethanuk commented Oct 4, 2025

Uh oh!

tianshub left a comment

Choose a reason for hiding this comment

Uh oh!

abheesht17 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!