
Conversation

adityavipradas commented May 31, 2025

Resolving issue #92

  • Training: do not build KV cache
  • Validation: build KV cache
  • Inference: build KV cache

Added a self.training argument in language_model.py to populate the KV cache during inference and validation, and to set it to [None] during training.

  • This keeps the code cleaner and easier to understand than adding additional arguments to the call functions (a sketch follows below).
  • However, using the self.training argument is not a perfect solution, as it will also populate the KV cache during validation; additional modifications would be needed to handle this.
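
A minimal sketch of how a self.training-gated cache could look inside a decoder block. The module and attention layer here are illustrative stand-ins, not the actual code in language_model.py:

```python
import torch
import torch.nn as nn


class DecoderBlock(nn.Module):
    """Toy block showing a KV cache gated on self.training (sketch only)."""

    def __init__(self, dim: int, n_heads: int):
        super().__init__()
        self.attn = nn.MultiheadAttention(dim, n_heads, batch_first=True)
        # Cached context; only populated outside of training.
        self.kv_cache = None

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        if self.training:
            # Training: no cache, attend over the full sequence every step.
            self.kv_cache = None
            out, _ = self.attn(x, x, x)
            return out

        # Eval / inference: extend the cached context with the new tokens and
        # attend against it. (For brevity the raw inputs are cached here rather
        # than projected keys/values.)
        context = x if self.kv_cache is None else torch.cat([self.kv_cache, x], dim=1)
        self.kv_cache = context.detach()
        out, _ = self.attn(x, context, context)
        return out
```

With this gating, model.train() clears the cache on the next forward pass, while model.eval() plus repeated incremental calls keeps appending to it.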

added self.training argument to build KV cache only during inference and evaluation
replaced self.training with torch.is_grad_enabled()
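
The second commit amounts to swapping the gate condition, roughly along these lines (illustrative, not the actual diff):

```python
import torch


def should_build_kv_cache_v1(module: torch.nn.Module) -> bool:
    # First commit: build the cache whenever the module is not in training mode.
    return not module.training


def should_build_kv_cache_v2() -> bool:
    # Second commit: build the cache whenever autograd is disabled
    # (e.g. inside a torch.no_grad() block).
    return not torch.is_grad_enabled()
```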
kashif (Collaborator) commented May 31, 2025

If self.training is true during evaluation, then perhaps we are doing something wrong by missing a call to model.eval()?

adityavipradas (Author)

model.eval() is right where it needs to be.

What we need is:

  • Training: do not build KV cache
  • Validation: do not build KV cache
  • Inference: build KV cache

If building the KV cache during validation is fine, we can use self.training or torch.is_grad_enabled(). Please let me know and I will make the modifications accordingly. Thank you.
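
A quick check of why neither flag can separate validation from inference on its own, assuming the usual model.eval() + torch.no_grad() validation loop; the use_cache argument in the trailing comment is hypothetical, not part of this PR:

```python
import torch
import torch.nn as nn

model = nn.Linear(8, 8)  # stand-in for the language model

# Training: self.training is True and autograd is on -> no KV cache.
model.train()
assert model.training and torch.is_grad_enabled()

# Validation: a typical loop calls model.eval() and runs under torch.no_grad(),
# so both checks report exactly what they report during generation.
model.eval()
with torch.no_grad():
    assert not model.training and not torch.is_grad_enabled()

# Inference / generation: identical flags to validation, so neither check can
# tell the two apart. Separating them would need an explicit signal, e.g. a
# hypothetical `use_cache=True/False` argument threaded through forward().
```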

adityavipradas changed the title from "Feat: adding torch.is_grad_enabled() argument to implement KV cache only during inference" to "Feat: adding self.training argument to implement KV cache during validation and inference" on May 31, 2025
Both torch.is_grad_enabled() and self.training lead to the same KV cache building outcome.
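
This equivalence holds as long as evaluation runs inside torch.no_grad(); a small illustration of the one case where the two checks would diverge (evaluation without a no_grad context):

```python
import torch
import torch.nn as nn

model = nn.Linear(8, 8)  # stand-in for the language model
model.eval()

# Evaluation without torch.no_grad(): the module flag says "eval",
# but autograd is still on, so the two checks disagree here.
print(model.training)           # False
print(torch.is_grad_enabled())  # True

# Under torch.no_grad() (the usual validation / generation setup),
# both checks agree and gate the KV cache identically.
with torch.no_grad():
    print(model.training)           # False
    print(torch.is_grad_enabled())  # False
```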
