Fix GPT-2 compatibility and long-input handling in perplexity by zanvari · Pull Request #767 · huggingface/evaluate

zanvari · 2026-06-17T23:41:39Z

Summary

This PR fixes GPT-2 compatibility issues in the perplexity metric/measurement and prevents overlong inputs from raising an IndexError.

While investigating the GPT-2 failure, I found that with max_length=None, overlong inputs could also raise an IndexError. This PR therefore addresses both issues and adds regression tests.

Changes

Replace tokenizer.special_tokens_map_extended with tokenizer.special_tokens_map.
Use the tokenizer EOS token as the padding token when padding is required and no pad token is defined.
Avoid padding when batch_size=1.
Default max_length to the tokenizer or model maximum length when it is not explicitly provided, preventing overlong inputs from causing indexing errors.
Add regression tests for GPT-2 perplexity computation and long-input handling.
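
The pad-token fallback and the max_length default described above can be sketched roughly as follows. This is a minimal illustration, not the code in this PR: `StubTokenizer`, `ensure_pad_token`, `resolve_max_length`, and the `model_max_positions` parameter are hypothetical names, and the stub only mimics the `eos_token`, `pad_token`, and `model_max_length` attributes of a Hugging Face tokenizer.

```python
from dataclasses import dataclass
from typing import Optional

# Assumption: tokenizers without a configured limit report a very large
# sentinel as model_max_length, so values at or above this count as "unset".
UNSET_LIMIT = int(1e30)


@dataclass
class StubTokenizer:
    """Hypothetical stand-in for a tokenizer, for illustration only."""
    eos_token: Optional[str] = "<|endoftext|>"
    pad_token: Optional[str] = None  # GPT-2 ships without a pad token
    model_max_length: int = 1024


def ensure_pad_token(tokenizer, batch_size: int):
    """Reuse EOS as the pad token, but only when padding is actually needed."""
    if batch_size > 1 and tokenizer.pad_token is None:
        if tokenizer.eos_token is None:
            raise ValueError("Padding is required but no EOS token is available.")
        tokenizer.pad_token = tokenizer.eos_token
    return tokenizer


def resolve_max_length(max_length: Optional[int], tokenizer,
                       model_max_positions: Optional[int] = None) -> Optional[int]:
    """Fall back to the tokenizer or model limit when max_length is None."""
    if max_length is not None:
        return max_length  # an explicit value always wins
    candidates = [
        limit
        for limit in (tokenizer.model_max_length, model_max_positions)
        if limit is not None and limit < UNSET_LIMIT
    ]
    # Use the tightest known limit; None means no limit could be determined.
    return min(candidates) if candidates else None


if __name__ == "__main__":
    tok = StubTokenizer()
    ensure_pad_token(tok, batch_size=1)
    print(tok.pad_token)  # still unset: a single sequence needs no padding
    ensure_pad_token(tok, batch_size=4)
    print(tok.pad_token)  # now the EOS token
    print(resolve_max_length(None, tok, model_max_positions=1024))
```

With a resolved limit, the caller can truncate inputs to that length before the forward pass, so no token position exceeds what the model's position embeddings cover.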

Tests

python3 -m pytest tests/test_perplexity.py -v

Result:

2 passed

Fixes #766

Fix GPT-2 compatibility and max length handling in perplexity

d1869e9

zanvari mentioned this pull request Jun 18, 2026

Perplexity metric fails with GPT-2 tokenizer #766

Open

zanvari wants to merge 1 commit into huggingface:main from zanvari:fix-perplexity-gpt2-tokenizer
