Conversation

@xhr15 commented Oct 21, 2025

Logits processing is a powerful tool, particularly for using smaller language models for tasks such as named entity recognition. @seanmor5 started work in this area with #354.

Whatever the approach, it will require some kind of state.

This pull request is a proposal to allow logits processors to be stateful.

This would enable the use of deterministic finite automata (DFAs) or pushdown automata (PDAs) to enforce constrained grammars during logits processing. bitcrowd#6 shows how this would be used. We will follow up on this PR if this approach is favoured.
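
For illustration only, here is a minimal sketch of how a stateful processor could thread automaton state through a generation step. The module name, state layout, and the precomputed `allowed`/`transitions` tables are hypothetical, not part of this PR:

defmodule ConstrainedSketch do
  import Nx.Defn

  # `allowed` is a {num_states, vocab_size} 0/1 mask and `transitions`
  # a {num_states, vocab_size} next-state table, both precomputed from
  # a DFA (names and layout are hypothetical).
  defn constrained_process(state, logits, allowed, transitions) do
    # Mask out logits for tokens that are not valid transitions
    # from the current automaton state.
    mask = Nx.take(allowed, state.dfa_state)
    logits = Nx.select(mask, logits, Nx.Constants.neg_infinity())

    # Advance the automaton with the greedy token (shown here purely
    # to illustrate how state threads through the generation loop).
    token = Nx.argmax(logits)
    next = Nx.take(Nx.take(transitions, state.dfa_state), token)
    {%{state | dfa_state: next}, logits}
  end
end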

@jonatanklosko (Member) left a comment

Hey @xhr15 and @joelpaulkoch, thanks for the PR!

I dropped a few comments, but the main one is about the API. I know it's a bit more involved, but probably worth it. Let me know what you think, and if you have any concerns!

context =
  put_in(
    context,
    [:logits_processor_state, :next_suppressed_token_id],
@jonatanklosko (Member) commented:

With the current API, the state is always initialized to %{}, and then the first invocation of the processor adds a key, here %{next_suppressed_token_id: %Nx.Tensor{...}}.

This can be problematic in a defn while loop, which requires the accumulated state to always have the same shape. In other words, the initial state should already include :next_suppressed_token_id with a default tensor. It is possible that this didn't come up during your tests because, depending on the model/input, we do the first generation step outside of the while loop, and that first call would initialize the state. However, if we are going to support stateful processors, I would rather do it in a more robust way.
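
A minimal sketch of the difference (the -1 default is a hypothetical placeholder):

# Current: the state starts empty and gains a key on the first call,
# so its shape changes between while-loop iterations.
state = %{}

# More robust: the key exists from the start with a default tensor,
# so every iteration carries the same shapes.
state = %{next_suppressed_token_id: Nx.tensor(-1)}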

Given the above, a stateful logits processor would involve two steps (functions):

  1. Building an initial state.
  2. Performing logits processing, which receives logits and state, and returns updated logits and state.

This way we can call (1) when initializing the generation context, and for the actual processing we call (2).

The behaviour can be similar to Bumblebee.Scheduler. Something like this:

defmodule Bumblebee.LogitsProcessor do
  @moduledoc """
  An interface for configuring and using logits processors.

  Logits processors are used during autoregressive generation to modify
  predicted scores at each generation step. This allows for applying
  certain rules to the model output to control which tokens are picked
  at each generation step, and which are not.

  Every module implementing this behaviour is expected to also define
  a configuration struct.
  """

  @type t :: Bumblebee.Configurable.t()

  @type state :: Nx.Container.t()

  @doc """
  Initializes state for a new logits processor.

  Returns `state`, which is an opaque `Nx.Container` that is then
  passed to and returned from `process/4`.

  Oftentimes logits processors are stateless, in which case this
  function can return an empty container, such as `{}`.
  """
  @callback init(t(), context) :: state()
            when context: %{
                   prng_key: Nx.Tensor.t()
                 }

  @doc """
  Processes logits, applying specific rules.
  """
  @callback process(
              t(),
              state(),
              logits :: Nx.Tensor.t(),
              context :: context
            ) :: {state(), logits :: Nx.Tensor.t()}
            when context: %{
                   sequence: Nx.Tensor.t(),
                   length: Nx.Tensor.t(),
                   input_length: Nx.Tensor.t()
                 }
end

Technically, the :logits_processors option is public API, but we can make this backward-compatible. For example, we can define %Bumblebee.Text.Generation.StatelessLogitsProcessor{fun: fun}, where the state is always empty and process just invokes the fun. I would even use that for the built-in processors, so that we don't need to define a bunch of new modules.
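
A sketch of what that wrapper could look like, assuming the behaviour proposed above and the current two-argument processor functions (not part of this PR):

defmodule Bumblebee.Text.Generation.StatelessLogitsProcessor do
  @moduledoc false

  # Wraps a plain logits-processing function in the stateful
  # interface, so existing :logits_processors entries keep working.
  defstruct [:fun]

  @behaviour Bumblebee.LogitsProcessor

  @impl true
  def init(_processor, _context), do: {}

  @impl true
  def process(%__MODULE__{fun: fun}, {} = state, logits, context) do
    {state, fun.(logits, context)}
  end
end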

@xhr15 (Author) replied:

@jonatanklosko Thank you very much for your comments! I think especially the two-step call makes sense. We'll move in that direction :)

@xhr15 (Author) replied:

@jonatanklosko, as an afterthought:

What is the use case for context here:

@callback init(t(), context) :: state()
            when context: %{
                   prng_key: Nx.Tensor.t()
                 }

Later in the loop, context holds:

context = %{
  sequences: sequences,
  input_length: length,
  length: length
}

I am wondering how those would influence the initialisation of the logits processors?

Or are you planning on using additional keys? E.g. from the state as returned by the init sequence:

%{
  sequences: sequences,
  input_length: length,
  length: length,
  finished_length: finished_length,
  ignored: Nx.broadcast(0, {batch_size})
}

If that were the case, we should probably rename the parameter to state or initial_state.

Wdyt?

@xhr15 (Author) commented Oct 24, 2025

@jonatanklosko Before we add more tests and do further refactoring: do you think this goes in the right direction? Please let me know if you have concerns or if anything could be improved.
