[discrete diffusion] Add DiffusionGemma pipeline and schedulers by kashif · Pull Request #13986 · huggingface/diffusers

kashif · 2026-06-18T12:34:12Z

Adds a DiffusionGemma block-diffusion pipeline, alongside the schedulers already on this branch (discrete DDIM, entropy bound, and a uniform mode for block refinement).

DiffusionGemma is an encoder-decoder block-diffusion model: the encoder reads the prompt into a KV cache and the decoder denoises a fixed-size canvas by cross-attending to it. The pipeline runs the outer canvas loop and the inner denoising loop, sampling candidates each step, committing the most confident ones via BlockRefinementScheduler in uniform corruption mode, and renoising the rest. Structure mirrors the LLaDA2 and dflash (#13699) pipelines.
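To make the commit-and-renoise step concrete, here is a minimal sketch of one inner denoising step over a single block. It is an illustration only, not the BlockRefinementScheduler implementation: the function name `commit_and_renoise`, its arguments, and the fixed `num_to_commit` budget are all assumptions, and the real scheduler may use thresholds or a different commit schedule.

```python
import random


def commit_and_renoise(canvas, candidates, confidences, committed,
                       num_to_commit, vocab_size, rng):
    """One simplified refinement step (hypothetical helper, not the diffusers API).

    canvas:      current token ids for the block
    candidates:  token ids sampled from the denoiser, one per position
    confidences: confidence of each candidate (for example its probability)
    committed:   booleans, True where a token was committed in an earlier step
    """
    # Rank the still-open positions by confidence, highest first.
    open_positions = [i for i, done in enumerate(committed) if not done]
    ranked = sorted(open_positions, key=lambda i: confidences[i], reverse=True)
    to_commit = set(ranked[:num_to_commit])

    new_canvas, new_committed = list(canvas), list(committed)
    for i in open_positions:
        if i in to_commit:
            # Commit the most confident candidates.
            new_canvas[i] = candidates[i]
            new_committed[i] = True
        else:
            # Uniform corruption: renoise by drawing a random vocabulary token.
            new_canvas[i] = rng.randrange(vocab_size)
    return new_canvas, new_committed


rng = random.Random(0)
canvas = [rng.randrange(10) for _ in range(4)]
canvas, committed = commit_and_renoise(
    canvas,
    candidates=[7, 3, 5, 1],
    confidences=[0.9, 0.2, 0.6, 0.1],
    committed=[False, False, False, False],
    num_to_commit=2,
    vocab_size=10,
    rng=rng,
)
```

In this toy run positions 0 and 2 are committed and the other two are redrawn from the vocabulary, which is the role uniform corruption plays in place of a mask token.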

The model itself lives in transformers as DiffusionGemmaForBlockDiffusion (released in 5.12.0).

Tested:

pipeline unit tests pass (plumbing, callbacks, output types)
the pipeline drives the real tiny checkpoint end to end without error

Quality on the full google/diffusiongemma-26B-A4B-it checkpoint still needs a GPU run.

HuggingFaceDocBuilderDev · 2026-06-18T12:43:19Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

zucchini-nlp

Looking great! A couple questions from quick skimming

yiyixuxu

thanks for the PR! i left a few comments

I reviewed this through the lens of diffusers conventions and style. If some of these choices are intentional to keep things familiar for Transformers users, let me know, and we can figure out the right balance together.

yiyixuxu · 2026-06-18T21:57:25Z

+    def __call__(
+        self,
+        prompt: str | list[str] | None = None,
+        messages: list[dict[str, str]] | None = None,


I think that between prompt and messages we only need to accept prompt, since it's really cheap to turn it into messages.

it's just this, no?

messages = [{"role": "user", "content": prompt}]

Makes sense. The one wrinkle is image prompts, which we pass through messages today, so I'll fold the prompt/messages simplification into the image input rework so single-image and text both stay clean. Coming in a follow-up.

Made prompt the primary input and dropped the tokenized intermediates. Kept messages for raw multi-turn/multimodal conversations (per the thread below with zucchini), and added a raw image arg for the simple prompt+image case, so it is all raw inputs now.
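For readers following the thread, the resolution can be sketched as a small normalizer that turns the three raw inputs into one messages list. This is a hedged illustration, not the code in the PR: `build_messages` is a hypothetical name, it handles a single string prompt only, and the image/text content layout is assumed to follow the usual Transformers chat-template convention.

```python
def build_messages(prompt=None, messages=None, image=None):
    """Normalize raw inputs into a chat-style messages list (hypothetical helper)."""
    if messages is not None:
        # A raw conversation is passed through untouched.
        if prompt is not None or image is not None:
            raise ValueError("Pass either `messages` or `prompt`/`image`, not both.")
        return messages
    if prompt is None:
        raise ValueError("One of `prompt` or `messages` is required.")
    if image is None:
        # The text-only case is the one-liner from the review comment.
        return [{"role": "user", "content": prompt}]
    # Assumed multimodal layout: one user turn with an image part and a text part.
    return [
        {
            "role": "user",
            "content": [
                {"type": "image", "image": image},
                {"type": "text", "text": prompt},
            ],
        }
    ]


text_only = build_messages(prompt="Describe block diffusion in one sentence.")
with_image = build_messages(prompt="What is in this picture?", image="cat.png")
```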

Adds optional Gibbs corrector sweeps after each predictor step for uniform diffusion, recovering the LOO denoiser in closed form so it works on the released checkpoint with no retraining. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
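One way a leave-one-out (LOO) denoiser can be recovered in closed form is plain Bayes' rule, sketched below. This is an assumption about the idea, not a description of the PR's code: it presumes the standard uniform forward kernel q(x_t = u | x_0 = v) = alpha * [u == v] + (1 - alpha) / V applied independently per position, and the name `loo_from_posterior` is hypothetical. Under that assumption the model's per-position posterior is proportional to the LOO distribution times the kernel, so dividing by the kernel and renormalizing gives the LOO distribution back.

```python
def loo_from_posterior(posterior, observed, alpha):
    """Recover p(x0_i | x_t without position i) from p(x0_i | x_t).

    posterior: list of probabilities over the vocabulary for position i
    observed:  the noisy token currently at position i
    alpha:     probability that the forward process kept the clean token (alpha < 1)
    """
    vocab_size = len(posterior)
    weights = []
    for v, p in enumerate(posterior):
        # Likelihood of seeing `observed` if the clean token were v.
        likelihood = alpha * (1.0 if v == observed else 0.0) + (1.0 - alpha) / vocab_size
        weights.append(p / likelihood)
    total = sum(weights)
    return [w / total for w in weights]


# Round trip: start from a known LOO distribution, apply Bayes, then invert.
loo = [0.5, 0.3, 0.2]
observed, alpha = 1, 0.6
kernel = [alpha * (v == observed) + (1 - alpha) / len(loo) for v in range(len(loo))]
unnormalized = [p * k for p, k in zip(loo, kernel)]
posterior = [u / sum(unnormalized) for u in unnormalized]
recovered = loo_from_posterior(posterior, observed, alpha)
```

Because this only post-processes the probabilities the model already outputs, it is consistent with the commit's claim that no retraining is needed.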

The denoiser is a Transformers model, so adapters (LoRA, DoRA, ...) load through its native PEFT integration rather than the diffusers LoRA loader. Also dispatch the predictor-corrector by scheduler capability instead of class. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
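Dispatching by capability rather than by class can be sketched with duck typing, as below. The class names and the `step`/`correct` methods are illustrative assumptions and are not the scheduler API in this PR.

```python
class PredictorOnlyScheduler:
    """Toy scheduler that only knows how to take a predictor step."""

    def step(self, canvas):
        return canvas + ["predict"]


class PredictorCorrectorScheduler:
    """Toy scheduler that also exposes a corrector sweep."""

    def step(self, canvas):
        return canvas + ["predict"]

    def correct(self, canvas):
        return canvas + ["correct"]


def run_step(scheduler, canvas, num_corrector_sweeps=0):
    # Every scheduler takes the predictor step.
    canvas = scheduler.step(canvas)
    # Corrector sweeps run only if the scheduler advertises the capability,
    # so no isinstance check against a specific class is needed.
    corrector = getattr(scheduler, "correct", None)
    if callable(corrector):
        for _ in range(num_corrector_sweeps):
            canvas = corrector(canvas)
    return canvas


plain = run_step(PredictorOnlyScheduler(), [], num_corrector_sweeps=2)
refined = run_step(PredictorCorrectorScheduler(), [], num_corrector_sweeps=2)
```

The benefit of this shape is that a new scheduler gains corrector support by implementing the method, without the pipeline needing to know its class.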

Build callback_kwargs with a loop instead of a dict comprehension, whose own scope hides locals() on pre-3.12 (PEP 709), causing KeyError: 'canvas'. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
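The scoping issue is easy to reproduce outside the pipeline. The sketch below uses hypothetical function names and a list as a stand-in for the canvas tensor. The loop version reads the function's own namespace on every Python version, while the comprehension version is expected to raise KeyError before 3.12 and to succeed from 3.12 onward, where PEP 709 inlines comprehensions.

```python
import sys


def gather_with_loop(tensor_inputs):
    canvas = [3, 1, 4]  # stand-in for the pipeline's canvas tensor
    step = 0
    callback_kwargs = {}
    for name in tensor_inputs:
        # locals() here is the enclosing function's namespace.
        callback_kwargs[name] = locals()[name]
    return callback_kwargs


def gather_with_comprehension(tensor_inputs):
    canvas = [3, 1, 4]
    step = 0
    # Before Python 3.12 the comprehension body runs in its own implicit
    # scope, so locals() there does not contain `canvas` or `step`.
    return {name: locals()[name] for name in tensor_inputs}


print(gather_with_loop(["canvas", "step"]))
try:
    print(gather_with_comprehension(["canvas", "step"]))
except KeyError as err:
    version = f"{sys.version_info.major}.{sys.version_info.minor}"
    print(f"KeyError on Python {version}: {err}")
```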

kashif added 3 commits June 14, 2026 19:07

Add discrete DDIM and entropy bound schedulers and a uniform mode for block refinement

ef0b135

Add DiffusionGemma block-diffusion pipeline

6168e6d

Add DiffusionGemma pipeline tests and docs

375d63a

github-actions bot added the size/L (PR with diff > 200 LOC), documentation, tests, utils, pipelines, and schedulers labels on Jun 18, 2026

kashif added 2 commits June 18, 2026 14:48

Merge branch 'main' into diffusion-gemma-schedulers

05d7f66

Put DiffusionGemma docs under the Text pipelines section

245a6ef

zucchini-nlp reviewed Jun 18, 2026

Comment thread src/diffusers/pipelines/diffusion_gemma/pipeline_diffusion_gemma.py Outdated

Comment thread src/diffusers/pipelines/diffusion_gemma/pipeline_diffusion_gemma.py Outdated

Comment thread src/diffusers/pipelines/diffusion_gemma/pipeline_diffusion_gemma.py Outdated

kashif added 5 commits June 18, 2026 15:50

Add static cache and fullgraph-compiled decoder path to DiffusionGemma pipeline

aae6e02

Compile decoder externally for the static cache path instead of a pipeline flag

d601881

Prefill the encoder once into a reusable cache and sync default denoising steps

9351781

Support image prompts by forwarding pixel_values to the encoder prefill

4ce203e

Restyle docstrings to satisfy doc-builder

9d1df71

kashif mentioned this pull request Jun 18, 2026

Discrete diffusion in diffusers #12911 (draft, 6 tasks)

kashif added 4 commits June 18, 2026 18:22

Sort the new scheduler and pipeline exports

18651f5

Let any of the three schedulers drive the pipeline

1d1efe7

Document the schedulers and updated defaults in the pipeline docs

8a9ffcf

Sort the scheduler dummy objects

73448d9

yiyixuxu reviewed Jun 18, 2026

kashif and others added 4 commits June 19, 2026 09:35

Set scheduler sampling knobs on the scheduler config, not the pipeline call

04dd9b9

Merge branch 'main' into diffusion-gemma-schedulers

897bca6

Accept raw prompt/image/messages instead of pre-tokenized model inputs

0f0041d

kashif requested a review from dg845 June 20, 2026 08:16

kashif requested a review from sayakpaul June 20, 2026 10:29

Fix callback kwargs gathering on Python < 3.12

177c13f

kashif wants to merge 20 commits into huggingface:main from kashif:diffusion-gemma-schedulers

4 participants