[WIP] feat(config): runtime config decoupling(design for reference) by rjzhb · Pull Request #383 · lightseekorg/tokenspeed

rjzhb · 2026-06-08T18:42:56Z

Summary

Legacy config flows pass HuggingFace PretrainedConfig objects straight into ModelConfig and model code. When transformers changes field names, nesting, or defaults, bugs show up deep in the runtime and fixes spread across many files.

This PR adds an engine-owned EngineModelSpec IR between HF parsing and the runtime. The HF config is translated once per model in an adapter (hf_config → EngineModelSpec), and engine code should depend on the spec, not the HF schema. After a transformers update, the fix is then usually confined to parsing or the adapter rather than model or loader code.
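
To make the adapter idea concrete, here is a minimal sketch of how an upstream field rename could be absorbed in one place. The `AttentionSpec` class, the `first_present` helper, and the alternative key `n_heads` are hypothetical and are not taken from the PR; the HF config is modeled as a plain dict so the example runs without transformers installed.

```python
from dataclasses import dataclass
from typing import Any, Mapping


@dataclass(frozen=True)
class AttentionSpec:
    # Engine-owned field names (hypothetical, not the PR's actual fields).
    num_heads: int
    num_kv_heads: int
    head_dim: int


def first_present(cfg: Mapping[str, Any], *names: str) -> Any:
    """Return the value of the first key found, so renames are handled here."""
    for name in names:
        if name in cfg:
            return cfg[name]
    raise KeyError(f"none of {names} found in HF config")


def adapt_attention(hf_cfg: Mapping[str, Any]) -> AttentionSpec:
    """Translate HF-style keys into the engine-owned spec exactly once."""
    num_heads = first_present(hf_cfg, "num_attention_heads", "n_heads")
    hidden = hf_cfg["hidden_size"]
    return AttentionSpec(
        num_heads=num_heads,
        # Fall back to plain multi-head attention when KV heads are omitted.
        num_kv_heads=hf_cfg.get("num_key_value_heads", num_heads),
        # Derive head_dim when the config does not state it explicitly.
        head_dim=hf_cfg.get("head_dim", hidden // num_heads),
    )


# Illustrative values only; "n_heads" simulates a renamed upstream field.
old_style = {"hidden_size": 4096, "num_attention_heads": 32, "num_key_value_heads": 8}
renamed = {"hidden_size": 4096, "n_heads": 32, "num_key_value_heads": 8, "head_dim": 128}

# Both schemas yield the same spec, so downstream engine code is unaffected.
assert adapt_attention(old_style) == adapt_attention(renamed)
```

Runtime code that reads only `AttentionSpec` never sees either HF spelling, which is the decoupling the summary describes.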

Pilots: minimax_m2, qwen3_5 / qwen3_5_moe
Flag: TOKENSPEED_USE_ENGINE_SPEC=1 (default off; legacy path unchanged)
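
A minimal sketch of how such an opt-in flag might gate the two paths. Only the variable name `TOKENSPEED_USE_ENGINE_SPEC` and its default-off behavior come from the PR; the helper names and the choice to treat only the literal string "1" as enabled are assumptions.

```python
import os
from typing import Mapping


def use_engine_spec(environ: Mapping[str, str] = os.environ) -> bool:
    # Assumption: only "1" turns the new path on; unset or any other value
    # keeps the legacy path, matching "default off".
    return environ.get("TOKENSPEED_USE_ENGINE_SPEC", "0") == "1"


def select_config_path(environ: Mapping[str, str] = os.environ) -> str:
    """Return which (hypothetical) config path would be taken."""
    return "engine_spec" if use_engine_spec(environ) else "legacy"


print(select_config_path({}))                                   # flag unset
print(select_config_path({"TOKENSPEED_USE_ENGINE_SPEC": "1"}))  # flag set
```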

Shared spec design

Both pilots use the same EngineModelSpec entry type—not separate per-model config systems.

Shared shell — every model produces the same top-level shape:

EngineModelSpec { schema_version, model_type, architecture, dtype, quantization, body }
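
As a rough illustration, the six shell fields could be carried by a frozen dataclass. The field names come from the PR description; the field types, the example values, and the untyped `body` placeholder are assumptions, not the PR's actual definitions.

```python
from dataclasses import dataclass, FrozenInstanceError
from typing import Any, Optional


@dataclass(frozen=True)
class EngineModelSpec:
    """Shared top-level shell; names follow the PR text, types are assumed."""
    schema_version: int          # assumed to be an integer version
    model_type: str              # e.g. "minimax_m2"
    architecture: str            # assumed to be an architecture name
    dtype: str                   # assumed string such as "bfloat16"
    quantization: Optional[str]  # assumed None when the model is unquantized
    body: Any                    # model-specific body (a typed union in the PR)


spec = EngineModelSpec(
    schema_version=1,
    model_type="minimax_m2",
    architecture="ExampleForCausalLM",  # placeholder value
    dtype="bfloat16",
    quantization=None,
    body={"placeholder": True},
)

# Freezing the shell keeps runtime code from mutating config after adaptation.
try:
    spec.dtype = "float16"
except FrozenInstanceError:
    print("spec is immutable")
```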

Shared components — architecture-specific bodies are built from reusable blocks:

GQAAttentionSpec — attention heads / KV / RoPE dim
MoEMLPSpec — expert count, top-k, routing
RMSNormSpec, RopePositionSpec — norm and position encoding

MiniMax-M2 and Qwen3.5 both use these; only the adapter mapping from HF fields differs.
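
The reusable blocks might look roughly like the following. The class names are from the PR; every field name, the validation rules, and the numbers in the usage lines are illustrative assumptions rather than real model parameters.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class GQAAttentionSpec:
    num_heads: int
    num_kv_heads: int
    rope_dim: int

    def __post_init__(self) -> None:
        # Grouped-query attention splits query heads evenly across KV heads.
        if self.num_heads % self.num_kv_heads != 0:
            raise ValueError("num_heads must be a multiple of num_kv_heads")

    @property
    def queries_per_kv(self) -> int:
        return self.num_heads // self.num_kv_heads


@dataclass(frozen=True)
class MoEMLPSpec:
    num_experts: int
    top_k: int

    def __post_init__(self) -> None:
        if not 0 < self.top_k <= self.num_experts:
            raise ValueError("top_k must be between 1 and num_experts")


@dataclass(frozen=True)
class RMSNormSpec:
    eps: float


@dataclass(frozen=True)
class RopePositionSpec:
    theta: float


# Two different models can instantiate the same component type; only the
# values produced by each adapter differ (numbers here are made up).
attn_a = GQAAttentionSpec(num_heads=48, num_kv_heads=8, rope_dim=64)
attn_b = GQAAttentionSpec(num_heads=16, num_kv_heads=2, rope_dim=64)
print(attn_a.queries_per_kv, attn_b.queries_per_kv)
```

Putting invariants such as head divisibility in the component means a malformed config fails at adaptation time instead of deep in the runtime.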

Typed body union — model-specific details live in body, not the shell:

ModelBody = MinimaxM2ModelSpec | Qwen35ModelSpec

MiniMax-M2: shared components + MTP fields (num_mtp_modules, …)
Qwen3.5: shared components + hybrid extras (GatedDeltaNetSpec, full_attention_interval, dense/MoE sizes, …)
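
One way to express a typed body union in Python is a tagged union with a literal `type` discriminator. The class names and the fields `num_mtp_modules` and `full_attention_interval` appear in the PR; the discriminator values, the reduced field sets, and the `describe` helper are assumptions made for this sketch.

```python
from dataclasses import dataclass
from typing import Literal, Union


@dataclass(frozen=True)
class MinimaxM2ModelSpec:
    num_mtp_modules: int
    type: Literal["minimax_m2"] = "minimax_m2"  # assumed discriminator value


@dataclass(frozen=True)
class Qwen35ModelSpec:
    full_attention_interval: int
    type: Literal["qwen3_5"] = "qwen3_5"  # assumed discriminator value


ModelBody = Union[MinimaxM2ModelSpec, Qwen35ModelSpec]


def describe(body: ModelBody) -> str:
    """Branch on the discriminator; model-specific fields stay in the body."""
    if body.type == "minimax_m2":
        return f"num_mtp_modules={body.num_mtp_modules}"
    if body.type == "qwen3_5":
        return f"full_attention_interval={body.full_attention_interval}"
    raise ValueError(f"unknown body type: {body.type}")


# Illustrative values only.
print(describe(MinimaxM2ModelSpec(num_mtp_modules=1)))
print(describe(Qwen35ModelSpec(full_attention_interval=4)))
```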

Single dispatch — build_engine_spec() routes by model_type to adapters/minimax_m2 or adapters/qwen3_5; both return EngineModelSpec. ModelConfig then branches on spec.body.type only for the RuntimeView bridge.
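
The single-dispatch step can be sketched as a registry keyed by model_type. `build_engine_spec` is named in the PR; the registry, the stub adapters, the dict return values, and the assumption that `qwen3_5_moe` routes to the same adapter as `qwen3_5` are illustrative only.

```python
from typing import Any, Callable, Dict, Mapping

Adapter = Callable[[Mapping[str, Any]], Dict[str, Any]]


def adapt_minimax_m2(hf_cfg: Mapping[str, Any]) -> Dict[str, Any]:
    # Stub: a real adapter would map HF fields into the shared components.
    return {"model_type": "minimax_m2", "body": {"type": "minimax_m2"}}


def adapt_qwen3_5(hf_cfg: Mapping[str, Any]) -> Dict[str, Any]:
    # Stub shared by the dense and MoE variants (an assumption).
    return {"model_type": hf_cfg["model_type"], "body": {"type": "qwen3_5"}}


_ADAPTERS: Dict[str, Adapter] = {
    "minimax_m2": adapt_minimax_m2,
    "qwen3_5": adapt_qwen3_5,
    "qwen3_5_moe": adapt_qwen3_5,
}


def build_engine_spec(hf_cfg: Mapping[str, Any]) -> Dict[str, Any]:
    """Route by model_type; every adapter returns the same spec shape."""
    model_type = hf_cfg.get("model_type")
    try:
        adapter = _ADAPTERS[model_type]
    except KeyError:
        raise ValueError(
            f"no engine spec adapter for model_type={model_type!r}"
        ) from None
    return adapter(hf_cfg)


print(build_engine_spec({"model_type": "qwen3_5_moe"})["body"]["type"])
```

Under this layout, supporting another model amounts to one new adapter function and one registry entry.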

New models: add a body variant and an adapter, reusing existing components where possible; no new flat config type per model.

Signed-off-by: rjzhb <rjzhb222@163.com>

github-actions · 2026-06-25T20:35:37Z

This PR has been inactive for 14 days and is marked as stale. It will be closed in 3 days if there is no further activity.

rjzhb force-pushed the feat/runtime-config-decoupling branch 3 times, most recently from 1d7661a to 490b777 · June 8, 2026 21:04

Add minimax_m2 engine spec adapter pilot

66150e6

rjzhb force-pushed the feat/runtime-config-decoupling branch from 490b777 to 66150e6 · June 8, 2026 21:07

Add qwen3_5 engine spec adapter pilot

4f0eea5

Signed-off-by: rjzhb <rjzhb222@163.com>

rjzhb force-pushed the feat/runtime-config-decoupling branch from d02e72c to 4f0eea5 · June 8, 2026 21:34

Merge branch 'main' into feat/runtime-config-decoupling

984b7ff

rjzhb changed the title ~~[WIP] feat(config): runtime config decoupling~~ [WIP] feat(config): runtime config decoupling(design for reference) Jun 9, 2026

github-actions Bot added the inactive label Jun 25, 2026

rjzhb wants to merge 3 commits into lightseekorg:main from rjzhb:feat/runtime-config-decoupling
