[ROCm] Add MI355X-only MiniMax-M3 MXFP4 variant by functionstackx · Pull Request #580 · vllm-project/recipes

functionstackx · 2026-06-25T20:07:12Z

Summary

add amd/MiniMax-M3-MXFP4 as an MXFP4 variant of the existing MiniMaxAI/MiniMax-M3 recipe
use the ROCm nightly image and the validated TP8/encoder settings on MI355X
add variant-level hardware allowlists and exact hardware overrides so MXFP4 is unavailable everywhere except MI355X
remove stale generated hardware artifacts when compatibility shrinks
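
As a rough illustration of the allowlist and exact-override idea in the summary, here is a minimal sketch. The field names `supported_hardware`, `docker_image`, `extra_args`, and `extra_env` appear in the review summary, and `isVariantHardwareSupported` is named in the diff. Everything else is an assumption made for the example and may not match the repository: the `hardware_overrides` key, the `model_id` key, the image name, the env var, and both function bodies.

```javascript
// Hypothetical variant record. Only the field names supported_hardware,
// docker_image, extra_args and extra_env come from the PR discussion.
const mxfp4Variant = {
  precision: "mxfp4",
  model_id: "amd/MiniMax-M3-MXFP4",
  supported_hardware: ["mi355x"],
  hardware_overrides: {                              // assumed key name
    mi355x: {
      docker_image: "example/rocm-nightly:latest",   // placeholder, not the real image
      extra_args: ["--tensor-parallel-size", "8"],   // TP8, as stated in the summary
      extra_env: { EXAMPLE_FLAG: "1" },              // placeholder env var
    },
  },
};

// Assumed behavior: a variant without an allowlist is unrestricted,
// otherwise the hardware ID must be listed explicitly.
function isVariantHardwareSupported(variant, hwId) {
  const allow = variant?.supported_hardware;
  if (!Array.isArray(allow) || allow.length === 0) return true;
  return allow.includes(hwId);
}

// Exact-match lookup: only the override keyed by this hardware ID applies,
// so settings for one accelerator never leak into commands for another.
function resolveOverrides(variant, hwId) {
  const o = variant?.hardware_overrides?.[hwId] ?? {};
  return {
    docker_image: o.docker_image ?? null,
    extra_args: [...(o.extra_args ?? [])],
    extra_env: { ...(o.extra_env ?? {}) },
  };
}
```

With this shape, `isVariantHardwareSupported(mxfp4Variant, "b200")` is `false`, and `resolveOverrides(mxfp4Variant, "b200")` yields empty arguments rather than the MI355X settings.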

Why

The AMD Quark MXFP4 checkpoint is currently supported only on MI355X. Treating MXFP4 as a generally selectable precision produced invalid commands for NVIDIA and older AMD hardware.

Local verification

Based on vllm-project/vllm#45794.

GSM8K accuracy and performance verified: https://github.com/SemiAnalysisAI/InferenceX/actions/runs/28195297568/job/83520506068?pr=1935

SemiAnalysisAI/InferenceX#1935

User impact

Selecting MXFP4 now selects MI355X automatically. On every other hardware profile, the MXFP4 pill is disabled with an MI355X-only explanation. Generated API data for the promoted MXFP4 checkpoint exposes only MI355X.
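
The selection behavior described above could be sketched roughly as follows. `normalizeHardwareSelection` is a hypothetical helper written for this example, not a function from the repository, and the hardware IDs in the usage note are illustrative.

```javascript
// Hypothetical helper: given a variant, the hardware ID requested (for example
// from a URL), and the list of known hardware IDs, return the ID to select.
function normalizeHardwareSelection(variant, requestedHwId, knownHwIds) {
  const allow = variant.supported_hardware ?? [];
  const known = knownHwIds.includes(requestedHwId);
  const allowed = allow.length === 0 || allow.includes(requestedHwId);

  // Keep the request when it is both a real profile and permitted.
  if (known && allowed) return requestedHwId;

  // Otherwise fall back to the first allowlisted profile that exists,
  // then to the first known profile, then to null.
  const fallback = allow.find((id) => knownHwIds.includes(id));
  return fallback ?? knownHwIds[0] ?? null;
}
```

Under these assumptions, `normalizeHardwareSelection({ supported_hardware: ["mi355x"] }, "b200", ["h100", "b200", "mi355x"])` returns `"mi355x"`, which matches the B200 + MXFP4 URL case listed under Validation.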

Validation

node scripts/build-recipes-api.mjs — 142 models, 116 promoted variants
node --check src/lib/command-synthesis.js
node --check scripts/build-recipes-api.mjs
verified in the local browser that MXFP4 is disabled on B200
verified an invalid B200 + MXFP4 URL normalizes to MI355X
verified the MXFP4 API hardware index and generated files contain only mi355x
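
For the stale-artifact cleanup mentioned in the summary and checked in the last validation step, one possible approach is to diff the files already on disk against the hardware IDs that are still supported. This is only a sketch: the `<hardware-id>.json` naming scheme and the `staleHardwareFiles` helper are assumptions, not the layout used by `scripts/build-recipes-api.mjs`.

```javascript
// Hypothetical: each supported hardware ID is assumed to produce one
// "<id>.json" file in a variant's output directory.
function staleHardwareFiles(existingFiles, supportedHwIds) {
  const keep = new Set(supportedHwIds.map((id) => `${id}.json`));
  // Anything that looks like a generated hardware file but is no longer
  // backed by a supported hardware ID is reported as stale.
  return existingFiles.filter((file) => file.endsWith(".json") && !keep.has(file));
}

// Example: compatibility shrank from three profiles to one.
const stale = staleHardwareFiles(["b200.json", "h100.json", "mi355x.json"], ["mi355x"]);
console.log(stale); // [ 'b200.json', 'h100.json' ]
```

A build script could then delete the returned files before writing fresh output, so a shrinking allowlist does not leave old entries behind.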


vercel · 2026-06-25T20:07:19Z

The latest updates on your projects. Learn more about Vercel for GitHub.

Project	Deployment	Actions	Updated (UTC)
vllm-recipes	Ready	Preview, Comment	Jun 25, 2026 8:11pm

Signed-off-by: functionstackx <47992694+functionstackx@users.noreply.github.com>

gemini-code-assist

Code Review

This pull request introduces support for variant-level hardware allowlists (e.g., supported_hardware) and hardware-specific overrides (such as docker_image, extra_args, and extra_env). It also adds a new mxfp4 variant for the MiniMax-M3 model targeting AMD Instinct MI355X hardware. The review feedback highlights a potential runtime crash when handling invalid hardware IDs in the URL query parameters, and suggests improving consistency in the UI by using the variant's label instead of just its precision when displaying disabled hardware reasons.

gemini-code-assist · 2026-06-25T20:09:17Z

+  const requestedHwAllowed = requestedHwId
+    && isPrecisionCompatible(requestedHwProfile, requestedVariant)
+    && isHardwareSupported(recipe, requestedHwId)
+    && isVariantHardwareSupported(requestedVariant, requestedHwId);


If requestedHwId is an invalid hardware profile ID (e.g., passed via URL query parameters), taxonomy.hardware_profiles?.[requestedHwId] will be undefined. Since requestedHwId is truthy, requestedHwAllowed will proceed to evaluate isPrecisionCompatible(requestedHwProfile, requestedVariant). If the variant has a precision constraint, matchesConstraint will attempt to access profile.brand on undefined (or {} which doesn't crash but returns true if constraint is undefined, leading to hwId being set to an invalid ID). This can cause runtime crashes or broken UI state. We should explicitly verify that requestedHwId exists in taxonomy.hardware_profiles before allowing it.

Suggested change

-  const requestedHwAllowed = requestedHwId
-    && isPrecisionCompatible(requestedHwProfile, requestedVariant)
-    && isHardwareSupported(recipe, requestedHwId)
-    && isVariantHardwareSupported(requestedVariant, requestedHwId);
+  const requestedHwAllowed = requestedHwId
+    && !!taxonomy.hardware_profiles?.[requestedHwId]
+    && isPrecisionCompatible(requestedHwProfile, requestedVariant)
+    && isHardwareSupported(recipe, requestedHwId)
+    && isVariantHardwareSupported(requestedVariant, requestedHwId);
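
The failure mode in this comment can be reproduced in isolation. The sketch below uses stand-ins: `matchesConstraint` here is a simplified guess at the real helper, and the `hardware_constraint` field, the sample taxonomy, and `isRequestedHardwareAllowed` are hypothetical. It also checks for an own property rather than truthiness, which goes slightly beyond the suggested `!!taxonomy.hardware_profiles?.[requestedHwId]`, because on a plain object an ID such as `toString` would otherwise resolve to an inherited method.

```javascript
// Hypothetical taxonomy and variant, for illustration only.
const taxonomy = {
  hardware_profiles: {
    mi355x: { brand: "amd" },
    b200: { brand: "nvidia" },
  },
};
const requestedVariant = {
  precision: "mxfp4",
  supported_hardware: ["mi355x"],
  hardware_constraint: { brand: "amd" }, // assumed field name
};

// Simplified stand-in: reads profile.brand, so an undefined profile would throw.
function matchesConstraint(profile, constraint) {
  if (!constraint) return true;
  return profile.brand === constraint.brand;
}

function isRequestedHardwareAllowed(requestedHwId, variant) {
  if (!requestedHwId) return false;
  const profiles = taxonomy.hardware_profiles ?? {};
  // Guard first: unknown IDs (for example from a hand-edited URL) are rejected
  // before any helper dereferences the profile.
  if (!Object.prototype.hasOwnProperty.call(profiles, requestedHwId)) return false;
  const profile = profiles[requestedHwId];
  return matchesConstraint(profile, variant.hardware_constraint)
    && (variant.supported_hardware ?? []).includes(requestedHwId);
}
```

Without the guard, `isRequestedHardwareAllowed("not-a-gpu", requestedVariant)` would reach `matchesConstraint` with an undefined profile and throw a `TypeError`; with it, the call simply returns `false`.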

gemini-code-assist · 2026-06-25T20:09:18Z

+                        const reason = !variantHardwareOk
+                          ? `${currentVariant.precision?.toUpperCase()} is only supported on ${(currentVariant.supported_hardware || []).map((hw) => taxonomy.hardware_profiles?.[hw]?.display_name || hw).join(", ")}`
+                          : !precisionOk
                          ? `${currentVariant.precision?.toUpperCase()} requires NVIDIA Blackwell`


For consistency with other parts of the component (such as line 1516) and to support custom variant labels correctly, we should use (currentVariant.label || currentVariant.precision) instead of just currentVariant.precision when rendering the error message.

Suggested change

-  const reason = !variantHardwareOk
-    ? `${currentVariant.precision?.toUpperCase()} is only supported on ${(currentVariant.supported_hardware || []).map((hw) => taxonomy.hardware_profiles?.[hw]?.display_name || hw).join(", ")}`
-    : !precisionOk
-    ? `${currentVariant.precision?.toUpperCase()} requires NVIDIA Blackwell`
+  const reason = !variantHardwareOk
+    ? `${(currentVariant.label || currentVariant.precision)?.toUpperCase()} is only supported on ${(currentVariant.supported_hardware || []).map((hw) => taxonomy.hardware_profiles?.[hw]?.display_name || hw).join(", ")}`
+    : !precisionOk
+    ? `${(currentVariant.label || currentVariant.precision)?.toUpperCase()} requires NVIDIA Blackwell`
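
Pulling the message into a small helper makes the label fallback easier to see and to reuse in both places the review flags. `variantDisabledReason` is a hypothetical name, and the sample profile data is made up for the example.

```javascript
// Hypothetical helper that builds the "only supported on" message with the
// label-or-precision fallback suggested in the review.
function variantDisabledReason(variant, hardwareProfiles) {
  const name = String(variant.label || variant.precision || "").toUpperCase();
  const targets = (variant.supported_hardware || [])
    .map((hw) => hardwareProfiles?.[hw]?.display_name || hw) // fall back to the raw ID
    .join(", ");
  return `${name} is only supported on ${targets}`;
}

const profiles = { mi355x: { display_name: "MI355X" } };
console.log(variantDisabledReason({ precision: "mxfp4", supported_hardware: ["mi355x"] }, profiles));
// MXFP4 is only supported on MI355X
```

One thing to weigh: calling `toUpperCase()` on a custom label also upper-cases any mixed-case text in it, so a label such as `MXFP4 (Quark)` would render as `MXFP4 (QUARK)`.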

gemini-code-assist · 2026-06-25T20:09:18Z

+                      const reason = !variantHardwareOk
+                        ? `${currentVariant.precision?.toUpperCase()} is only supported on ${(currentVariant.supported_hardware || []).map((hw) => taxonomy.hardware_profiles?.[hw]?.display_name || hw).join(", ")}`
+                        : !precisionOk
                        ? `${currentVariant.precision?.toUpperCase()} requires NVIDIA Blackwell`


For consistency with other parts of the component (such as line 1516) and to support custom variant labels correctly, we should use (currentVariant.label || currentVariant.precision) instead of just currentVariant.precision when rendering the error message.

Suggested change

-  const reason = !variantHardwareOk
-    ? `${currentVariant.precision?.toUpperCase()} is only supported on ${(currentVariant.supported_hardware || []).map((hw) => taxonomy.hardware_profiles?.[hw]?.display_name || hw).join(", ")}`
-    : !precisionOk
-    ? `${currentVariant.precision?.toUpperCase()} requires NVIDIA Blackwell`
+  const reason = !variantHardwareOk
+    ? `${(currentVariant.label || currentVariant.precision)?.toUpperCase()} is only supported on ${(currentVariant.supported_hardware || []).map((hw) => taxonomy.hardware_profiles?.[hw]?.display_name || hw).join(", ")}`
+    : !precisionOk
+    ? `${(currentVariant.label || currentVariant.precision)?.toUpperCase()} requires NVIDIA Blackwell`

hongxiayang · 2026-06-25T20:23:48Z

Suggest updating the PR title: replace [codex] with [AMD] or [ROCm].

functionstackx added 5 commits June 25, 2026 16:08

Add amd/MiniMax-M3-MXFP4 variant

fd85217

Signed-off-by: functionstackx <47992694+functionstackx@users.noreply.github.com>

Scope MiniMax M3 MXFP4 overrides to MI355X

64d3d36

Signed-off-by: functionstackx <47992694+functionstackx@users.noreply.github.com>

Restrict MiniMax M3 MXFP4 to MI355X

e46f2d6

Signed-off-by: functionstackx <47992694+functionstackx@users.noreply.github.com>

Trim MiniMax M3 MXFP4 guide

7ca7218

Signed-off-by: functionstackx <47992694+functionstackx@users.noreply.github.com>

Trim MiniMax M3 MXFP4 references

190b98e

Signed-off-by: functionstackx <47992694+functionstackx@users.noreply.github.com>

functionstackx force-pushed the codex/add-minimax-m3-mxfp4 branch from 2d841ac to 190b98e Compare June 25, 2026 20:08

gemini-code-assist Bot reviewed Jun 25, 2026

View reviewed changes

functionstackx marked this pull request as ready for review June 25, 2026 20:10

vercel Bot deployed to Preview June 25, 2026 20:11 View deployment

functionstackx changed the title ~~[codex] Add MI355X-only MiniMax-M3 MXFP4 variant~~ Add MI355X-only MiniMax-M3 MXFP4 variant Jun 25, 2026

functionstackx changed the title ~~Add MI355X-only MiniMax-M3 MXFP4 variant~~ [ROCm] Add MI355X-only MiniMax-M3 MXFP4 variant Jun 25, 2026

ywang96 approved these changes Jun 25, 2026

View reviewed changes

ywang96 merged commit e2f883e into vllm-project:main Jun 25, 2026
4 checks passed

hongxiayang mentioned this pull request Jun 27, 2026

Add MiniMax-M3 MXFP4 (AMD) variant #579

Open

3 tasks

ywang96 merged 5 commits into vllm-project:main from functionstackx:codex/add-minimax-m3-mxfp4
