Update Mellum2 Instruct serve command by haic0 · Pull Request #590 · vllm-project/recipes

haic0 · 2026-06-29T13:42:16Z

Summary

Add trust-remote-code to the Mellum2 Instruct recipe and show the TP=1 launch command.
Aligns the recipe launch guidance with the provided vLLM serve command.

Test plan

Ran node scripts/build-recipes-api.mjs on the complete validated recipe update set.

Made with Cursor

Add trust-remote-code to the Mellum2 Instruct recipe and show the TP=1 launch command. Signed-off-by: haic0 <haichzha@amd.com> Co-authored-by: Cursor <cursoragent@cursor.com>

vercel · 2026-06-29T13:42:22Z

The latest updates on your projects. Learn more about Vercel for GitHub.

Project	Deployment	Actions	Updated (UTC)
vllm-recipes	Ready	Preview, Comment	Jun 29, 2026 1:44pm

gemini-code-assist

Code Review

This pull request updates the configuration for the Mellum2-12B-A2.5B-Instruct model, adding hardware overrides and replacing the --max-model-len argument with --trust-remote-code in both the base arguments and the guide commands. The reviewer feedback correctly points out that --max-model-len should be retained alongside --trust-remote-code to prevent potential out-of-memory errors during KV cache allocation due to the model's large context length.

gemini-code-assist · 2026-06-29T13:43:02Z

  base_args:
-    - "--max-model-len"
-    - "131072"
+    - "--trust-remote-code"


It looks like --max-model-len was accidentally replaced by --trust-remote-code instead of appending it. For models with large context lengths like Mellum2 (131,072), explicitly setting --max-model-len is important to prevent out-of-memory errors during KV cache allocation in vLLM. We should keep both arguments.

    - "--max-model-len"
    - "131072"
    - "--trust-remote-code"
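To make the KV cache concern concrete, here is a rough back-of-the-envelope sketch of how per-sequence KV cache size scales with context length. The layer count, KV head count, and head dimension below are hypothetical placeholders, not Mellum2's actual configuration, and the formula assumes plain grouped-query attention with a 2-byte cache dtype and ignores paging overhead.

```python
def kv_cache_bytes_per_token(num_layers, num_kv_heads, head_dim, dtype_bytes=2):
    """Bytes of KV cache one token occupies across all layers."""
    # Factor 2: each layer stores one key tensor and one value tensor.
    return 2 * num_layers * num_kv_heads * head_dim * dtype_bytes


def kv_cache_gib(context_len, **dims):
    """GiB of KV cache needed for a single sequence of context_len tokens."""
    return context_len * kv_cache_bytes_per_token(**dims) / 2**30


# Hypothetical dimensions for illustration only; read the real values
# from the model's config.json before drawing conclusions.
HYPOTHETICAL_DIMS = {"num_layers": 32, "num_kv_heads": 8, "head_dim": 128}

for context_len in (8192, 32768, 131072):
    gib = kv_cache_gib(context_len, **HYPOTHETICAL_DIMS)
    print(f"{context_len:>7} tokens -> {gib:.1f} GiB per sequence")
```

Under these assumed dimensions a single full-length 131,072-token sequence needs 16x the cache of an 8,192-token one, which is why the context cap is worth stating explicitly in the recipe.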

gemini-code-assist · 2026-06-29T13:43:02Z

+    --trust-remote-code \
+    --tensor-parallel-size 1


To align with the base_args and ensure the launch command is robust, we should retain the --max-model-len 131072 parameter in the guide's serving commands.

    --max-model-len 131072 \
    --trust-remote-code \
    --tensor-parallel-size 1
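One way to catch this kind of drift between base_args and the guide commands is a small consistency check. The sketch below is an illustration only, not part of the recipes build script: `missing_base_args` is a hypothetical helper, and `your-org/your-model` is a placeholder model ID.

```python
import shlex


def missing_base_args(base_args, command):
    """Return the base_args tokens that do not appear in the shell command."""
    # Drop backslash-newline continuations, then tokenize like a POSIX shell.
    tokens = shlex.split(command.replace("\\\n", " "))
    return [arg for arg in base_args if arg not in tokens]


# base_args as proposed in the review suggestion.
base_args = ["--max-model-len", "131072", "--trust-remote-code"]

# Guide command as changed in this PR (placeholder model ID).
pr_command = """vllm serve your-org/your-model \\
    --trust-remote-code \\
    --tensor-parallel-size 1"""

# Guide command with the reviewer's suggestion applied.
suggested_command = """vllm serve your-org/your-model \\
    --max-model-len 131072 \\
    --trust-remote-code \\
    --tensor-parallel-size 1"""

print(missing_base_args(base_args, pr_command))
print(missing_base_args(base_args, suggested_command))
```

The first call reports the two dropped tokens, and the second returns an empty list. The check is token-based, so it does not verify that a value still follows its flag.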

gemini-code-assist · 2026-06-29T13:43:02Z

+    --trust-remote-code \
+    --tensor-parallel-size 1 \


Similarly, we should retain --max-model-len 131072 in the tool-calling launch command example.

    --max-model-len 131072 \
    --trust-remote-code \
    --tensor-parallel-size 1 \
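If the basic and tool-calling commands were generated from one place, the shared flags could not diverge. A minimal sketch, assuming a hypothetical `build_serve_command` helper and a placeholder model ID; the tool-calling flags themselves are not shown in this review, so they are left as an empty `extra_args` hook rather than guessed.

```python
import shlex


def build_serve_command(model_id, max_model_len=131072,
                        tensor_parallel_size=1, extra_args=()):
    """Build a vllm serve argv list with the shared flags always present."""
    cmd = [
        "vllm", "serve", model_id,
        "--max-model-len", str(max_model_len),
        "--trust-remote-code",
        "--tensor-parallel-size", str(tensor_parallel_size),
    ]
    # Variant-specific flags (for example, tool calling) are appended last.
    cmd.extend(extra_args)
    return cmd


# Placeholder model ID; substitute the recipe's actual model.
cmd = build_serve_command("your-org/your-model")
print(shlex.join(cmd))
```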

Update Mellum2 Instruct serve command

843765f

Add trust-remote-code to the Mellum2 Instruct recipe and show the TP=1 launch command. Signed-off-by: haic0 <haichzha@amd.com> Co-authored-by: Cursor <cursoragent@cursor.com>

gemini-code-assist Bot reviewed Jun 29, 2026

vercel Bot deployed to Preview June 29, 2026 13:44 View deployment

Update Mellum2 Instruct serve command #590
haic0 wants to merge 1 commit into vllm-project:main from haic0:haic0/update-mellum2-instruct-command
