Update Laguna XS.2 serve command by haic0 · Pull Request #596 · vllm-project/recipes

haic0 · 2026-06-29T13:42:54Z

Summary

Adjusts the Laguna XS.2 recipe to use the provided launch command, which enables trust-remote-code and sets TP=8.
Aligns the recipe's launch guidance with that vLLM serve command.
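
For orientation, the launch command described here would take roughly the following shape. This is only a sketch: the PR text does not show the exact command, so MODEL_ID is a placeholder rather than the real model identifier, and build_serve_command is a hypothetical helper. Only the two settings named in the summary (--trust-remote-code and --tensor-parallel-size) are included.

```python
import shlex

# Placeholder: the PR text does not state the model identifier.
MODEL_ID = "<laguna-xs2-model-id>"


def build_serve_command(model_id: str, tp: int = 8) -> str:
    """Assemble a `vllm serve` command line with the two settings named in the summary."""
    argv = [
        "vllm", "serve", model_id,
        "--trust-remote-code",
        "--tensor-parallel-size", str(tp),
    ]
    # shlex.join quotes any argument that is not shell-safe.
    return shlex.join(argv)


if __name__ == "__main__":
    print(build_serve_command(MODEL_ID))
```

Any further flags the real recipe may pass are not known from this page and are left out.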

Test plan

Ran node scripts/build-recipes-api.mjs on the complete validated recipe update set.

Made with Cursor

Adjust Laguna XS.2 to the provided trust-remote-code and TP=8 launch command.

Signed-off-by: haic0 <haichzha@amd.com>
Co-authored-by: Cursor <cursoragent@cursor.com>

vercel · 2026-06-29T13:43:00Z

The latest updates on your projects. Learn more about Vercel for GitHub.

Project	Deployment	Actions	Updated (UTC)
vllm-recipes	Ready	Preview, Comment	Jun 29, 2026 1:53pm

gemini-code-assist

Code Review

This pull request updates the configuration and deployment guide for the Laguna-XS.2 model to use an 8-GPU tensor parallel (TP=8) launch configuration. Feedback on these changes highlights that hardcoding tp: 8 in the strategy overrides will generate invalid launch commands for users selecting hardware profiles with fewer than 8 GPUs, especially since the model can fit on a single high-memory GPU. Additionally, it is recommended to revise the variant description to avoid referencing "the guide" directly, as this metadata is displayed in UI tooltips.

gemini-code-assist · 2026-06-29T13:43:59Z

 strategy_overrides:
  single_node_tp:
-    tp: 1
+    tp: 8


Hardcoding tp: 8 in strategy_overrides forces the tensor parallel size to 8 for all single-node TP deployments in the interactive command builder.

This will generate invalid launch commands (with --tensor-parallel-size 8) when a user selects a hardware profile with fewer than 8 GPUs (such as a single H200, which is the verified hardware for this model).

Since the model's weights (~66GB in BF16) can fit and run on a single high-memory GPU like an H200 (141GB) or H100 (80GB), consider keeping this at tp: 1 (or omitting it if the command builder supports auto-scaling) to allow the command builder to dynamically scale the TP size based on the selected hardware profile.
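
The memory argument in this paragraph can be checked with simple arithmetic. The sketch below uses only the figures quoted above (about 66 GB of BF16 weights, 80 GB and 141 GB of GPU memory). It assumes the weights shard evenly across TP ranks and ignores KV cache, activations, and runtime overhead, so the results are upper bounds on usable headroom, not a sizing guarantee. headroom_gb is a hypothetical helper.

```python
# Figures quoted in the review comment (GB).
WEIGHTS_GB = 66
GPU_MEMORY_GB = {"H100": 80, "H200": 141}


def headroom_gb(gpu_memory_gb: float, weights_gb: float, tp: int = 1) -> float:
    """Memory left per GPU after loading an even 1/tp shard of the weights.

    Assumption: weights split evenly across tensor-parallel ranks.
    KV cache, activations, and framework overhead are not counted.
    """
    return gpu_memory_gb - weights_gb / tp


if __name__ == "__main__":
    for name, mem in GPU_MEMORY_GB.items():
        print(f"{name}: {headroom_gb(mem, WEIGHTS_GB):.2f} GB left at TP=1")
```

Under these assumptions a single 80 GB card is left with about 14 GB after the weights are loaded, while a 141 GB card is left with about 75 GB. Both fit the weights, but the remaining room for KV cache differs considerably.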

tp: 1
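
The concern in this comment, that a fixed TP size can exceed the GPU count of the selected hardware profile, can be illustrated with a small check. This is a sketch under stated assumptions, not the command builder's actual logic: resolve_tp is a hypothetical helper, and falling back to the profile's GPU count when no override is given is an assumed form of the "auto-scaling" the comment mentions.

```python
from typing import Optional


def resolve_tp(override_tp: Optional[int], gpu_count: int) -> int:
    """Choose a tensor-parallel size for a hardware profile (hypothetical helper).

    - No override: fall back to the profile's GPU count (assumed behaviour).
    - Override larger than the GPU count: reject, since the resulting
      --tensor-parallel-size could not be satisfied on that profile.
    """
    if gpu_count < 1:
        raise ValueError("hardware profile must have at least one GPU")
    if override_tp is None:
        return gpu_count
    if override_tp > gpu_count:
        raise ValueError(
            f"tp={override_tp} exceeds the {gpu_count} GPU(s) in this profile"
        )
    return override_tp


if __name__ == "__main__":
    print(resolve_tp(1, gpu_count=1))     # override of 1 on a single-GPU profile
    print(resolve_tp(None, gpu_count=8))  # no override on an 8-GPU profile
    try:
        resolve_tp(8, gpu_count=1)        # the hardcoded case the review flags
    except ValueError as err:
        print(f"rejected: {err}")
```

With an override of 8, only profiles with at least eight GPUs pass the check; an override of 1 passes on every profile.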

gemini-code-assist · 2026-06-29T13:43:59Z

    precision: bf16
    vram_minimum_gb: 80
-    description: "BF16 weights — fits on a single 80GB+ GPU (H100/H200/B200)"
+    description: "BF16 weights; the guide shows the requested TP=8 launch command"


Referencing "the guide" inside the variant's description metadata is a bit confusing for users interacting with the command builder UI, as this description is displayed directly in the UI tooltips/pills.

Consider keeping the description focused on the technical details of the variant, or mention the recommended TP size more generally.

description: "BF16 weights — fits on a single 80GB+ GPU, TP=8 recommended for full context"

Update Laguna XS.2 serve command

b974952


gemini-code-assist Bot reviewed Jun 29, 2026


vercel Bot deployed to Preview June 29, 2026 13:53 View deployment

haic0 wants to merge 1 commit into vllm-project:main from haic0:haic0/update-laguna-xs2-command
