Update Ling 2.6 Flash ROCm command by haic0 · Pull Request #594 · vllm-project/recipes

haic0 · 2026-06-29T13:42:42Z

Summary

Adjust Ling 2.6 Flash to the provided TP=2 ROCm launch command.
Aligns the recipe launch guidance with the provided vLLM serve command.

Test plan

Ran node scripts/build-recipes-api.mjs on the complete validated recipe update set.

Made with Cursor

Adjust Ling 2.6 Flash to the provided TP=2 ROCm launch command.

Signed-off-by: haic0 <haichzha@amd.com>
Co-authored-by: Cursor <cursoragent@cursor.com>

vercel · 2026-06-29T13:42:47Z

The latest updates on your projects. Learn more about Vercel for GitHub.

Project	Deployment	Actions	Updated (UTC)
vllm-recipes	Ready	Preview, Comment	Jun 29, 2026 1:50pm

gemini-code-assist

Code Review

This pull request updates the configuration for inclusionAI/Ling-2.6-flash.yaml to reduce the default tensor parallel size from 4 to 2, updates the strategy overrides and description accordingly, and adds a ROCm vLLM serve command to the guide. The reviewer pointed out an issue in the Docker run command where prepending vllm serve to the arguments will break container startup due to the image's existing entrypoint.

gemini-code-assist · 2026-06-29T13:43:26Z

    -e VLLM_ROCM_USE_AITER=1 \
    vllm/vllm-openai-rocm:v0.20.2 \
-      inclusionAI/Ling-2.6-flash \
+      vllm serve inclusionAI/Ling-2.6-flash \


Prepending vllm serve to the Docker command arguments will cause the container to fail at startup. The official vllm/vllm-openai-rocm image already defines an entrypoint (such as python3 -m vllm.entrypoints.openai.api_server or vllm serve). Passing vllm serve as arguments appends them to the entrypoint, resulting in an invalid command execution (e.g., vllm serve vllm serve ... or api_server vllm serve ...).

To fix this, we should revert to passing only the model ID and its arguments, as the entrypoint already handles the execution of the server.

inclusionAI/Ling-2.6-flash \
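
The failure mode described in this review can be sketched in a few lines of Python. With an exec-form ENTRYPOINT, Docker appends everything written after the image name in `docker run` to the entrypoint, so a repeated `vllm serve` ends up inside the final argument vector. The entrypoint value below is an assumption (the review itself names two candidates), `effective_argv` is a hypothetical helper written for illustration, and the `--tensor-parallel-size 2` flag stands in for the TP=2 setting from the PR summary rather than reproducing the recipe's full command.

```python
# Minimal model of how Docker builds the container's argv.
# ASSUMPTION: the image uses an exec-form ENTRYPOINT of ["vllm", "serve"];
# the review also mentions python3 -m vllm.entrypoints.openai.api_server as a
# possibility. The duplication problem is the same in either case.
ASSUMED_ENTRYPOINT = ["vllm", "serve"]
MODEL = "inclusionAI/Ling-2.6-flash"


def effective_argv(entrypoint, run_args):
    """Return ENTRYPOINT followed by the arguments given after the image name."""
    return list(entrypoint) + list(run_args)


# Arguments as written in the PR diff: "vllm serve" is repeated.
broken = effective_argv(
    ASSUMED_ENTRYPOINT,
    ["vllm", "serve", MODEL, "--tensor-parallel-size", "2"],
)

# Arguments as suggested by the reviewer: model ID and flags only.
fixed = effective_argv(
    ASSUMED_ENTRYPOINT,
    [MODEL, "--tensor-parallel-size", "2"],
)

print("broken:", " ".join(broken))
print("fixed: ", " ".join(fixed))
```

Under the assumed entrypoint, the first case passes the literal string `vllm` where the model ID is expected, while the second case places the model ID directly after `vllm serve`. Inspecting the image's actual entrypoint (for example with `docker inspect`) would confirm which form applies.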

Update Ling 2.6 Flash ROCm command

9bece68


gemini-code-assist Bot reviewed Jun 29, 2026


vercel Bot deployed to Preview June 29, 2026 13:50 View deployment

Update Ling 2.6 Flash ROCm command #594
haic0 wants to merge 1 commit into vllm-project:main from haic0:haic0/update-ling26-flash-rocm-command
