Skip to content

Commit 5310d13

Browse files
committed
Add max_tokens cap on gpt oss 20b
1 parent 1542fbb commit 5310d13

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

helm/vllm/values.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -16,7 +16,7 @@ vllm:
1616
- /bin/sh
1717
- -c
1818
args:
19-
- "vllm serve openai/gpt-oss-20b --trust-remote-code --enable-chunked-prefill --enable-auto-tool-choice --tool-call-parser openai --reasoning-parser openai_gptoss"
19+
- "vllm serve openai/gpt-oss-20b --trust-remote-code --enable-chunked-prefill --enable-auto-tool-choice --tool-call-parser openai --reasoning-parser openai_gptoss --max_tokens 10000"
2020
gpu:
2121
enabled: true
2222
type: "nvidia"

0 commit comments

Comments
 (0)