-
-
Notifications
You must be signed in to change notification settings - Fork 10.6k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Fix incorrect string formatting in barrier timeout exceptions
#27149
opened Oct 18, 2025 by
hyongtao-code
Loading…
4 tasks
[Bugfix][Core] Fix xgrammar import failure on unsupported platforms
structured-output
v1
#27148
opened Oct 18, 2025 by
ihb2032
Loading…
[torch.compile] Enable silu_mul_fp8_quant fusion without custom ops enabled
#27146
opened Oct 18, 2025 by
ZJY0516
Loading…
5 tasks
[Model][3/N] Improve all pooling task | Support chunked prefill with ALL pooling
frontend
v1
#27145
opened Oct 18, 2025 by
noooop
Loading…
5 tasks
[Bugfix] fixes the decoding metadata of dense mla's fp8 kvcache.
ci/build
v1
#27144
opened Oct 18, 2025 by
sighingnow
Loading…
[NIXL] use Host buffer to support TP_ratio > 1 for XPU
kv-connector
#27140
opened Oct 18, 2025 by
xuechendi
Loading…
5 tasks
[Fix][Spec Decode] Fix llama4 draft loading with different quantization
llama
Related to Llama models
speculative-decoding
#27136
opened Oct 18, 2025 by
linzebing
Loading…
3 of 5 tasks
[Bugfix] Fix incorrect kv cache metrics in grafana.json
documentation
Improvements or additions to documentation
#27133
opened Oct 17, 2025 by
fangpings
Loading…
5 tasks
Early exit for MoE LoRA kernels
ci/build
deepseek
Related to DeepSeek models
gpt-oss
Related to GPT-OSS models
needs-rebase
qwen
Related to Qwen models
[Minor] Add some clarifying comments to recent changes
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#27130
opened Oct 17, 2025 by
njhill
Loading…
[BugFix] bugfix for Flash Attention MLA with full cuda graph IMA following pr-25490
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#27128
opened Oct 17, 2025 by
Daisy-Ma-coder
Loading…
[compile] Enable sequence parallelism matching w/o custom ops enabled
#27126
opened Oct 17, 2025 by
angelayi
Loading…
make flash_attn ViT upgrade opt-in
ci/build
ci-failure
Issue about an unexpected test failure in CI
qwen
Related to Qwen models
rocm
Related to AMD ROCm
#27124
opened Oct 17, 2025 by
bradleyhd
Loading…
[BugFix] Disable fp8 kv-cache by default for DeepSeek V3.2
deepseek
Related to DeepSeek models
ready
ONLY add when PR is ready to merge/full CI is needed
[Bugfix] Fix allocation & free logic of SingleWriterShmRingBuffer
#27117
opened Oct 17, 2025 by
imkero
Loading…
5 tasks
[BugFix] Fix failing gemma-3-1b-it test:
test_lm_eval_accuracy_v1_engine[google/gemma-3-1b-it]
ci/build
#27111
opened Oct 17, 2025 by
LucasWilkinson
Loading…
Add missing opentelemetry dependency to base docker image
ci/build
#27109
opened Oct 17, 2025 by
Aymendje
Loading…
3 of 5 tasks
Previous Next
ProTip!
Updated in the last three days: updated:>2025-10-15.