Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[Core] Remove V0 executors frontend tpu Related to Google TPUs v1
#27142 opened Oct 18, 2025 by njhill Draft
[BugFix] fix graph partition signature
#27139 opened Oct 18, 2025 by BoyuanFeng Loading…
Use pydantic validation in speculative.py config
#27137 opened Oct 18, 2025 by Navya1707 Loading…
[Fix][Spec Decode] Fix llama4 draft loading with different quantization llama Related to Llama models speculative-decoding
#27136 opened Oct 18, 2025 by linzebing Loading…
3 of 5 tasks
feat: enable FlashInfer FP8 Blockscale on SM90
#27134 opened Oct 18, 2025 by djmmoss Draft
1 of 3 tasks
[Bugfix] Fix incorrect kv cache metrics in grafana.json documentation Improvements or additions to documentation
#27133 opened Oct 17, 2025 by fangpings Loading…
5 tasks
Early exit for MoE LoRA kernels ci/build deepseek Related to DeepSeek models gpt-oss Related to GPT-OSS models needs-rebase qwen Related to Qwen models
#27131 opened Oct 17, 2025 by gnovack Draft
5 tasks
[Minor] Add some clarifying comments to recent changes ready ONLY add when PR is ready to merge/full CI is needed v1
#27130 opened Oct 17, 2025 by njhill Loading…
[BugFix] bugfix for Flash Attention MLA with full cuda graph IMA following pr-25490 ready ONLY add when PR is ready to merge/full CI is needed v1
#27128 opened Oct 17, 2025 by Daisy-Ma-coder Loading…
make flash_attn ViT upgrade opt-in ci/build ci-failure Issue about an unexpected test failure in CI qwen Related to Qwen models rocm Related to AMD ROCm
#27124 opened Oct 17, 2025 by bradleyhd Loading…
[Kernels] Swap quant method needs-rebase
#27123 opened Oct 17, 2025 by bnellnm Loading…
[BugFix] Disable fp8 kv-cache by default for DeepSeek V3.2 deepseek Related to DeepSeek models ready ONLY add when PR is ready to merge/full CI is needed
#27121 opened Oct 17, 2025 by LucasWilkinson Loading… v0.11.1
[Bugfix] Fix allocation & free logic of SingleWriterShmRingBuffer
#27117 opened Oct 17, 2025 by imkero Loading…
5 tasks
Add missing opentelemetry dependency to base docker image ci/build
#27109 opened Oct 17, 2025 by Aymendje Loading…
3 of 5 tasks
ProTip! Updated in the last three days: updated:>2025-10-15.