Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[Doc]: fix typos in various files documentation Improvements or additions to documentation v1
#28567 opened Nov 12, 2025 by didier-durand Loading…
[Model] [Config] Correctly identify granite-4.0-micro as non-hybrid model ready ONLY add when PR is ready to merge/full CI is needed
#28563 opened Nov 12, 2025 by tdoublep Loading…
3 of 5 tasks
[Bugfix] Fix SM100 gpt-oss regression due to faulty attn sink support gpt-oss Related to GPT-OSS models nvidia ready ONLY add when PR is ready to merge/full CI is needed v1
#28561 opened Nov 12, 2025 by mgoin Loading…
5 tasks
[CI] Skip "Multi-Modal Models Test (Extended) 3" test that's broken in current Transformers multi-modality Related to multi-modality (#4194) ready ONLY add when PR is ready to merge/full CI is needed
#28559 opened Nov 12, 2025 by hmellor Loading…
[BugFix] Priority scheduling and spec tokens preemption ready ONLY add when PR is ready to merge/full CI is needed v1
#28558 opened Nov 12, 2025 by andylolu2 Loading…
Add NUMA node validation for CPU thread binding
#28555 opened Nov 12, 2025 by usberkeley Loading…
5 tasks
[Misc] Resolve Starlette CVE ci/build
#28554 opened Nov 12, 2025 by shernshiou Loading…
1 of 5 tasks
[KV Connector] Test async mode in scheduler tests kv-connector ready ONLY add when PR is ready to merge/full CI is needed v1
#28550 opened Nov 12, 2025 by markmc Loading…
Add support for Eagle with separate lm-head and embed_tokens layers deepseek Related to DeepSeek models llama Related to Llama models speculative-decoding v1
#28549 opened Nov 12, 2025 by eldarkurtic Loading…
[LoRA][2/N]Remove LoRA extra vocab
#28545 opened Nov 12, 2025 by jeejeelee Draft
5 tasks
[CI] Add non-eager test-case for SharedStorageConnector kv-connector ready ONLY add when PR is ready to merge/full CI is needed v1
#28544 opened Nov 12, 2025 by NickLucche Loading…
Update rope_scaling to rope_parameters in preparation for Transformers v5 documentation Improvements or additions to documentation gpt-oss Related to GPT-OSS models llama Related to Llama models performance Performance-related issues qwen Related to Qwen models speculative-decoding
#28542 opened Nov 12, 2025 by hmellor Draft
Feat/support nemotron h mtp needs-rebase new-model Requests to new models qwen Related to Qwen models speculative-decoding v1
#28541 opened Nov 12, 2025 by shaharmor98 Draft
5 tasks
[Performance] Fuse DeepSeek shared experts and gate operations ci/build deepseek Related to DeepSeek models performance Performance-related issues
#28540 opened Nov 12, 2025 by Red-Caesar Loading…
Fix KV sharing fast prefill with cudagraph enabled nvidia v1
#28537 opened Nov 12, 2025 by sarckk Loading…
3 of 5 tasks
[CI Failure] Fix backend selection for encoder-only models needs-rebase nvidia rocm Related to AMD ROCm tpu Related to Google TPUs v1
#28534 opened Nov 12, 2025 by hl475 Loading…
5 tasks
[Doc] Update plugin doc documentation Improvements or additions to documentation v1
#28532 opened Nov 12, 2025 by wangxiyuan Loading…
5 tasks
[Frontend] supports interleaved thinking documentation Improvements or additions to documentation frontend ready ONLY add when PR is ready to merge/full CI is needed tool-calling
#28531 opened Nov 12, 2025 by chaunceyjiang Loading…
5 tasks
[Docs] Clean up moe_kernel_features.md documentation Improvements or additions to documentation
#28530 opened Nov 12, 2025 by windsonsea Loading…
ProTip! Adding no:label will show everything without a label.