Pull requests: vllm-project/vllm
Pull requests list
[Doc]: fix typos in various files
documentation
Improvements or additions to documentation
v1
#28567
opened Nov 12, 2025 by
didier-durand
[Model] [Config] Correctly identify granite-4.0-micro as non-hybrid model
ready
ONLY add when PR is ready to merge/full CI is needed
#28563
opened Nov 12, 2025 by
tdoublep
3 of 5 tasks
[CI] Skip "Multi-Modal Models Test (Extended) 3" test that's broken in current Transformers
multi-modality
Related to multi-modality (#4194)
ready
ONLY add when PR is ready to merge/full CI is needed
#28559
opened Nov 12, 2025 by
hmellor
[BugFix] Priority scheduling and spec tokens preemption
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#28558
opened Nov 12, 2025 by
andylolu2
Add NUMA node validation for CPU thread binding
#28555
opened Nov 12, 2025 by
usberkeley
5 tasks
[Misc] Resolve Starlette CVE
ci/build
#28554
opened Nov 12, 2025 by
shernshiou
1 of 5 tasks
Update xformers to 0.0.32.post2 instead of dev commit
ci/build
nvidia
#28551
opened Nov 12, 2025 by
EmilienM
[KV Connector] Test async mode in scheduler tests
kv-connector
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#28550
opened Nov 12, 2025 by
markmc
Add support for Eagle with separate lm-head and embed_tokens layers
deepseek
Related to DeepSeek models
llama
Related to Llama models
speculative-decoding
v1
#28549
opened Nov 12, 2025 by
eldarkurtic
[CI] Add non-eager test-case for SharedStorageConnector
kv-connector
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#28544
opened Nov 12, 2025 by
NickLucche
[Bugfix][Model] Prevent special token leakage in KimiK2ToolParser streaming mode
documentation
Improvements or additions to documentation
frontend
tool-calling
#28543
opened Nov 12, 2025 by
jscaldwell55
Update rope_scaling to rope_parameters in preparation for Transformers v5
documentation
Improvements or additions to documentation
gpt-oss
Related to GPT-OSS models
llama
Related to Llama models
performance
Performance-related issues
qwen
Related to Qwen models
speculative-decoding
Feat/support nemotron h mtp
needs-rebase
new-model
Requests to new models
qwen
Related to Qwen models
speculative-decoding
v1
#28541
opened Nov 12, 2025 by
shaharmor98
Draft
5 tasks
[Performance] Fuse DeepSeek shared experts and gate operations
ci/build
deepseek
Related to DeepSeek models
performance
Performance-related issues
#28540
opened Nov 12, 2025 by
Red-Caesar
Fix KV sharing fast prefill with cudagraph enabled
nvidia
v1
#28537
opened Nov 12, 2025 by
sarckk
3 of 5 tasks
[Hardware][PowerPC] Fix fp16 compilation error for Power in cpu attention backend and bump oneDNN version
ci/build
#28535
opened Nov 12, 2025 by
Akashcodes732
[CI Failure] Fix backend selection for encoder-only models
needs-rebase
nvidia
rocm
Related to AMD ROCm
tpu
Related to Google TPUs
v1
#28534
opened Nov 12, 2025 by
hl475
5 tasks
[Bugfix] Eliminate tuple inputs to submodules in graph partitioning
ci/build
needs-rebase
#28533
opened Nov 12, 2025 by
gmagogsfm
[Doc] Update plugin doc
documentation
Improvements or additions to documentation
v1
#28532
opened Nov 12, 2025 by
wangxiyuan
5 tasks
[Frontend] supports interleaved thinking
documentation
Improvements or additions to documentation
frontend
ready
ONLY add when PR is ready to merge/full CI is needed
tool-calling
#28531
opened Nov 12, 2025 by
chaunceyjiang
5 tasks
[Docs] Clean up moe_kernel_features.md
documentation
Improvements or additions to documentation
#28530
opened Nov 12, 2025 by
windsonsea