Pull requests: vllm-project/vllm-gaudi
Pull requests list
Nixl deployment fixes
documentation
Improvements or additions to documentation
skip-gaudi-tests
#573
opened Nov 14, 2025 by
PatrykWo
Specify output tensor in matmul_qk - with version difference
#571
opened Nov 14, 2025 by
adobrzyn
[Docs] Readme for bucketing from file + env var added (#545)
documentation
skip-gaudi-tests
#570
opened Nov 14, 2025 by
adobrzyn
Docs: Missing content from Habana docs
documentation
skip-gaudi-tests
#562
opened Nov 13, 2025 by
mhelf-intel
Add a plugin for variable support in Markdown
documentation
skip-gaudi-tests
#554
opened Nov 12, 2025 by
mhelf-intel
fix loading fp8 static quantized model for compressored_tensors format.
#552
opened Nov 11, 2025 by
lkk12014402
Prepare Unified Attention biases on HPU + add NumPy memory pooling
#550
opened Nov 7, 2025 by
kzawora-intel
Refactor part of spec decode structure identical to vLLM
#544
opened Nov 7, 2025 by
jerrychenhf
[SW-228042] Add support for dynamic vLLM kv-cache quantization
#538
opened Nov 6, 2025 by
dudilester
[Attention Metadata Overhaul 2/N] Move metadata processing outside HPUModelAdapter, prepare biases on CPU
#530
opened Nov 5, 2025 by
kzawora-intel
[Attention Metadata Overhaul 1/N] Extract metadata update to HPUAttentionMetadataProcessor
#526
opened Nov 5, 2025 by
kzawora-intel
reduce graph recompilations in input embeddings for Gemma3
#519
opened Nov 4, 2025 by
skaulintel
•
Draft
Call shutdown_inc to mitigate driver worker teardown order
#511
opened Nov 3, 2025 by
michalkuligowski
•
Draft
[Attention Metadata Overhaul 3/N] Add per-layer attention metadata
#475
opened Oct 24, 2025 by
kzawora-intel
•
Draft