Skip to content

Pull requests: vllm-project/vllm-gaudi

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

fix block bucket size for DP+contiguous PA
#171 opened Sep 15, 2025 by wuxun-zhang Loading…
WarmUp for Pooling - Embed Task
#170 opened Sep 15, 2025 by slokesha Loading…
Support Ray distributed executor
#169 opened Sep 14, 2025 by xinyu-intel Loading…
Bug fix: hpu mrope
#167 opened Sep 12, 2025 by attafosu Loading…
TESTOWNERS update
#165 opened Sep 12, 2025 by adobrzyn Loading…
Introduce VLLM_SCALE_ADJUSTMENT
#164 opened Sep 12, 2025 by xinyu-intel Loading…
Fix for negative logits
#160 opened Sep 11, 2025 by pawel-olejniczak Loading…
Fix unified after DP changes
#156 opened Sep 11, 2025 by adobrzyn Loading…
Enable group indexing gptq
#154 opened Sep 11, 2025 by jmamzax Loading…
Enable interleaved sliding window for gemma3
#150 opened Sep 10, 2025 by jiminha Loading…
AttentionMetadata Preparation for Encoder-only Models
#145 opened Sep 9, 2025 by slokesha Loading…
[WIP] Enable mamba
#138 opened Sep 4, 2025 by tianmu-li Draft
[SW-236089] UTs: multimodality correctness
#136 opened Sep 4, 2025 by kfojcik-intel Loading…
Fully overlap model execution
#134 opened Sep 3, 2025 by tianmu-li Loading…
Add out-of-tree HPU schedulers
#119 opened Sep 1, 2025 by kzawora-intel Loading…
[WARMUP] fix update bucket
#118 opened Aug 29, 2025 by xuechendi Loading…
[Bucketing] Read buckets from file
#101 opened Aug 23, 2025 by adobrzyn Draft
Add attention unit tests
#74 opened Aug 12, 2025 by tthaddey Loading…
Lookahead decoding
#72 opened Aug 11, 2025 by jkaniecki Loading…
ProTip! Type g i on any issue or pull request to go back to the issue listing page.