Pull requests: vllm-project/vllm
[v1][KVCacheManager] Rename BlockHashType to BlockHash
documentation
Improvements or additions to documentation
v1
#19015
opened Jun 2, 2025 by
heheda12345
[draft] add some nvtx ranges for vllm for diagnostic
documentation
Improvements or additions to documentation
needs-rebase
speculative-decoding
v1
[doc] add pytest tips
documentation
Improvements or additions to documentation
ready
ONLY add when PR is ready to merge/full CI is needed
#19010
opened Jun 2, 2025 by
reidliu41
fix prefix caching logic for running requests without speculative tokens
v1
#19006
opened Jun 1, 2025 by
yuguo68
[doc] small fix - serve_args.md
documentation
Improvements or additions to documentation
#18999
opened Jun 1, 2025 by
reidliu41
[DRAFT] Self-Speculative Decoding using LayerSkip
documentation
Improvements or additions to documentation
needs-rebase
speculative-decoding
v1
#18994
opened May 31, 2025 by
aniltolwani
Draft
[ROCm] [AITER] [Bugfix] Patch for AITER commit 648764942e552a8bb5fe16026703716a81f05374
ci/build
#18990
opened May 31, 2025 by
tjtanaa
[Benchmark] Add hf_stream arg to enable or disable datasets streaming loading
#18989
opened May 31, 2025 by
Potabk
Add tarsier model support
documentation
Improvements or additions to documentation
frontend
multi-modality
Related to multi-modality (#4194)
#18985
opened May 31, 2025 by
princepride
[Bugfix][Model] Attempt to fix eagle in V0.
ready
ONLY add when PR is ready to merge/full CI is needed
#18978
opened May 30, 2025 by
gshtras
[Core] Remove unnecessary copy of multi modal input embeddings
v1
#18973
opened May 30, 2025 by
lgeiger
[V1][Spec Decode][Ngram] 1.35x gain -> 1.95x gain on InstructCoder with prompt fix
#18971
opened May 30, 2025 by
ekagra-ranjan
[Bugfix][Core] Prefix caching enabled causes incorrect outputs
#18957
opened May 30, 2025 by
quanliu1991
Abstract mooncake store connector to kv store connector
#18936
opened May 30, 2025 by
maobaolong
[Bugfix][Config] Fix config dtype get error
needs-rebase
#18934
opened May 30, 2025 by
MengqingCao
Adding "LoRA Test %N" to AMD production tests
ci/build
rocm
Related to AMD ROCm
#18929
opened May 29, 2025 by
Concurrensee
feat: add data parallel rank to KVEventBatch
documentation
Improvements or additions to documentation
v1
#18925
opened May 29, 2025 by
PeaBrane