-
-
Notifications
You must be signed in to change notification settings - Fork 6k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Doc] More neutral K8s deployment guide
documentation
Improvements or additions to documentation
#14084
opened Mar 2, 2025 by
terrytangyuan
Loading…
[Misc] Update reasoning with stream example to use OpenAI library
documentation
Improvements or additions to documentation
#14077
opened Mar 1, 2025 by
liuyanyi
Loading…
[V1] Use Triton(ROCm) Attention backend as fallback for Turing GPUs
rocm
Related to AMD ROCm
v1
#14071
opened Mar 1, 2025 by
Isotr0py
Loading…
[Bugfix] Make memory profiler account for speculative draft model weights
speculative-decoding
#14067
opened Feb 28, 2025 by
benchislett
Loading…
[Frontend] Allow return_tokens_as_token_ids to be passed as a request param
frontend
#14066
opened Feb 28, 2025 by
benchislett
Loading…
Tune release tag to support release candidates
ci/build
#14064
opened Feb 28, 2025 by
atalman
Loading…
[Misc] Accurately capture the time of loading weights
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#14063
opened Feb 28, 2025 by
waltforme
Loading…
[Distributed] Add reduce_scatter to DeviceCommunicatorBase
#14057
opened Feb 28, 2025 by
tlrmchlsmth
•
Draft
[Docs] Add GPTQModel
documentation
Improvements or additions to documentation
#14056
opened Feb 28, 2025 by
Qubitium
Loading…
[Bugfix][Frontend] Strip empty tool calls from incoming chat conversations
frontend
#14054
opened Feb 28, 2025 by
benchislett
Loading…
[Bugfix] Explicitly include "omp.h" for MacOS to avoid installation failure
#14051
opened Feb 28, 2025 by
realShengYao
•
Draft
[Misc] Add fully interleaved support for multimodal 'string' content format
frontend
#14047
opened Feb 28, 2025 by
Dekakhrone
Loading…
[V1] Avoid false positives when warning for unimplemented methods
#14046
opened Feb 28, 2025 by
DarkLight1337
Loading…
[Bugfix] Fix Precision Mismatch in MoE Router of DeepSeek V2/V3 Models and Fused Kernels (BF16 -> FP32)
#14027
opened Feb 28, 2025 by
DaizeDong
Loading…
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.