Pull requests: vllm-project/vllm

Pull requests list

[Doc] More neutral K8s deployment guide
#14084 opened Mar 2, 2025 by terrytangyuan. Labels: documentation (Improvements or additions to documentation)
[V1] Simplify stats logging
#14082 opened Mar 1, 2025 by njhill. Labels: v1
[v1] Refactor KVCacheConfig
#14079 opened Mar 1, 2025 by heheda12345. Labels: v1
[Misc] Update reasoning with stream example to use OpenAI library
#14077 opened Mar 1, 2025 by liuyanyi. Labels: documentation (Improvements or additions to documentation)
[V1] Use Triton(ROCm) Attention backend as fallback for Turing GPUs
#14071 opened Mar 1, 2025 by Isotr0py. Labels: rocm (Related to AMD ROCm), v1
[core] moe fp8 block quant tuning support
#14068 opened Mar 1, 2025 by divakar-amd
[Misc] Accurately capture the time of loading weights
#14063 opened Feb 28, 2025 by waltforme. Labels: ready (ONLY add when PR is ready to merge/full CI is needed), v1
[V1][Core] FlashInfer attention backend for V1
#14061 opened Feb 28, 2025 by aurickq. Labels: v1
[Docs] Add GPTQModel
#14056 opened Feb 28, 2025 by Qubitium. Labels: documentation (Improvements or additions to documentation)
[V1][Frontend] Improve Shutdown And Logs
#14048 opened Feb 28, 2025 by rafvasq. Labels: ci/build, frontend, ready (ONLY add when PR is ready to merge/full CI is needed), v1
[Kernel] Add more dtype support for GGUF kernels
#14043 opened Feb 28, 2025 by SzymonOzog
[Feature] Consolidate performance benchmark datasets
#14036 opened Feb 28, 2025 by JenZhao
[Misc] Modify xgrammar version
#14030 opened Feb 28, 2025 by shen-shanshan. Labels: ci/build
benchmark serving: random + sharegpt dataset
#14026 opened Feb 28, 2025 by seungrokj