-
-
Notifications
You must be signed in to change notification settings - Fork 5.8k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Model] GPTBigCodeForEmbedding supporting token span classification
ci/build
documentation
Improvements or additions to documentation
frontend
speculative-decoding
structured-output
v1
#13681
opened Feb 21, 2025 by
michaelrglass
Loading…
[Bugfix][API Server] Fix invalid usage of 'ge' and 'le' in port valid…
frontend
ready
ONLY add when PR is ready to merge/full CI is needed
#13672
opened Feb 21, 2025 by
WangErXiao
Loading…
[Misc] Capture and log the time of loading weights
v1
#13666
opened Feb 21, 2025 by
waltforme
Loading…
Correction to TP logic for Mamba Mixer 2 when Num Groups not divisible by TP Size
#13660
opened Feb 21, 2025 by
fabianlim
Loading…
[model][refactor] remove cuda hard code in models and layers
speculative-decoding
#13658
opened Feb 21, 2025 by
MengqingCao
Loading…
[ROCM] fix native attention function call
ready
ONLY add when PR is ready to merge/full CI is needed
#13650
opened Feb 21, 2025 by
gongdao123
Loading…
docs: Add a note on full CI run in contributing guide
documentation
Improvements or additions to documentation
#13646
opened Feb 21, 2025 by
terrytangyuan
Loading…
[Model][Speculative Decoding] Expand DeepSeek MTP code to support k > n_predict
speculative-decoding
#13626
opened Feb 20, 2025 by
benchislett
Loading…
[Bugfix] Flush TunableOp results before worker processes are destroyed.
rocm
#13623
opened Feb 20, 2025 by
naromero77amd
Loading…
[Frontend] [Minor] Fix tqdm progress bar for n > 1
frontend
#13621
opened Feb 20, 2025 by
franzscherr
Loading…
[Misc] Bump compressed-tensors
ci/build
ready
ONLY add when PR is ready to merge/full CI is needed
#13619
opened Feb 20, 2025 by
dsikka
Loading…
[V1] TPU - Add tensor parallel support via Ray
ci/build
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#13618
opened Feb 20, 2025 by
alexm-redhat
Loading…
[Bugfix] Fix boolean conversion for OpenVINO env variable
#13615
opened Feb 20, 2025 by
helena-intel
Loading…
[CI/Build] Remove limitation of NVCC_THREADS and MAX_JOBS > CPU count
ci/build
#13606
opened Feb 20, 2025 by
paul-grundmann
Loading…
[Kernel][Minor] Refactor macro parameter naming for consistency
#13605
opened Feb 20, 2025 by
haochengxia
Loading…
[Misc] Upgrade Improvements or additions to documentation
ready
ONLY add when PR is ready to merge/full CI is needed
transformers
to 4.49
ci/build
documentation
#13602
opened Feb 20, 2025 by
ywang96
Loading…
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.