Pull requests: sgl-project/sglang
Super tiny fix naming in bench serving scripts
run-ci
#12515
opened Nov 2, 2025 by
fzyzcjy
4 tasks
Supports direct retrieval of tokenizer content and chat template content from the worker.
run-ci
#12513
opened Nov 2, 2025 by
whybeyoung
[GDN] Fuse b.sigmoid(), fused_gdn_gating and unsqueeze into one kernel: up to 0.85% e2e speedup
run-ci
#12508
opened Nov 2, 2025 by
byjiang1996
4 tasks done
[Refact] Remove hardcoded KV cache dimension in MLATokenToKVPool
run-ci
#12502
opened Nov 1, 2025 by
Johnsonms
4 tasks done
fix: Missing @dataclass for ExpertDistributionReq
#12501
opened Nov 1, 2025 by
SecretSettler
4 tasks done
WIP: [Bug] fix blocking operations in coroutines
#12495
opened Nov 1, 2025 by
howardlau1999
4 tasks
fix: Excessive preemption occurs when preempting running requests to schedule new prefill requests.
#12494
opened Nov 1, 2025 by
CLFutureX
[Feature] Ascend support enable-mixed-chunk version2
#12491
opened Nov 1, 2025 by
MichelleWu351
Draft
fix typo of args description in sglang.profiler
#12486
opened Nov 1, 2025 by
ai-easy-cpu
4 tasks
[ServerArgs] allow --mamba-ssm-dtype extend
run-ci
#12481
opened Nov 1, 2025 by
hanming-lu
4 tasks
Add the Nvidia-ModelOPT FP8 quantization test case
run-ci
#12478
opened Oct 31, 2025 by
jingyu-ml
2 of 4 tasks
[router] add WASM support for middleware
router
run-ci
#12471
opened Oct 31, 2025 by
tonyluj
1 of 4 tasks
test: support return logprobs in bench_offline_throughput test
#12462
opened Oct 31, 2025 by
aftersnow
4 tasks
Logging cache_hit_rate only for prefill and speculative_decoding metrics only for decode
#12460
opened Oct 31, 2025 by
MMuzzammil1
1 of 4 tasks
fix: use model-specific params to resize images for qwen-vl series models
run-ci
#12458
opened Oct 31, 2025 by
yangsijia-serena
4 tasks
[fix] Handle escaped characters in GLM tool call parser to prevent double serialization
run-ci
#12456
opened Oct 31, 2025 by
soaringk
2 of 4 tasks
[Grammar Fix] GLM-4-MOE self.first_k_dense_replace is undefined.
#12455
opened Oct 31, 2025 by
zRzRzRzRzRzRzR
[Fix] concat_mla_absorb_q_kernel fails for long inputs
#12453
opened Oct 31, 2025 by
bingps
2 of 4 tasks