Skip to content

Pull requests: NVIDIA/TensorRT-LLM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Added code owners for AutoDeploy
#4769 opened May 29, 2025 by juney-nvidia Loading…
[DO NOT MERGE] Debug perf
#4768 opened May 29, 2025 by kaiyux Draft
fix: EP load balancer with MTP layer
#4767 opened May 29, 2025 by syuoni Loading…
fix: re-enable tp/pp for quickstart_advanced.py.
#4766 opened May 29, 2025 by yuxianq Loading…
chore: remove request_error ipc in LLM.submit
#4763 opened May 29, 2025 by Superjomn Loading…
draft: refactor and fix mtp vanilla
#4762 opened May 29, 2025 by lfr-0531 Loading…
Upgrade CUTLASS to v3.9.2
#4760 opened May 29, 2025 by Barry-Delaney Loading…
[nvbug 5305210] Resolve nvbug 5305210
#4759 opened May 29, 2025 by DomBrown Loading…
Refactor test timeout for individual long case
#4757 opened May 29, 2025 by EmmaQiaoCh Loading…
tests: Update gb200 test case
#4754 opened May 29, 2025 by yizhang-nv Draft
[Docs] - Add date and commit info (#4448)
#4752 opened May 29, 2025 by chzblych Loading…
tests: fix 5250460
#4751 opened May 29, 2025 by xinhe-nv Loading…
feat: Add Mixture of Experts FP8xMXFP4 support
#4750 opened May 29, 2025 by djns99 Loading…
Cherry-pick feat/llama4's changes
#4746 opened May 29, 2025 by nvpohanh Loading…
chore: fix llm_root when LLM_ROOT is not set
#4741 opened May 28, 2025 by achartier Loading…
ProTip! no:milestone will show everything without a milestone.