NVIDIA / TensorRT-LLM Public

Pull requests: NVIDIA/TensorRT-LLM

289 Open 2,447 Closed

Pull requests list

feat: Improve dev container tagging

#5551 opened Jun 27, 2025 by ixlmar • Draft

feat: Optimize TRTLLM Sampler perf single beam single step

#5550 opened Jun 27, 2025 by dcampora

tests: add test_chunked_prefill for llama4

#5549 opened Jun 27, 2025 by xinhe-nv • Draft

Deduplicate waive list

#5546 opened Jun 27, 2025 by yiqingy0

test: Use multiple workers for multi-GPU engine building

#5545 opened Jun 27, 2025 by Funatiq • Draft

rcca: test default kv_cache_reuse option for pytorch multimodal

#5544 opened Jun 27, 2025 by StanleySun639

[draft] chore: enhance GenerationExecutor with RPC

#5543 opened Jun 27, 2025 by Superjomn • Draft

[DON'T MERGE] NGram with iter_stats

#5542 opened Jun 27, 2025 by wili-65535 • Draft

[nvbug 5304752][fix]: enhance _check_arguments to filter illegal requests for pytorch backend

#5541 opened Jun 27, 2025 by LinPoly

Add pd dynamic scaling readme

#5540 opened Jun 27, 2025 by Shunkangz

[Infra][release/0.21]Update nccl to 2.27.5

#5539 opened Jun 27, 2025 by EmmaQiaoCh

feat: Add support for MXFP8xMXFP4 in pytorch

#5535 opened Jun 27, 2025 by djns99 • Draft

Refactor: move DeepEP from Docker images to wheel building

#5534 opened Jun 27, 2025 by yuantailing • Draft

[nvbug/5302638] fix _handle_cancelled_requests

#5532 opened Jun 27, 2025 by QiJune

[feat] update & support sm89 deepgemm bmm

#5531 opened Jun 27, 2025 by CarstyYou

[enh] [GH/CI] [WIP] Auto-assign PR reviewers using module-owners information randomly

#5530 opened Jun 27, 2025 by venkywonka • Draft

feat(models): Mistral3.1 VLM pytorch backend support

#5529 opened Jun 26, 2025 by 2ez4bz

[nvbugs/5302040] feat. Add whisper support (Bert Attention on SM100 and GPTAttention for cross attention on SM100)

#5527 opened Jun 26, 2025 by wu6u3tw

[nvbug/5337601][fix] Fix disagg + speculative decoding

#5525 opened Jun 26, 2025 by mikeiovine

Add support for sm121

#5524 opened Jun 26, 2025 by pamelap-nvidia • Draft

[DRAFT] feat: transfer mm_data and refactor HyperCLOVAX & Qwen2/2.5-VL

#5522 opened Jun 26, 2025 by yechank-nvidia • Draft

[feat] Add Tencent HunYuanMoEV1 model support

Labels: Community Engagement (help/insights needed from community), Community want to contribute (PRs initiated from Community)

#5521 opened Jun 26, 2025 by qianbiaoxiang

[don't review] Fp8 blockwise gemm autotune

#5518 opened Jun 26, 2025 by limin2021 • Draft

feature: unify new_tokens format sample state to trtllm samper tokens format

#5513 opened Jun 26, 2025 by netanel-haber • Draft

WIP: [feat] Add Vertex AI compatible prediction route, /vertex_generate

#5508 opened Jun 26, 2025 by harrisonlimh • Draft
