Pull requests: NVIDIA/TensorRT-Model-Optimizer
Preserve original rope scaling type in export due to transformers library AutoConfig issue
#452
opened Oct 17, 2025 by
Edwardf0t1
Updated amax_sync test to set const weights based on rank
#451
opened Oct 17, 2025 by
kinjalpatel27
[5593873] [ONNX] Fix ResAdd logic to support 'Conv-BN-Sigmoid-Mul-Add' as fusible patterns
#450
opened Oct 17, 2025 by
gcunhase
[1/2] Registry interface for custom quantization functional backend
#449
opened Oct 17, 2025 by
realAsma
[5271050, 5274346][ONNX] Add support for Conv-Act-Pool fusion
#448
opened Oct 17, 2025 by
gcunhase
Bump TRT-LLM to 1.1.0rc5 + fix failing CICD tests
#445
opened Oct 17, 2025 by
kevalmorabia97
1 task done
Add SD3.5-medium quantization support in ModelOpt Diffusers example
#444
opened Oct 17, 2025 by
vishalpandya1990
[Autocast] Add low precision autocasting support for Resize op
#436
opened Oct 14, 2025 by
aboubezari
Cleanup mixed precision and gather node layer info mapping
#434
opened Oct 14, 2025 by
ynankani
Add example for multinode calibration using FSDP2
#432
opened Oct 13, 2025 by
sugunav14
2 of 5 tasks
Ensure that the ONNX IR version is the max supported version (10)
#416
opened Oct 9, 2025 by
gcunhase
[4975376][5541172] perplexity and kl-divergence benchmark metrics
#411
opened Oct 8, 2025 by
ynankani
EAGLE parallel draft with auto regression; kv cache in EAGLE training
#391
opened Sep 29, 2025 by
yeyu-nvidia
[5545101]: AutoCast: Add options to force include node/op in F16
#386
opened Sep 28, 2025 by
galagam