CPU Memory Consumption Reduction #3839
narendasan
started this conversation in
RFCs
Replies: 2 comments
-
In Qwen, TRT builder uses 1x to build live engine. |
Beta Was this translation helpful? Give feedback.
0 replies
-
INetworkDefinition does not take any memory actually, it is the lowered graph and constand folding takes the up to 1x (0-1x) memory. Code here: INetwork is just hold the reference to the weights in the lowered graph |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Beta Was this translation helpful? Give feedback.
All reactions