thunder.jit
has a relatively high CPU overhead when processing small graphs with small inputs.
#1657
Labels
thunder.jit
has a relatively high CPU overhead when processing small graphs with small inputs.
#1657
Note: If you have a model or program that is not supported yet but should be, please use the program coverage template.
🐛 Bug
thunder.jit
has a relatively high CPU overhead when processing small graphs with small inputs.To Reproduce
output:
Note that the nvfuser function is obtained from
get_nvfuser_repro
, which should be identical to what Thunder executes using the nvfuser backend. However, the execution time is significantly longer compared to the nvfuser API version.The trace of the thunder.jit:
I believe the difference is due to CPU overhead in
thunder.jit
. Since the kernel finishes very quickly in this case, it’s likely not an issue for real models. Please feel free to close this if you don't consider it a problem.cc: @mruberry
The text was updated successfully, but these errors were encountered: