Conversation

@ChrisRackauckas
Member

This should reduce precompilation

@topolarity
Contributor

Shaves off 10-15s of pre-compilation time on my machine (from ~4m7s)

Master trace: https://topolarity.github.io/trace-viewer/?trace=https%3A%2F%2Fraw.githubusercontent.com%2Ftopolarity%2Ftracy-traces%2Fdump%2Fdump%2Fmaster-OrdinaryDiffEq-cc46ec8b.tracy&size=2713067
With this PR: https://topolarity.github.io/trace-viewer/?trace=https%3A%2F%2Fraw.githubusercontent.com%2Ftopolarity%2Ftracy-traces%2Fdump%2Fdump%2Finit_dt_split-OrdinaryDiffEq-1328a0c3.tracy&size=2683730

Of the four-ish minutes, the dominant components are:

  • module-level opt/codegen (1m30s)
  • per-function JIT (1m6s)
  • inference (44s)
  • lowering (22s)

Here are the heaviest per-function JIT hitters after this PR (the number in parentheses is the specialization count):
[image: table of per-function JIT times with specialization counts]

@topolarity
Contributor

Inference time is >80% dominated by 64 specializations of solve (which later gets split into all the JIT-ed functions above):
[image: inference time breakdown across the solve specializations]

That var"#lorenz#673" jumps out at me. It seems like we should not be specializing on the system closure function?
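If the goal is to avoid specializing on the user's system function, SciMLBase's specialization levels are the user-facing knob. A minimal sketch, assuming the documented SciMLBase API (the lorenz! definition here is illustrative, not this PR's precompile workload):

```julia
# Sketch: the second type parameter of ODEProblem controls how much the
# solver stack specializes on the user's right-hand-side function.
using OrdinaryDiffEq, SciMLBase

function lorenz!(du, u, p, t)
    du[1] = 10.0 * (u[2] - u[1])
    du[2] = u[1] * (28.0 - u[3]) - u[2]
    du[3] = u[1] * u[2] - (8 / 3) * u[3]
end

u0 = [1.0, 0.0, 0.0]
tspan = (0.0, 100.0)

# AutoSpecialize (the default) wraps the rhs so solver internals compile
# against a shared wrapper type; NoSpecialize avoids rhs specialization
# entirely, trading some runtime speed for less compilation.
prob = ODEProblem{true, SciMLBase.NoSpecialize}(lorenz!, u0, tspan)
sol = solve(prob, Tsit5())
```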

@ChrisRackauckas
Member Author

solve will specialize, but then internally it'll put a function wrapper on it and then it should stop specializing on it from that point?
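The wrapper idea can be illustrated with FunctionWrappers.jl, which underlies SciMLBase's AutoSpecialize path. This standalone sketch shows the type-erasure mechanism, not OrdinaryDiffEq's internal code:

```julia
# Sketch: two distinct closures erase to one concrete wrapper type, so
# downstream callees compile once instead of once per closure.
import FunctionWrappers: FunctionWrapper

f1 = x -> 2x
f2 = x -> x + 1

w1 = FunctionWrapper{Float64, Tuple{Float64}}(f1)
w2 = FunctionWrapper{Float64, Tuple{Float64}}(f2)

@assert typeof(w1) === typeof(w2)  # same concrete type despite different closures
@assert w1(3.0) == 6.0
@assert w2(3.0) == 4.0
```

Anything compiled against the `FunctionWrapper{Float64, Tuple{Float64}}` type is reused for every function stored in such a wrapper, which is why specialization should stop past the wrapping point.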

@topolarity
Contributor

> solve will specialize, but then internally it'll put a function wrapper on it and then it should stop specializing on it from that point?

I guess we'd expect future inference times for solve() with other systems to be much lower, then, since they'll re-use callee results on the wrapper type (even though the entrypoint will be unique for each system).

In that case, my first-glance takeaway from the JIT list is that no single function dominates the pre-compilation time (perform_step! is the heaviest hitter and accounts for only ~15% of the JIT time). That means we either need to reduce the specialization count N for a large number of these functions, or improve things upstream (e.g. by parallelizing this work) if we want to make a bigger dent.
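Per-method inference cost like the tables above can be collected with SnoopCompile's inference profiler. A hedged sketch following SnoopCompile's documented workflow (macro and function names may differ across versions; the toy scalar problem is just to keep it self-contained):

```julia
# Sketch: record inference timing while solving, then list the most
# expensive method instances.
using SnoopCompileCore
using OrdinaryDiffEq

prob = ODEProblem((u, p, t) -> 1.01u, 0.5, (0.0, 1.0))
tinf = @snoopi_deep solve(prob, Tsit5())

using SnoopCompile   # analysis tools; loaded after snooping so they aren't measured
fl = flatten(tinf)   # flat list of per-method-instance inference timings
fl[end-9:end]        # the ten most expensive entries
```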
