Non-rank-0 processes were continuing on and waiting for rank 0's
forward pass while rank 0 was still saving artifacts, which led to
collective timeouts. In testing, this change reduced the incidence of
timeouts because non-rank-0 processes now wait at a known wait-point.
Signed-off-by: James Kunstle <[email protected]>
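
The pattern described in the message can be sketched as follows. This is a minimal illustration, not the code from this commit: the names `save_then_sync`, `run_simulation`, `save_fn`, and `barrier_fn` are hypothetical, and threads with a `threading.Barrier` stand in for distributed ranks so the example runs without a GPU or process group.

```python
import threading
import time


def save_then_sync(rank, save_fn, barrier_fn):
    """Only rank 0 saves; every rank then blocks at the same wait-point.

    `save_fn` and `barrier_fn` are hypothetical callables supplied by the caller.
    """
    if rank == 0:
        save_fn()
    # All ranks, including rank 0, meet here before continuing.
    barrier_fn()


def run_simulation(world_size=4, save_seconds=0.05):
    """Simulate `world_size` ranks with threads and record the order of events."""
    barrier = threading.Barrier(world_size)
    events = []
    lock = threading.Lock()

    def log(message):
        with lock:
            events.append(message)

    def slow_save():
        # Stand-in for rank 0 writing artifacts to disk.
        time.sleep(save_seconds)
        log("saved")

    def worker(rank):
        save_then_sync(rank, slow_save, barrier.wait)
        log(f"rank {rank} resumed")

    threads = [threading.Thread(target=worker, args=(r,)) for r in range(world_size)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return events


if __name__ == "__main__":
    for event in run_simulation():
        print(event)
```

In a PyTorch job, one would presumably pass `torch.distributed.barrier` as `barrier_fn`, so that the other ranks idle at an explicit synchronization point rather than inside the next collective. The message does not say which primitive the commit actually uses, so treat that as an assumption.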