-
Notifications
You must be signed in to change notification settings - Fork 194
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Prevents race condition from multiple builds attempting to use/delete the same julia install #3662
base: main
Are you sure you want to change the base?
Conversation
I'm not sure how this works for distributed tests but I don't think anything needs to change there |
I think everything got cancelled rather than failing but not sure why... |
suggested some changes (otherwise we get a bunch of folders like "16574", "16344" with no other indication so I added a label) |
Co-authored-by: Gregory L. Wagner <[email protected]>
This doesn't seem to fix the problem: https://buildkite.com/clima/oceananigans/builds/16499#0190eab5-ee7c-47da-8bb6-f0026a31e110 Although I was hopeful. But perhaps having separate depots is enough... One issue could be that |
I wonder if we are not instantiating the test environment correctly. We do instantiate here: Oceananigans.jl/.buildkite/pipeline.yml Line 20 in abb66e3
but the segmentation fault is coming from
There may be some clues here: https://discourse.julialang.org/t/how-do-i-activate-the-test-environment-of-a-package/86740/3 but we may also want to reach out on the Julia slack |
The main downside of this PR is that our CI will get clogged up more frequently (sometimes the servers get full, and we have to go in and manually delete the builds that were not deleted during clean up). But we could consider merging this if it seems to help, even if it does not fully solve the issue. Perhaps @navidcy and @simone-silvestri have thoughts. |
or at least we could try following that route (it seems like there were some problems also with that implementation) |
I tried experimenting with this on the OceanBioME tests and it seems to be preventing it from segfaultinig OceanBioME/OceanBioME.jl#190 (and testing running them both at the same time here OceanBioME/OceanBioME.jl#196) But it does also seem to run quite a bit slower so I'm going to see how much I can get it to cache without it causing problems. And then I can copy the implementation over to here. |
See #3661