-
Notifications
You must be signed in to change notification settings - Fork 15
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ISAAC plugin exits with segmentation fault #131
Comments
@benjha Thanks for reporting the error.
Then we can quickly check whether we are able to reproduce the error on hemera as well. |
PIConGPU came from the dev branch dated back to Nov. 2020 with its own Alpaka distribution
ISAAC:
|
@benjha Thanks for providing the details. I will see whether I can reproduce this bug. |
I am installing current PIConGPU dev branch with ISAAC 1.5.2 to verify if they work properly from this case. I am having a list of these errors:
which likely is an Alpaka version mismatch between the one PIConGPU dev uses and ISAAC uses. Were there any changes on the way compilation works? |
Are you sure you used the release 1.5.2 and not the current dev branch? The dev branch of ISAAC is currently incompatible with the PIConGPU dev branch. There is a PR ComputationalRadiationPhysics/picongpu#3498 in PIConGPU to fix it but we need to switch our PIConGPU CI first to the ISAAC dev branch. The release 1.5.2 is currently checked together with PIConGPU dev. |
@FelixTUD Could you please test the current dev of PIConGPU together with the release 1.5.2? |
I've rechecked dependencies and fixed the Alpaka mismatch issue. With PIConGPU current dev branch and ISAAC 1.5.2 following the next configuration:
PIConGPU throws the next errors:
this is the output from ISAAC-server:
For now, I will be dumping the ISAAC timers into files, but will be great to get more insight by using a profiler. |
@psychocoderHPC I'm looking into it, a LWFA setup compiles without a problem on hemera with pic dev and isaac 1.5.2 |
I can reproduce an identical error with an mpi execution of the example, this should help me tracking down the problem |
@benjha I might have found the error, you can try and remove the line Line 3465 in c7e9ff9
I need to have a more detailed look into it later, as it seems that json_init_root is only initialized on the master node, thats why it throws seg fault on all other nodes on destruction, let me know if it fixed it for now |
Thanks @FelixTUD It worked. I am testing further... |
This should be fixed with #132 |
Hi @PrometheusPi @psychocoderHPC,
After several unsuccessful attempts to get some traces out with TAU, I ran PIConGPU &ISAAC in a default configuration (profiling off, dumping viz. frames to Alpine, 1000 steps with checkpoint.restart.loop=3, using the
/etc/picongpu/8_isaac.cfg
file) and noted the simulation breaks with the next errors at the end of its execution, which is the cause TAU can't generate the traces:Looks like the issue is in the IsaacPlugin.hpp's
pluginUnload()
method which in turn call the IsaacVisualization destructor.Can you reproduce this error ?
The text was updated successfully, but these errors were encountered: