{2023.06}[2023a,a64fx] apps originally built with EB 4.9.2 #1091

Open, wants to merge 3 commits into base: main

trz42
Collaborator

@trz42 trz42 commented May 27, 2025

Includes all apps in this batch. Build time on NVIDIA Grace was ~16 hours. We might need to split this up and limit build parallelism for some packages.

Full list of 83 apps:

amdahl/0.3.1-gompi-2023a
BCFtools/1.18-GCC-12.3.0
BioPerl/1.7.8-GCCcore-12.3.0
BWA/0.7.18-GCCcore-12.3.0
CapnProto/1.0.1-GCCcore-12.3.0
Cartopy/0.22.0-foss-2023a
colorize/0.7.7-GCC-12.3.0
CP2K/2023.1-foss-2023a
crb-blast/0.6.9-GCC-12.3.0
Critic2/1.2-foss-2023a
DB/18.1.40-GCCcore-12.3.0
DB_File/1.859-GCCcore-12.3.0
DendroPy/4.6.1-GCCcore-12.3.0
f90wrap/0.2.13-foss-2023a
fastp/0.23.4-GCC-12.3.0
Fiona/1.9.5-foss-2023a
FragGeneScan/1.31-GCCcore-12.3.0
geopandas/0.14.2-foss-2023a
groff/1.22.4-GCCcore-12.3.0
grpcio/1.57.0-GCCcore-12.3.0
gtk-doc/1.34.0-GCCcore-12.3.0
h5netcdf/1.2.0-foss-2023a
HDBSCAN/0.8.38.post1-foss-2023a
HMMER/3.4-gompi-2023a
HTSlib/1.18-GCC-12.3.0
IQ-TREE/2.3.5-gompi-2023a
ITSTool/2.0.7-GCCcore-12.3.0
jemalloc/5.3.0-GCCcore-12.3.0
Judy/1.0.5-GCCcore-12.3.0
KaHIP/3.16-gompi-2023a
KronaTools/2.8.1-GCCcore-12.3.0
libaio/0.3.113-GCCcore-12.3.0
libcint/5.4.0-gfbf-2023a
libgcrypt/1.10.3-GCCcore-12.3.0
libgpg-error/1.48-GCCcore-12.3.0
Libint/2.7.2-GCC-12.3.0-lmax-6-cp2k
librosa/0.10.1-foss-2023a
libvori/220621-GCCcore-12.3.0
libxml2-python/2.11.4-GCCcore-12.3.0
LLVM/14.0.6-GCCcore-12.3.0-llvmlite
LRBinner/0.1-foss-2023a
LSD2/2.4.1-GCCcore-12.3.0
LZO/2.10-GCCcore-12.3.0
MAFFT/7.520-GCC-12.3.0-with-extensions
mallard-ducktype/1.0.2-GCCcore-12.3.0
MariaDB/11.6.0-GCC-12.3.0
maturin/1.4.0-GCCcore-12.3.0-Rust-1.75.0
MBX/1.1.0-foss-2023a
Meson/1.3.1-GCCcore-12.3.0
meson-python/0.15.0-GCCcore-12.3.0
MetalWalls/21.06.1-foss-2023a
ncbi-vdb/3.0.10-gompi-2023a
netcdf4-python/1.6.4-foss-2023a
numba/0.58.1-foss-2023a
OpenFOAM/10-foss-2023a
OpenFOAM/11-foss-2023a
OpenFOAM/v2312-foss-2023a
orjson/3.9.15-GCCcore-12.3.0
Perl-bundle-CPAN/5.36.1-GCCcore-12.3.0
psycopg2/2.9.9-GCCcore-12.3.0
Pygments/2.18.0-GCCcore-12.3.0
pyproj/3.6.0-GCCcore-12.3.0
python-xxhash/3.4.1-GCCcore-12.3.0
QuantumESPRESSO/7.3.1-foss-2023a
Raptor/2.0.16-GCCcore-12.3.0
Rasqal/0.9.33-GCCcore-12.3.0
Redland/1.0.17-GCC-12.3.0
Ruby/3.3.0-GCCcore-12.3.0
Rust/1.75.0-GCCcore-12.3.0
SciTools-Iris/3.9.0-foss-2023a
Seaborn/0.13.2-gfbf-2023a
Shapely/2.0.1-gfbf-2023a
SQLAlchemy/2.0.25-GCCcore-12.3.0
tqdm/4.66.1-GCCcore-12.3.0
Transrate/1.0.3-GCC-12.3.0
unixODBC/2.3.12-GCCcore-12.3.0
wradlib/2.0.3-foss-2023a
xarray/2023.9.0-gfbf-2023a
XML-LibXML/2.0209-GCCcore-12.3.0
xxHash/0.8.2-GCCcore-12.3.0
yell/2.2.2-GCC-12.3.0
yelp-tools/42.1-GCCcore-12.3.0
yelp-xsl/42.1-GCCcore-12.3.0
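
The idea of splitting this batch into smaller builds can be sketched as follows. This is only an illustration: the batch size of 20 is a hypothetical value, and the placeholder names stand in for the 83 entries listed above.

```python
def split_into_batches(items, batch_size):
    """Split a list into consecutive batches of at most batch_size items."""
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    return [items[i:i + batch_size] for i in range(0, len(items), batch_size)]


# Placeholder names standing in for the 83 apps in the list above.
apps = [f"app-{n:02d}" for n in range(1, 84)]

# Hypothetical batch size; a real split would likely group by build cost.
batches = split_into_batches(apps, 20)
print([len(batch) for batch in batches])  # [20, 20, 20, 20, 3]
```

Each batch could then go into its own pull request or build job, so that one failing or long-running package does not block the rest.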

@trz42 trz42 added 2023.06-software.eessi.io 2023.06 version of software.eessi.io a64fx labels May 27, 2025
@eessi-bot-deucalion

eessi-bot-deucalion bot commented May 27, 2025

Instance eessi-bot-deucalion is configured to build for:

  • architectures: aarch64/a64fx
  • repositories: eessi.io-2023.06-software

NOTE, bot code wasn't updated on Deucalion, therefore it created this comment.

@eessi-bot-toprichard

Instance rt-Grace-jr is configured to build for:

  • architectures: aarch64/nvidia/grace
  • repositories: eessi.io-2023.06-software

@trz42
Collaborator Author

trz42 commented May 27, 2025

bot: build instance:eessi-bot-deucalion repository:eessi.io-2023.06-software architecture:aarch64/a64fx
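
The command above follows a `bot: <action> key:value ...` shape. A minimal sketch of how such a line could be split into its parts is shown below; the function name `parse_bot_command` is hypothetical and this is not the bot's actual parser.

```python
def parse_bot_command(line):
    """Parse a 'bot: <action> key:value ...' line into an action and filters.

    Hypothetical helper for illustration; not the bot's own implementation.
    """
    prefix = "bot:"
    if not line.startswith(prefix):
        raise ValueError("not a bot command")
    tokens = line[len(prefix):].split()
    if not tokens:
        raise ValueError("missing action")
    action, args = tokens[0], tokens[1:]
    filters = {}
    for arg in args:
        # Split on the first colon only, so values may contain '/' or '.'.
        key, sep, value = arg.partition(":")
        if not sep or not value:
            raise ValueError(f"malformed argument: {arg!r}")
        filters[key] = value
    return {"action": action, "filters": filters}


cmd = parse_bot_command(
    "bot: build instance:eessi-bot-deucalion "
    "repository:eessi.io-2023.06-software architecture:aarch64/a64fx"
)
print(cmd["action"])                   # build
print(cmd["filters"]["architecture"])  # aarch64/a64fx
```

The three filters narrow the build to one bot instance, one target repository and one CPU architecture, which is why only `eessi-bot-deucalion` starts a job in response.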

@eessi-bot-toprichard

Updates by the bot instance rt-Grace-jr (click for details)
  • account trz42 has NO permission to send commands to the bot

@eessi-bot-deucalion

eessi-bot-deucalion bot commented May 27, 2025

New job on instance eessi-bot-deucalion for CPU micro-architecture aarch64-a64fx for repository eessi.io-2023.06-software in job dir /home/eessibot/new-bot/jobs/2025.05/pr_1091/437784

date | job status | comment
May 27 08:16:06 UTC 2025 | submitted | job id 437784 awaits release by job manager
May 27 08:17:00 UTC 2025 | released | job awaits launch by Slurm scheduler
May 27 08:18:04 UTC 2025 | running | job 437784 is running
May 27 08:31:43 UTC 2025 | finished | 😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-437784.out
✅ no message matching FATAL:
❌ found message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
No artefacts were created or found.
May 27 08:31:43 UTC 2025 | test result | 😁 SUCCESS (click triangle for details)
ReFrame Summary
[ SKIP ] (1/9) Skipping test: nodes in this partition only have 30720 MiB memory available (per node) accodring to the current ReFrame configuration, but 49152 MiB is needed
[ SKIP ] (2/9) Skipping test: nodes in this partition only have 30720 MiB memory available (per node) accodring to the current ReFrame configuration, but 49152 MiB is needed
[ SKIP ] (3/9) Skipping test: nodes in this partition only have 30720 MiB memory available (per node) accodring to the current ReFrame configuration, but 49152 MiB is needed
[ SKIP ] (4/9) Skipping test: nodes in this partition only have 30720 MiB memory available (per node) accodring to the current ReFrame configuration, but 49152 MiB is needed
[ OK ] (5/9) EESSI_LAMMPS_lj %device_type=cpu %module_name=LAMMPS/2Aug2023_update2-foss-2023a-kokkos %scale=1_node /04ff9ece @BotBuildTests:aarch64_a64fx+default
P: perf: 580.081 timesteps/s (r:0, l:None, u:None)
[ OK ] (6/9) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_latency %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node /15cad6c4 @BotBuildTests:aarch64_a64fx+default
P: latency: 1.68 us (r:0, l:None, u:None)
[ OK ] (7/9) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_latency %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node /6672deda @BotBuildTests:aarch64_a64fx+default
P: latency: 1.72 us (r:0, l:None, u:None)
[ OK ] (8/9) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_bw %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node /2a9a47b1 @BotBuildTests:aarch64_a64fx+default
P: bandwidth: 8794.13 MB/s (r:0, l:None, u:None)
[ OK ] (9/9) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_bw %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node /1b24ab8e @BotBuildTests:aarch64_a64fx+default
P: bandwidth: 8682.74 MB/s (r:0, l:None, u:None)
[ PASSED ] Ran 5/9 test case(s) from 9 check(s) (0 failure(s), 4 skipped, 0 aborted)
Details
✅ job output file slurm-437784.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case
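
The build checklist in the Details above amounts to scanning the job output for a handful of patterns, some of which must be absent and some present. A minimal sketch of that logic follows, assuming the overall status is FAILURE as soon as any single check fails. The names `CHECKS`, `evaluate_log` and `summarize` are hypothetical, the sample log text is made up, and this is not the bot's actual code.

```python
import re

# (pattern, must_be_present) pairs mirroring the build checklist above.
CHECKS = [
    (r"FATAL:", False),
    (r"ERROR:", False),
    (r"FAILED:", False),
    (r"required modules missing:", False),
    (r"No missing installations", True),
    (r"\.tar\.gz created!", True),
]


def evaluate_log(text):
    """Return (pattern, found, ok) for every check against the log text."""
    results = []
    for pattern, must_be_present in CHECKS:
        found = re.search(pattern, text) is not None
        results.append((pattern, found, found == must_be_present))
    return results


def summarize(text):
    """Assumed rule: SUCCESS only if every single check passes."""
    return "SUCCESS" if all(ok for _, _, ok in evaluate_log(text)) else "FAILURE"


# Made-up log excerpt with the same pattern hits as job 437784.
sample = "ERROR: something went wrong\nexample.tar.gz created!\n"
print(summarize(sample))  # FAILURE
```

Under this rule, the job above fails on two checks at once: an `ERROR:` message was found, and the `No missing installations` message was not.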

@trz42
Collaborator Author

trz42 commented May 27, 2025

Try again...
bot: build instance:eessi-bot-deucalion repository:eessi.io-2023.06-software architecture:aarch64/a64fx

@eessi-bot-deucalion

eessi-bot-deucalion bot commented May 27, 2025

New job on instance eessi-bot-deucalion for CPU micro-architecture aarch64-a64fx for repository eessi.io-2023.06-software in job dir /home/eessibot/new-bot/jobs/2025.05/pr_1091/437986

  • job has been restarted three times ... we shouldn't let the queueing system do that automatically as it might overwrite job output and limit our ability to debug issues
date | job status | comment
May 27 12:28:41 UTC 2025 | submitted | job id 437986 awaits release by job manager
May 27 12:28:59 UTC 2025 | released | job awaits launch by Slurm scheduler
May 27 12:30:01 UTC 2025 | running | job 437986 is running
May 27 19:29:04 UTC 2025 | finished | 🤷 UNKNOWN (click triangle for detailed information)
  • Job results file _bot_job437986.result does not exist in job directory, or parsing it failed.
  • No artefacts were found/reported.
May 27 19:29:04 UTC 2025 | test result | 🤷 UNKNOWN (click triangle for detailed information)
  • Job test file _bot_job437986.test does not exist in job directory, or parsing it failed.

@trz42
Collaborator Author

trz42 commented May 27, 2025

Rerun with additional argument --no-requeue to prevent automatic restarts...
bot: build instance:eessi-bot-deucalion repository:eessi.io-2023.06-software architecture:aarch64/a64fx

@eessi-bot-deucalion

eessi-bot-deucalion bot commented May 27, 2025

New job on instance eessi-bot-deucalion for CPU micro-architecture aarch64-a64fx for repository eessi.io-2023.06-software in job dir /home/eessibot/new-bot/jobs/2025.05/pr_1091/438489

date | job status | comment
May 27 19:32:14 UTC 2025 | submitted | job id 438489 awaits release by job manager
May 27 19:33:08 UTC 2025 | released | job awaits launch by Slurm scheduler
May 27 19:34:11 UTC 2025 | running | job 438489 is running
May 27 21:21:33 UTC 2025 | finished | 🤷 UNKNOWN (click triangle for detailed information)
  • Job results file _bot_job438489.result does not exist in job directory, or parsing it failed.
  • No artefacts were found/reported.
May 27 21:21:33 UTC 2025 | test result | 🤷 UNKNOWN (click triangle for detailed information)
  • Job test file _bot_job438489.test does not exist in job directory, or parsing it failed.

@boegel boegel changed the base branch from 2023.06-software.eessi.io to main June 15, 2025 14:45
@boegel
Contributor

boegel commented Jun 15, 2025

bot: build instance:eessi-bot-deucalion repository:eessi.io-2023.06-software architecture:aarch64/a64fx

@eessi-bot-deucalion

eessi-bot-deucalion bot commented Jun 15, 2025

New job on instance eessi-bot-deucalion for CPU micro-architecture aarch64-a64fx for repository eessi.io-2023.06-software in job dir /home/eessibot/new-bot/jobs/2025.06/pr_1091/461123

date | job status | comment
Jun 15 14:46:09 UTC 2025 | submitted | job id 461123 awaits release by job manager
Jun 15 14:46:34 UTC 2025 | released | job awaits launch by Slurm scheduler
Jun 15 14:47:37 UTC 2025 | running | job 461123 is running
Jun 15 16:40:18 UTC 2025 | finished | 🤷 UNKNOWN (click triangle for detailed information)
  • Job results file _bot_job461123.result does not exist in job directory, or parsing it failed.
  • No artefacts were found/reported.
Jun 15 16:40:18 UTC 2025 | test result | 🤷 UNKNOWN (click triangle for detailed information)
  • Job test file _bot_job461123.test does not exist in job directory, or parsing it failed.

@boegel
Contributor

boegel commented Jun 17, 2025

Looks like job 461123 failed prematurely for no good reason?
Last completed installation was MetalWalls/21.06.1-foss-2023a, maybe it got killed while trying to install QuantumESPRESSO-7.3.1-foss-2023a.eb?
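
To put "prematurely" in numbers, the time each job spent running can be computed from the `running` and `finished` timestamps in the status tables above. A minimal sketch, with the timestamps copied from this thread and the helper name `elapsed` chosen for illustration:

```python
from datetime import datetime

# Timestamp layout used in the bot's status tables, e.g. "May 27 12:30:01 UTC 2025".
FMT = "%b %d %H:%M:%S UTC %Y"

# (job id, 'running' timestamp, 'finished' timestamp) for the UNKNOWN jobs so far.
JOBS = [
    ("437986", "May 27 12:30:01 UTC 2025", "May 27 19:29:04 UTC 2025"),
    ("438489", "May 27 19:34:11 UTC 2025", "May 27 21:21:33 UTC 2025"),
    ("461123", "Jun 15 14:47:37 UTC 2025", "Jun 15 16:40:18 UTC 2025"),
]


def elapsed(start, end):
    """Return the time between two status-table timestamps as a timedelta."""
    return datetime.strptime(end, FMT) - datetime.strptime(start, FMT)


for job_id, start, end in JOBS:
    print(job_id, elapsed(start, end))
# 437986 6:59:03
# 438489 1:47:22
# 461123 1:52:41
```

All three run times are well below the roughly 16 hours reported for the NVIDIA Grace build in the PR description, which fits the picture of jobs ending before the batch was complete.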

@boegel
Contributor

boegel commented Jun 17, 2025

bot: build instance:eessi-bot-deucalion repository:eessi.io-2023.06-software architecture:aarch64/a64fx

@eessi-bot-deucalion

eessi-bot-deucalion bot commented Jun 17, 2025

New job on instance eessi-bot-deucalion for CPU micro-architecture aarch64-a64fx for repository eessi.io-2023.06-software in job dir /home/eessibot/new-bot/jobs/2025.06/pr_1091/463781

date | job status | comment
Jun 17 09:08:56 UTC 2025 | submitted | job id 463781 awaits release by job manager
Jun 17 09:09:27 UTC 2025 | released | job awaits launch by Slurm scheduler
Jun 17 09:13:23 UTC 2025 | running | job 463781 is running
Jun 17 11:00:39 UTC 2025 | finished | 🤷 UNKNOWN (click triangle for detailed information)
  • Job results file _bot_job463781.result does not exist in job directory, or parsing it failed.
  • No artefacts were found/reported.
Jun 17 11:00:39 UTC 2025 | test result | 🤷 UNKNOWN (click triangle for detailed information)
  • Job test file _bot_job463781.test does not exist in job directory, or parsing it failed.

@trz42
Collaborator Author

trz42 commented Jun 17, 2025

bot: build instance:eessi-bot-deucalion repository:eessi.io-2023.06-software architecture:aarch64/a64fx

@eessi-bot-deucalion

eessi-bot-deucalion bot commented Jun 17, 2025

New job on instance eessi-bot-deucalion for CPU micro-architecture aarch64-a64fx for repository eessi.io-2023.06-software in job dir /home/eessibot/new-bot/jobs/2025.06/pr_1091/464151

date | job status | comment
Jun 17 14:02:58 UTC 2025 | submitted | job id 464151 awaits release by job manager
Jun 17 14:03:35 UTC 2025 | released | job awaits launch by Slurm scheduler
Jun 17 14:04:38 UTC 2025 | running | job 464151 is running
Jun 17 18:23:02 UTC 2025 | finished | 🤷 UNKNOWN (click triangle for detailed information)
  • Job results file _bot_job464151.result does not exist in job directory, or parsing it failed.
  • No artefacts were found/reported.
Jun 17 18:23:02 UTC 2025 | test result | 🤷 UNKNOWN (click triangle for detailed information)
  • Job test file _bot_job464151.test does not exist in job directory, or parsing it failed.

@boegel
Contributor

boegel commented Jun 17, 2025

I think we need to change tactics a bit when building for A64FX, since by default there's less than 1 GB of memory per core on the Deucalion A64FX partition:
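
For a rough sense of scale, here is a back-of-the-envelope sketch (an illustration, not taken from the comment above). It uses the 30720 MiB per node reported in the ReFrame summary earlier in this thread and assumes 48 compute cores per A64FX node, which this thread does not state. The 2048 MiB per build process and the helper `max_parallel` are hypothetical.

```python
NODE_MEM_MIB = 30720   # per-node memory from the ReFrame summary in this thread
CORES_PER_NODE = 48    # ASSUMPTION: compute cores per A64FX node, not stated here


def max_parallel(node_mem_mib, cores, mem_per_proc_mib):
    """Largest number of parallel build processes that fits in node memory."""
    if mem_per_proc_mib < 1:
        raise ValueError("mem_per_proc_mib must be positive")
    return max(1, min(cores, node_mem_mib // mem_per_proc_mib))


mem_per_core = NODE_MEM_MIB / CORES_PER_NODE
print(mem_per_core)  # 640.0 (MiB per core, i.e. below 1 GB)

# Hypothetical budget of 2048 MiB per compiler process.
print(max_parallel(NODE_MEM_MIB, CORES_PER_NODE, 2048))  # 15
```

Under these assumptions, building on all 48 cores at once leaves each process about 640 MiB, so memory-hungry packages would need a build parallelism well below the core count.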
