Skip to content

Conversation

@trz42
Copy link
Collaborator

@trz42 trz42 commented May 27, 2025

Separated from #1051

@trz42 trz42 added 2023.06-software.eessi.io 2023.06 version of software.eessi.io a64fx labels May 27, 2025
@trz42
Copy link
Collaborator Author

trz42 commented May 27, 2025

bot: build instance:eessi-bot-deucalion repository:eessi.io-2023.06-software architecture:aarch64/a64fx

@eessi-bot-deucalion
Copy link

eessi-bot-deucalion bot commented May 27, 2025

New job on instance eessi-bot-deucalion for CPU micro-architecture aarch64-a64fx for repository eessi.io-2023.06-software in job dir /home/eessibot/new-bot/jobs/2025.05/pr_1098/438526

date job status comment
May 27 21:16:21 UTC 2025 submitted job id 438526 awaits release by job manager
May 27 21:17:21 UTC 2025 released job awaits launch by Slurm scheduler
May 27 21:18:24 UTC 2025 running job 438526 is running
May 27 22:14:47 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-438526.out
✅ no message matching FATAL:
❌ found message matching ERROR:
❌ found message matching FAILED:
❌ found message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-aarch64-a64fx-17483835730.tar.gzsize: 21 MiB (22037723 bytes)
entries: 899
modules under 2023.06/software/linux/aarch64/a64fx/modules/all
Bison/3.8.2-GCCcore-12.2.0.lua
ICU/72.1-GCCcore-12.2.0.lua
Ninja/1.11.1-GCCcore-12.2.0.lua
PCRE2/10.40-GCCcore-12.2.0.lua
software under 2023.06/software/linux/aarch64/a64fx/software
Bison/3.8.2-GCCcore-12.2.0
ICU/72.1-GCCcore-12.2.0
Ninja/1.11.1-GCCcore-12.2.0
PCRE2/10.40-GCCcore-12.2.0
other under 2023.06/software/linux/aarch64/a64fx
no other files in tarball
May 27 22:14:47 UTC 2025 test result
😁 SUCCESS (click triangle for details)
ReFrame Summary
[ SKIP ] (1/9) Skipping test: nodes in this partition only have 30720 MiB memory available (per node) accodring to the current ReFrame configuration, but 49152 MiB is needed
[ SKIP ] (2/9) Skipping test: nodes in this partition only have 30720 MiB memory available (per node) accodring to the current ReFrame configuration, but 49152 MiB is needed
[ SKIP ] (3/9) Skipping test: nodes in this partition only have 30720 MiB memory available (per node) accodring to the current ReFrame configuration, but 49152 MiB is needed
[ SKIP ] (4/9) Skipping test: nodes in this partition only have 30720 MiB memory available (per node) accodring to the current ReFrame configuration, but 49152 MiB is needed
[ OK ] (5/9) EESSI_LAMMPS_lj %device_type=cpu %module_name=LAMMPS/2Aug2023_update2-foss-2023a-kokkos %scale=1_node /04ff9ece @BotBuildTests:aarch64_a64fx+default
P: perf: 579.282 timesteps/s (r:0, l:None, u:None)
[ OK ] (6/9) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_latency %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node /15cad6c4 @BotBuildTests:aarch64_a64fx+default
P: latency: 1.74 us (r:0, l:None, u:None)
[ OK ] (7/9) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_latency %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node /6672deda @BotBuildTests:aarch64_a64fx+default
P: latency: 1.77 us (r:0, l:None, u:None)
[ OK ] (8/9) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_bw %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node /2a9a47b1 @BotBuildTests:aarch64_a64fx+default
P: bandwidth: 7667.77 MB/s (r:0, l:None, u:None)
[ OK ] (9/9) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_bw %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node /1b24ab8e @BotBuildTests:aarch64_a64fx+default
P: bandwidth: 7612.59 MB/s (r:0, l:None, u:None)
[ PASSED ] Ran 5/9 test case(s) from 9 check(s) (0 failure(s), 4 skipped, 0 aborted)
Details
✅ job output file slurm-438526.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@boegel boegel changed the base branch from 2023.06-software.eessi.io to main June 15, 2025 14:48
@boegel
Copy link
Contributor

boegel commented Jun 15, 2025

bot: build instance:eessi-bot-deucalion repository:eessi.io-2023.06-software architecture:aarch64/a64fx

@eessi-bot-deucalion
Copy link

eessi-bot-deucalion bot commented Jun 15, 2025

New job on instance eessi-bot-deucalion for CPU micro-architecture aarch64-a64fx for repository eessi.io-2023.06-software in job dir /home/eessibot/new-bot/jobs/2025.06/pr_1098/461125

date job status comment
Jun 15 14:49:08 UTC 2025 submitted job id 461125 awaits release by job manager
Jun 15 14:49:41 UTC 2025 released job awaits launch by Slurm scheduler
Jun 15 14:50:44 UTC 2025 running job 461125 is running
Jun 15 15:42:51 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-461125.out
✅ no message matching FATAL:
❌ found message matching ERROR:
❌ found message matching FAILED:
❌ found message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-aarch64-a64fx-17500016520.tar.gzsize: 21 MiB (22045670 bytes)
entries: 899
modules under 2023.06/software/linux/aarch64/a64fx/modules/all
Bison/3.8.2-GCCcore-12.2.0.lua
ICU/72.1-GCCcore-12.2.0.lua
Ninja/1.11.1-GCCcore-12.2.0.lua
PCRE2/10.40-GCCcore-12.2.0.lua
software under 2023.06/software/linux/aarch64/a64fx/software
Bison/3.8.2-GCCcore-12.2.0
ICU/72.1-GCCcore-12.2.0
Ninja/1.11.1-GCCcore-12.2.0
PCRE2/10.40-GCCcore-12.2.0
other under 2023.06/software/linux/aarch64/a64fx
no other files in tarball
Jun 15 15:42:51 UTC 2025 test result
😁 SUCCESS (click triangle for details)
ReFrame Summary
[ SKIP ] (1/9) Skipping test: nodes in this partition only have 30720 MiB memory available (per node) accodring to the current ReFrame configuration, but 49152 MiB is needed
[ SKIP ] (2/9) Skipping test: nodes in this partition only have 30720 MiB memory available (per node) accodring to the current ReFrame configuration, but 49152 MiB is needed
[ SKIP ] (3/9) Skipping test: nodes in this partition only have 30720 MiB memory available (per node) accodring to the current ReFrame configuration, but 49152 MiB is needed
[ SKIP ] (4/9) Skipping test: nodes in this partition only have 30720 MiB memory available (per node) accodring to the current ReFrame configuration, but 49152 MiB is needed
[ OK ] (5/9) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_latency %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node /15cad6c4 @BotBuildTests:aarch64_a64fx+default
P: latency: 1.73 us (r:0, l:None, u:None)
[ OK ] (6/9) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_latency %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node /6672deda @BotBuildTests:aarch64_a64fx+default
P: latency: 1.81 us (r:0, l:None, u:None)
[ OK ] (7/9) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_bw %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node /2a9a47b1 @BotBuildTests:aarch64_a64fx+default
P: bandwidth: 7588.3 MB/s (r:0, l:None, u:None)
[ OK ] (8/9) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_bw %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node /1b24ab8e @BotBuildTests:aarch64_a64fx+default
P: bandwidth: 7766.1 MB/s (r:0, l:None, u:None)
[ OK ] (9/9) EESSI_LAMMPS_lj %device_type=cpu %module_name=LAMMPS/2Aug2023_update2-foss-2023a-kokkos %scale=1_node /04ff9ece @BotBuildTests:aarch64_a64fx+default
P: perf: 416.528 timesteps/s (r:0, l:None, u:None)
[ PASSED ] Ran 5/9 test case(s) from 9 check(s) (0 failure(s), 4 skipped, 0 aborted)
Details
✅ job output file slurm-461125.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@bedroge
Copy link
Collaborator

bedroge commented Jul 9, 2025

The build of Rust 1.65.0 failed with a weird error:

[ 54%] Built target LLVMTarget
[ 54%] Building CXX object lib/Transforms/Utils/CMakeFiles/LLVMTransformUtils.dir/PredicateInfo.cpp.o
[ 54%] Building CXX object lib/Transforms/Utils/CMakeFiles/LLVMTransformUtils.dir/PromoteMemoryToRegister.cpp.o
[ 54%] Building CXX object lib/Transforms/Utils/CMakeFiles/LLVMTransformUtils.dir/RelLookupTableConverter.cpp.o
[ 54%] Building CXX object lib/Transforms/Utils/CMakeFiles/LLVMTransformUtils.dir/ScalarEvolutionExpander.cpp.o
[ 54%] Building CXX object lib/Transforms/Utils/CMakeFiles/LLVMTransformUtils.dir/SCCPSolver.cpp.o
[ 55%] Building CXX object lib/Transforms/Utils/CMakeFiles/LLVMTransformUtils.dir/StripGCRelocates.cpp.o
[ 55%] Linking CXX executable ../../bin/llvm-tli-checker
[ 55%] Linking CXX executable ../../bin/llvm-sim
[ 55%] Linking CXX executable ../../bin/llvm-stress
[ 55%] Building CXX object lib/Transforms/Utils/CMakeFiles/LLVMTransformUtils.dir/SSAUpdater.cpp.o
[ 55%] Building CXX object lib/Transforms/Utils/CMakeFiles/LLVMTransformUtils.dir/SSAUpdaterBulk.cpp.o
[ 55%] Built target llvm-stress
[ 55%] Built target llvm-sim
[ 55%] Building CXX object lib/Transforms/Utils/CMakeFiles/LLVMTransformUtils.dir/SampleProfileInference.cpp.o
[ 55%] Built target llvm-tli-checker
[ 55%] Building CXX object lib/DWP/CMakeFiles/LLVMDWP.dir/DWP.cpp.o
[ 55%] Building CXX object lib/DWP/CMakeFiles/LLVMDWP.dir/DWPError.cpp.o
/tmp/eb-ujicqi2g/eb-q95hr60e/ccMOYzFH.s: Assembler messages:
/tmp/eb-ujicqi2g/eb-q95hr60e/ccMOYzFH.s:18063: Error: immediate offset out of range 0 to 31 at operand 3 -- `st1b z2.d,p0,[z0.d,#88]'

@ocaisa
Copy link
Member

ocaisa commented Jul 9, 2025

EasyBuild LLM integration would have probably helped here:

This is an ARM SVE (Scalable Vector Extension) assembly instruction (st1b) where the offset #88 is too large for what the instruction allows.

Specifically, st1b stores bytes to memory using predicate-controlled SVE addressing, and the immediate offset must be within the range [0, 31] (i.e., it's encoded in 5 bits). An offset of #88 is out of range.

@bedroge
Copy link
Collaborator

bedroge commented Jul 9, 2025

EasyBuild LLM integration would have probably helped here:

This is an ARM SVE (Scalable Vector Extension) assembly instruction (st1b) where the offset #88 is too large for what the instruction allows.

Specifically, st1b stores bytes to memory using predicate-controlled SVE addressing, and the immediate offset must be within the range [0, 31] (i.e., it's encoded in 5 bits). An offset of #88 is out of range.

That is indeed helpful 😄 But can it also provide a patch? 😅

@ocaisa
Copy link
Member

ocaisa commented Jul 9, 2025

It tries, one thing it says is

Make sure your assembler (binutils) supports the instruction set your compiler is generating.

and I wonder if the vector widths of a64fx (which IIRC are especially wide) are actually supported.

The other option is to switch off SVE somehow.

@bedroge
Copy link
Collaborator

bedroge commented Jul 9, 2025

It tries, one thing it says is

Make sure your assembler (binutils) supports the instruction set your compiler is generating.

and I wonder if the vector widths of a64fx (which IIRC are especially wide) are actually supported.

The other option is to switch off SVE somehow.

A slightly newer Rust (1.70.0 instead of 1.65.0) built with GCCcore-12.3.0 instead of GCCcore 12.2.0 does work fine. And the binutils version is fixed, since that's taken from the compat layer.

@bedroge
Copy link
Collaborator

bedroge commented Sep 23, 2025

Let's give this another try, the updated hook from EESSI/software-layer-scripts#86 should make sure that we use Rust 1.75.0 (from #1200).

bot: build repo:eessi.io-2023.06-software instance:eessi-bot-deucalion for:arch=aarch64/a64fx

@eessi-bot-deucalion
Copy link

eessi-bot-deucalion bot commented Sep 23, 2025

New job on instance eessi-bot-deucalion for repository eessi.io-2023.06-software
Building on: a64fx
Building for: aarch64/a64fx
Job dir: /home/eessibot/new-bot/jobs/2025.09/pr_1098/553601

date job status comment
Sep 23 14:32:18 UTC 2025 submitted job id 553601 awaits release by job manager
Sep 23 14:33:05 UTC 2025 released job awaits launch by Slurm scheduler
Sep 23 14:34:07 UTC 2025 running job 553601 is running
Sep 23 19:03:36 UTC 2025 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-553601.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-aarch64-a64fx-17586536550.tar.gzsize: 459 MiB (481884670 bytes)
entries: 32481
modules under 2023.06/software/linux/aarch64/a64fx/modules/all
Brotli/1.0.9-GCCcore-12.2.0.lua
Doxygen/1.9.5-GCCcore-12.2.0.lua
GLib/2.75.0-GCCcore-12.2.0.lua
GObject-Introspection/1.74.0-GCCcore-12.2.0.lua
HarfBuzz/5.3.1-GCCcore-12.2.0.lua
ICU/72.1-GCCcore-12.2.0.lua
Meson/0.64.0-GCCcore-12.2.0.lua
Ninja/1.11.1-GCCcore-12.2.0.lua
PCRE2/10.40-GCCcore-12.2.0.lua
Python/3.10.8-GCCcore-12.2.0.lua
X11/20221110-GCCcore-12.2.0.lua
cairo/1.17.4-GCCcore-12.2.0.lua
fontconfig/2.14.1-GCCcore-12.2.0.lua
freetype/2.12.1-GCCcore-12.2.0.lua
git/2.38.1-GCCcore-12.2.0-nodocs.lua
libiconv/1.17-GCCcore-12.2.0.lua
libpng/1.6.38-GCCcore-12.2.0.lua
pixman/0.42.2-GCCcore-12.2.0.lua
software under 2023.06/software/linux/aarch64/a64fx/software
Brotli/1.0.9-GCCcore-12.2.0
Doxygen/1.9.5-GCCcore-12.2.0
GLib/2.75.0-GCCcore-12.2.0
GObject-Introspection/1.74.0-GCCcore-12.2.0
HarfBuzz/5.3.1-GCCcore-12.2.0
ICU/72.1-GCCcore-12.2.0
Meson/0.64.0-GCCcore-12.2.0
Ninja/1.11.1-GCCcore-12.2.0
PCRE2/10.40-GCCcore-12.2.0
Python/3.10.8-GCCcore-12.2.0
X11/20221110-GCCcore-12.2.0
cairo/1.17.4-GCCcore-12.2.0
fontconfig/2.14.1-GCCcore-12.2.0
freetype/2.12.1-GCCcore-12.2.0
git/2.38.1-GCCcore-12.2.0-nodocs
libiconv/1.17-GCCcore-12.2.0
libpng/1.6.38-GCCcore-12.2.0
pixman/0.42.2-GCCcore-12.2.0
reprod directories under 2023.06/software/linux/aarch64/a64fx/reprod
no reprod directories in tarball
other under 2023.06/software/linux/aarch64/a64fx
no other files in tarball
Sep 23 19:03:36 UTC 2025 test result
😁 SUCCESS (click triangle for details)
ReFrame Summary
[ SKIP ] ( 1/10) Skipping test: nodes in this partition only have 30720 MiB memory available (per node) accodring to the current ReFrame configuration, but 49152 MiB is needed
[ SKIP ] ( 2/10) Skipping test: nodes in this partition only have 30720 MiB memory available (per node) accodring to the current ReFrame configuration, but 49152 MiB is needed
[ SKIP ] ( 3/10) Skipping test: nodes in this partition only have 30720 MiB memory available (per node) accodring to the current ReFrame configuration, but 49152 MiB is needed
[ SKIP ] ( 4/10) Skipping test: nodes in this partition only have 30720 MiB memory available (per node) accodring to the current ReFrame configuration, but 49152 MiB is needed
[ OK ] ( 5/10) EESSI_LAMMPS_lj %device_type=cpu %module_name=LAMMPS/29Aug2024-foss-2023b-kokkos %scale=1_node /aeb2d9df @BotBuildTests:aarch64_a64fx+default
P: perf: 56.777 timesteps/s (r:0, l:None, u:None)
[ OK ] ( 6/10) EESSI_LAMMPS_lj %device_type=cpu %module_name=LAMMPS/2Aug2023_update2-foss-2023a-kokkos %scale=1_node /04ff9ece @BotBuildTests:aarch64_a64fx+default
P: perf: 583.699 timesteps/s (r:0, l:None, u:None)
[ OK ] ( 7/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_latency %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node /15cad6c4 @BotBuildTests:aarch64_a64fx+default
P: latency: 1.68 us (r:0, l:None, u:None)
[ OK ] ( 8/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_latency %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node /6672deda @BotBuildTests:aarch64_a64fx+default
P: latency: 1.71 us (r:0, l:None, u:None)
[ OK ] ( 9/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_bw %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node /2a9a47b1 @BotBuildTests:aarch64_a64fx+default
P: bandwidth: 8760.23 MB/s (r:0, l:None, u:None)
[ OK ] (10/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_bw %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node /1b24ab8e @BotBuildTests:aarch64_a64fx+default
P: bandwidth: 8615.86 MB/s (r:0, l:None, u:None)
[ PASSED ] Ran 6/10 test case(s) from 10 check(s) (0 failure(s), 4 skipped, 0 aborted)
Details
✅ job output file slurm-553601.out
✅ no message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case
Sep 23 20:02:46 UTC 2025 uploaded transfer of eessi-2023.06-software-linux-aarch64-a64fx-17586536550.tar.gz to S3 bucket succeeded

@bedroge bedroge added the bot:deploy Ask bot to deploy missing software installations to EESSI label Sep 23, 2025
@bedroge
Copy link
Collaborator

bedroge commented Sep 23, 2025

Staging PR merged, tarball ingested.

@bedroge bedroge merged commit d38bb14 into EESSI:main Sep 23, 2025
50 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

2023.06-software.eessi.io 2023.06 version of software.eessi.io a64fx bot:deploy Ask bot to deploy missing software installations to EESSI

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants