Skip to content

Conversation

@casparvl
Copy link
Collaborator

@casparvl casparvl commented Oct 30, 2025

I think we should deploy the script from EESSI/software-layer-scripts#120 through this current PR, then change the build.sh back to it's original form. The issue is that EESSI/software-layer-scripts#120 can't be deployed there, because no software is built, and thus no "no missing installations" message is printed. This causes the bot to consider the build step a 'failure'.

@casparvl
Copy link
Collaborator Author

bot: build repo:eessi.io-2025.06-software instance:eessi-bot-surf for:arch=x86_64/intel/icelake,accel=nvidia/cc80

@eessi-bot-surf
Copy link

eessi-bot-surf bot commented Oct 30, 2025

New job on instance eessi-bot-surf for repository eessi.io-2025.06-software
Building on: intel-icelake and accelerator nvidia/cc80
Building for: x86_64/intel/icelake and accelerator nvidia/cc80
Job dir: /projects/eessibot/eessi-bot-surf/jobs/2025.10/pr_1278/15643733

date job status comment
Oct 30 13:02:23 UTC 2025 submitted job id 15643733 will be eligible to start in about 20 seconds
Oct 30 13:02:30 UTC 2025 received job awaits launch by Slurm scheduler
Oct 30 13:02:55 UTC 2025 running job 15643733 is running
Oct 30 13:04:05 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-15643733.out
✅ no message matching FATAL:
❌ found message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2025.06-software-linux-x86_64-intel-icelake-accel-nvidia-cc80-17618294030.tar.gzsize: 0 MiB (45 bytes)
entries: 0
modules under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/modules/all
no module files in tarball
software under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/software
no software packages in tarball
reprod directories under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/reprod
no reprod directories in tarball
other under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80
no other files in tarball
Oct 30 13:04:05 UTC 2025 test result
😁 SUCCESS (click triangle for details)
ReFrame Summary
[ PASSED ] Ran 0/0 test case(s) from 0 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-15643733.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@casparvl
Copy link
Collaborator Author

bot: build repo:eessi.io-2025.06-software instance:eessi-bot-surf for:arch=x86_64/intel/icelake,accel=nvidia/cc80

@eessi-bot-surf
Copy link

eessi-bot-surf bot commented Oct 30, 2025

New job on instance eessi-bot-surf for repository eessi.io-2025.06-software
Building on: intel-icelake and accelerator nvidia/cc80
Building for: x86_64/intel/icelake and accelerator nvidia/cc80
Job dir: /projects/eessibot/eessi-bot-surf/jobs/2025.10/pr_1278/15644119

date job status comment
Oct 30 13:16:00 UTC 2025 submitted job id 15644119 will be eligible to start in about 20 seconds
Oct 30 13:16:11 UTC 2025 received job awaits launch by Slurm scheduler
Oct 30 13:16:24 UTC 2025 running job 15644119 is running
Oct 30 13:17:58 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-15644119.out
✅ no message matching FATAL:
❌ found message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2025.06-software-linux-x86_64-intel-icelake-accel-nvidia-cc80-17618302260.tar.gzsize: 0 MiB (421 bytes)
entries: 1
modules under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/modules/all
no module files in tarball
software under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/software
no software packages in tarball
reprod directories under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/reprod
no reprod directories in tarball
other under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80
2025.06/scripts/gpu_support/nvidia/easystacks/eessi-2025.06-eb-5.1.2-CUDA-host-injections.yml
Oct 30 13:17:58 UTC 2025 test result
😁 SUCCESS (click triangle for details)
ReFrame Summary
[ PASSED ] Ran 0/0 test case(s) from 0 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-15644119.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@casparvl
Copy link
Collaborator Author

bot: build repo:eessi.io-2025.06-software instance:eessi-bot-surf for:arch=x86_64/intel/icelake,accel=nvidia/cc80

@eessi-bot-surf
Copy link

eessi-bot-surf bot commented Oct 30, 2025

New job on instance eessi-bot-surf for repository eessi.io-2025.06-software
Building on: intel-icelake and accelerator nvidia/cc80
Building for: x86_64/intel/icelake and accelerator nvidia/cc80
Job dir: /projects/eessibot/eessi-bot-surf/jobs/2025.10/pr_1278/15644178

date job status comment
Oct 30 13:19:57 UTC 2025 submitted job id 15644178 will be eligible to start in about 20 seconds
Oct 30 13:20:03 UTC 2025 received job awaits launch by Slurm scheduler
Oct 30 13:20:26 UTC 2025 running job 15644178 is running
Oct 30 13:46:57 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-15644178.out
✅ no message matching FATAL:
❌ found message matching ERROR:
❌ found message matching FAILED:
✅ no message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2025.06-software-linux-x86_64-intel-icelake-accel-nvidia-cc80-17618319710.tar.gzsize: 0 MiB (420 bytes)
entries: 1
modules under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/modules/all
no module files in tarball
software under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/software
no software packages in tarball
reprod directories under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/reprod
no reprod directories in tarball
other under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80
2025.06/scripts/gpu_support/nvidia/easystacks/eessi-2025.06-eb-5.1.2-CUDA-host-injections.yml
Oct 30 13:46:57 UTC 2025 test result
😁 SUCCESS (click triangle for details)
ReFrame Summary
[ PASSED ] Ran 0/0 test case(s) from 0 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-15644178.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@casparvl
Copy link
Collaborator Author

casparvl commented Oct 30, 2025

Failure in the cuDNN host injections installations because it doesn't contain ptx code (fixed in EESSI/software-layer-scripts@e25b625 en bf2fc9c)

Also, another failure:

ERROR: Failed to create directory /cvmfs/software.eessi.io/versions/2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/modules/all: [Errno 30] Read-only file system: '/cvmfs/software.eessi.io/versions/2025.06/software/linux/x86_64/intel/icelake/accel'

Not sure what's wrong here. We may be missing a mkdir -p, because this dir is not there yet since this is the first GPU software we install in this prefix. However, I thought we hit the same issue in 2023.06 and we fixed that - but it's been too long to remember. It might also be that we have a mkdir -p and that this is simply the error it hits when creating that dir...

@casparvl
Copy link
Collaborator Author

Added some extra verbosity EESSI/software-layer-scripts@54bd9ad , let's see

bot: build repo:eessi.io-2025.06-software instance:eessi-bot-surf for:arch=x86_64/intel/icelake,accel=nvidia/cc80

@eessi-bot-surf
Copy link

eessi-bot-surf bot commented Oct 30, 2025

New job on instance eessi-bot-surf for repository eessi.io-2025.06-software
Building on: intel-icelake and accelerator nvidia/cc80
Building for: x86_64/intel/icelake and accelerator nvidia/cc80
Job dir: /projects/eessibot/eessi-bot-surf/jobs/2025.10/pr_1278/15645493

date job status comment
Oct 30 14:26:09 UTC 2025 submitted job id 15645493 will be eligible to start in about 20 seconds
Oct 30 14:26:15 UTC 2025 received job awaits launch by Slurm scheduler
Oct 30 14:26:39 UTC 2025 running job 15645493 is running
Oct 30 15:05:14 UTC 2025 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-15645493.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2025.06-software-linux-x86_64-intel-icelake-accel-nvidia-cc80-17618356610.tar.gzsize: 6872 MiB (7206354023 bytes)
entries: 12679
modules under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/modules/all
CUDA/12.6.0.lua
CUDA/12.8.0.lua
cuDNN/9.10.1.4-CUDA-12.8.0.lua
cuDNN/9.5.0.50-CUDA-12.6.0.lua
software under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/software
CUDA/12.6.0
CUDA/12.8.0
cuDNN/9.10.1.4-CUDA-12.8.0
cuDNN/9.5.0.50-CUDA-12.6.0
reprod directories under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/reprod
CUDA/12.6.0/20251030_143907UTC
CUDA/12.8.0/20251030_144306UTC
cuDNN/9.10.1.4-CUDA-12.8.0/20251030_144727UTC
cuDNN/9.5.0.50-CUDA-12.6.0/20251030_144502UTC
other under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80
2025.06/scripts/gpu_support/nvidia/easystacks/eessi-2025.06-eb-5.1.2-CUDA-host-injections.yml
Oct 30 15:05:14 UTC 2025 test result
😁 SUCCESS (click triangle for details)
ReFrame Summary
[ PASSED ] Ran 0/0 test case(s) from 0 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-15645493.out
✅ no message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@casparvl
Copy link
Collaborator Author

casparvl commented Oct 30, 2025

Making it verbose seems to have solved the issue. That is, of course, impossible, but... things are working now:

mkdir: created directory '/cvmfs/software.eessi.io/versions/2025.06/software/linux/x86_64/intel/icelake/accel'
mkdir: created directory '/cvmfs/software.eessi.io/versions/2025.06/software/linux/x86_64/intel/icelake/accel/nvidia'
mkdir: created directory '/cvmfs/software.eessi.io/versions/2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80'
mkdir: created directory '/cvmfs/software.eessi.io/versions/2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/modules'
mkdir: created directory '/cvmfs/software.eessi.io/versions/2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/modules/all'
...
== COMPLETED: Installation ended successfully (took 3 mins 3 secs)
== Results of the build can be found in the log file(s) /cvmfs/software.eessi.io/versions/2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/software/CUDA/12.6.0/easybuild/easybuild-CUDA-12.6.0-20251030.153905.log.bz2

So maybe this was just one more of unionfs's hickups?

@casparvl
Copy link
Collaborator Author

Let's get all of those host-injections installed...

All bots that run native builds (one architecture per bot is sufficient)

bot: build repo:eessi.io-2025.06-software instance:eessi-bot-vsc-ugent for:arch=x86_64/intel/cascadelake,accel=nvidia/cc70
bot: build repo:eessi.io-2025.06-software instance:eessi-bot-jsc for:arch=aarch64/nvidia/grace,accel=nvidia/cc90

x86_64 and arm archs on AWS bot:

bot: build repo:eessi.io-2025.06-software instance:eessi-bot-mc-aws on:arch=zen2 for:arch=x86_64/amd/zen2,accel=nvidia/cc70
bot: build repo:eessi.io-2025.06-software instance:eessi-bot-mc-aws on:arch=x86_64/generic for:arch=x86_64/generic,accel=nvidia/cc70

@eessi-bot-jsc
Copy link

eessi-bot-jsc bot commented Oct 30, 2025

New job on instance eessi-bot-jsc for repository eessi.io-2025.06-software
Building on: nvidia-grace and accelerator nvidia/cc90
Building for: aarch64/nvidia/grace and accelerator nvidia/cc90
Job dir: /p/project1/ceasybuilders/eessibot/jobs/2025.10/pr_1278/14161581

date job status comment
Oct 30 15:39:49 UTC 2025 submitted job id 14161581 awaits release by job manager
Oct 30 15:40:07 UTC 2025 released job awaits launch by Slurm scheduler
Oct 30 15:41:11 UTC 2025 running job 14161581 is running
Oct 30 17:17:12 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-14161581.out
✅ no message matching FATAL:
❌ found message matching ERROR:
❌ found message matching FAILED:
❌ found message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2025.06-software-linux-aarch64-nvidia-grace-accel-nvidia-cc90-17618429730.tar.gzsize: 5980 MiB (6271255423 bytes)
entries: 8879
modules under 2025.06/software/linux/aarch64/nvidia/grace/accel/nvidia/cc90/modules/all
CUDA/12.6.0.lua
CUDA/12.8.0.lua
cuDNN/9.5.0.50-CUDA-12.6.0.lua
software under 2025.06/software/linux/aarch64/nvidia/grace/accel/nvidia/cc90/software
CUDA/12.6.0
CUDA/12.8.0
cuDNN/9.5.0.50-CUDA-12.6.0
reprod directories under 2025.06/software/linux/aarch64/nvidia/grace/accel/nvidia/cc90/reprod
CUDA/12.6.0/20251030_161626UTC
CUDA/12.8.0/20251030_163743UTC
cuDNN/9.5.0.50-CUDA-12.6.0/20251030_163905UTC
other under 2025.06/software/linux/aarch64/nvidia/grace/accel/nvidia/cc90
2025.06/scripts/gpu_support/nvidia/easystacks/eessi-2025.06-eb-5.1.2-CUDA-host-injections.yml
Oct 30 17:17:12 UTC 2025 test result
😁 SUCCESS (click triangle for details)
ReFrame Summary
[ PASSED ] Ran 0/0 test case(s) from 0 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-14161581.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@eessi-bot-aws
Copy link

eessi-bot-aws bot commented Oct 30, 2025

New job on instance eessi-bot-mc-aws for repository eessi.io-2025.06-software
Building on: amd-zen2
Building for: x86_64/amd/zen2 and accelerator nvidia/cc70
Job dir: /project/def-users/SHARED/jobs/2025.10/pr_1278/100486

date job status comment
Oct 30 15:39:50 UTC 2025 submitted job id 100486 awaits release by job manager
Oct 30 15:40:11 UTC 2025 released job awaits launch by Slurm scheduler
Oct 30 15:46:16 UTC 2025 running job 100486 is running
Oct 30 16:20:40 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-100486.out
✅ no message matching FATAL:
❌ found message matching ERROR:
❌ found message matching FAILED:
❌ found message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2025.06-software-linux-x86_64-amd-zen2-accel-nvidia-cc70-17618404130.tar.gzsize: 6197 MiB (6498851361 bytes)
entries: 12594
modules under 2025.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc70/modules/all
CUDA/12.6.0.lua
CUDA/12.8.0.lua
cuDNN/9.5.0.50-CUDA-12.6.0.lua
software under 2025.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc70/software
CUDA/12.6.0
CUDA/12.8.0
cuDNN/9.5.0.50-CUDA-12.6.0
reprod directories under 2025.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc70/reprod
CUDA/12.6.0/20251030_155004UTC
CUDA/12.8.0/20251030_155518UTC
cuDNN/9.5.0.50-CUDA-12.6.0/20251030_155800UTC
other under 2025.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc70
2025.06/scripts/gpu_support/nvidia/easystacks/eessi-2025.06-eb-5.1.2-CUDA-host-injections.yml
Oct 30 16:20:40 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-100486.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@eessi-bot-aws
Copy link

eessi-bot-aws bot commented Oct 30, 2025

New job on instance eessi-bot-mc-aws for repository eessi.io-2025.06-software
Building on: generic
Building for: x86_64/generic and accelerator nvidia/cc70
Job dir: /project/def-users/SHARED/jobs/2025.10/pr_1278/100487

date job status comment
Oct 30 15:39:56 UTC 2025 submitted job id 100487 awaits release by job manager
Oct 30 15:40:13 UTC 2025 released job awaits launch by Slurm scheduler
Oct 30 15:46:18 UTC 2025 running job 100487 is running
Oct 30 16:33:58 UTC 2025 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-100487.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2025.06-software-linux-x86_64-generic-accel-nvidia-cc70-17618411070.tar.gzsize: 6872 MiB (7206333085 bytes)
entries: 12679
modules under 2025.06/software/linux/x86_64/generic/accel/nvidia/cc70/modules/all
CUDA/12.6.0.lua
CUDA/12.8.0.lua
cuDNN/9.10.1.4-CUDA-12.8.0.lua
cuDNN/9.5.0.50-CUDA-12.6.0.lua
software under 2025.06/software/linux/x86_64/generic/accel/nvidia/cc70/software
CUDA/12.6.0
CUDA/12.8.0
cuDNN/9.10.1.4-CUDA-12.8.0
cuDNN/9.5.0.50-CUDA-12.6.0
reprod directories under 2025.06/software/linux/x86_64/generic/accel/nvidia/cc70/reprod
CUDA/12.6.0/20251030_160705UTC
CUDA/12.8.0/20251030_161220UTC
cuDNN/9.10.1.4-CUDA-12.8.0/20251030_161645UTC
cuDNN/9.5.0.50-CUDA-12.6.0/20251030_161420UTC
other under 2025.06/software/linux/x86_64/generic/accel/nvidia/cc70
2025.06/scripts/gpu_support/nvidia/easystacks/eessi-2025.06-eb-5.1.2-CUDA-host-injections.yml
Oct 30 16:33:58 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-100487.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@casparvl casparvl added 2025.06-software.eessi.io 2025.06 version of software.eessi.io accel:nvidia labels Oct 30, 2025
@casparvl
Copy link
Collaborator Author

casparvl commented Oct 30, 2025

Edit: not sure why the previous build failed. The installations in the host_injections failed with a message that the lock file was already present. That's very strange, there should not be a lock file in the host_injections...

bot: build repo:eessi.io-2025.06-software instance:eessi-bot-mc-aws on:arch=zen2 for:arch=x86_64/amd/zen2,accel=nvidia/cc70

@eessi-bot-aws
Copy link

eessi-bot-aws bot commented Oct 30, 2025

New job on instance eessi-bot-mc-aws for repository eessi.io-2025.06-software
Building on: amd-zen2
Building for: x86_64/amd/zen2 and accelerator nvidia/cc70
Job dir: /project/def-users/SHARED/jobs/2025.10/pr_1278/100489

date job status comment
Oct 30 21:20:43 UTC 2025 submitted job id 100489 awaits release by job manager
Oct 30 21:21:32 UTC 2025 released job awaits launch by Slurm scheduler
Oct 30 21:22:34 UTC 2025 running job 100489 is running
Oct 30 21:50:35 UTC 2025 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-100489.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2025.06-software-linux-x86_64-amd-zen2-accel-nvidia-cc70-17618601690.tar.gzsize: 6872 MiB (7206289626 bytes)
entries: 12679
modules under 2025.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc70/modules/all
CUDA/12.6.0.lua
CUDA/12.8.0.lua
cuDNN/9.10.1.4-CUDA-12.8.0.lua
cuDNN/9.5.0.50-CUDA-12.6.0.lua
software under 2025.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc70/software
CUDA/12.6.0
CUDA/12.8.0
cuDNN/9.10.1.4-CUDA-12.8.0
cuDNN/9.5.0.50-CUDA-12.6.0
reprod directories under 2025.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc70/reprod
CUDA/12.6.0/20251030_212519UTC
CUDA/12.8.0/20251030_212935UTC
cuDNN/9.10.1.4-CUDA-12.8.0/20251030_213418UTC
cuDNN/9.5.0.50-CUDA-12.6.0/20251030_213140UTC
other under 2025.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc70
2025.06/scripts/gpu_support/nvidia/easystacks/eessi-2025.06-eb-5.1.2-CUDA-host-injections.yml
Oct 30 21:50:35 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-100489.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@casparvl
Copy link
Collaborator Author

Oh crap, I see the issue, the other build was x86_64/generic, while intended to start it on ARM...

bot: build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws for:arch=aarch64/generic

@eessi-bot-aws
Copy link

eessi-bot-aws bot commented Oct 30, 2025

New job on instance eessi-bot-mc-aws for repository eessi.io-2023.06-software
Building on: generic
Building for: aarch64/generic
Job dir: /project/def-users/SHARED/jobs/2025.10/pr_1278/100490

date job status comment
Oct 30 21:24:47 UTC 2025 submitted job id 100490 awaits release by job manager
Oct 30 21:25:39 UTC 2025 released job awaits launch by Slurm scheduler
Oct 30 21:30:47 UTC 2025 running job 100490 is running
Oct 30 21:33:55 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-100490.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-aarch64-generic-17618598470.tar.gzsize: 0 MiB (45 bytes)
entries: 0
modules under 2023.06/software/linux/aarch64/generic/modules/all
no module files in tarball
software under 2023.06/software/linux/aarch64/generic/software
no software packages in tarball
reprod directories under 2023.06/software/linux/aarch64/generic/reprod
no reprod directories in tarball
other under 2023.06/software/linux/aarch64/generic
no other files in tarball
Oct 30 21:33:55 UTC 2025 test result
😁 SUCCESS (click triangle for details)
ReFrame Summary
[ OK ] ( 1/10) EESSI_LAMMPS_lj %device_type=cpu %module_name=LAMMPS/29Aug2024-foss-2023b-kokkos %scale=1_node /aeb2d9df @BotBuildTests:aarch64_generic+default
P: perf: 699.891 timesteps/s (r:0, l:None, u:None)
[ OK ] ( 2/10) EESSI_LAMMPS_lj %device_type=cpu %module_name=LAMMPS/2Aug2023_update2-foss-2023a-kokkos %scale=1_node /04ff9ece @BotBuildTests:aarch64_generic+default
P: perf: 697.918 timesteps/s (r:0, l:None, u:None)
[ OK ] ( 3/10) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_allreduce %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node %device_type=cpu /775175bf @BotBuildTests:aarch64_generic+default
P: latency: 3.24 us (r:0, l:None, u:None)
[ OK ] ( 4/10) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_allreduce %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node %device_type=cpu /52707c40 @BotBuildTests:aarch64_generic+default
P: latency: 3.47 us (r:0, l:None, u:None)
[ OK ] ( 5/10) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_alltoall %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node %device_type=cpu /b1aacda9 @BotBuildTests:aarch64_generic+default
P: latency: 5.51 us (r:0, l:None, u:None)
[ OK ] ( 6/10) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_alltoall %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node %device_type=cpu /c6bad193 @BotBuildTests:aarch64_generic+default
P: latency: 5.57 us (r:0, l:None, u:None)
[ OK ] ( 7/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_latency %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node /15cad6c4 @BotBuildTests:aarch64_generic+default
P: latency: 0.44 us (r:0, l:None, u:None)
[ OK ] ( 8/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_latency %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node /6672deda @BotBuildTests:aarch64_generic+default
P: latency: 0.46 us (r:0, l:None, u:None)
[ OK ] ( 9/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_bw %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node /2a9a47b1 @BotBuildTests:aarch64_generic+default
P: bandwidth: 20802.34 MB/s (r:0, l:None, u:None)
[ OK ] (10/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_bw %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node /1b24ab8e @BotBuildTests:aarch64_generic+default
P: bandwidth: 20541.79 MB/s (r:0, l:None, u:None)
[ PASSED ] Ran 10/10 test case(s) from 10 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-100490.out
✅ no message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@casparvl
Copy link
Collaborator Author

bot: build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws for:arch=aarch64/generic

@eessi-bot-aws
Copy link

eessi-bot-aws bot commented Oct 30, 2025

New job on instance eessi-bot-mc-aws for repository eessi.io-2023.06-software
Building on: generic
Building for: aarch64/generic
Job dir: /project/def-users/SHARED/jobs/2025.10/pr_1278/100491

date job status comment
Oct 30 21:37:29 UTC 2025 submitted job id 100491 awaits release by job manager
Oct 30 21:38:02 UTC 2025 released job awaits launch by Slurm scheduler
Oct 30 21:39:07 UTC 2025 running job 100491 is running
Oct 30 21:41:12 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-100491.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-aarch64-generic-17618603120.tar.gzsize: 0 MiB (45 bytes)
entries: 0
modules under 2023.06/software/linux/aarch64/generic/modules/all
no module files in tarball
software under 2023.06/software/linux/aarch64/generic/software
no software packages in tarball
reprod directories under 2023.06/software/linux/aarch64/generic/reprod
no reprod directories in tarball
other under 2023.06/software/linux/aarch64/generic
no other files in tarball
Oct 30 21:41:12 UTC 2025 test result
😁 SUCCESS (click triangle for details)
ReFrame Summary
[ OK ] ( 1/10) EESSI_LAMMPS_lj %device_type=cpu %module_name=LAMMPS/29Aug2024-foss-2023b-kokkos %scale=1_node /aeb2d9df @BotBuildTests:aarch64_generic+default
P: perf: 696.808 timesteps/s (r:0, l:None, u:None)
[ OK ] ( 2/10) EESSI_LAMMPS_lj %device_type=cpu %module_name=LAMMPS/2Aug2023_update2-foss-2023a-kokkos %scale=1_node /04ff9ece @BotBuildTests:aarch64_generic+default
P: perf: 706.728 timesteps/s (r:0, l:None, u:None)
[ OK ] ( 3/10) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_allreduce %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node %device_type=cpu /775175bf @BotBuildTests:aarch64_generic+default
P: latency: 3.51 us (r:0, l:None, u:None)
[ OK ] ( 4/10) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_allreduce %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node %device_type=cpu /52707c40 @BotBuildTests:aarch64_generic+default
P: latency: 3.5 us (r:0, l:None, u:None)
[ OK ] ( 5/10) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_alltoall %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node %device_type=cpu /b1aacda9 @BotBuildTests:aarch64_generic+default
P: latency: 5.45 us (r:0, l:None, u:None)
[ OK ] ( 6/10) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_alltoall %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node %device_type=cpu /c6bad193 @BotBuildTests:aarch64_generic+default
P: latency: 5.62 us (r:0, l:None, u:None)
[ OK ] ( 7/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_latency %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node /15cad6c4 @BotBuildTests:aarch64_generic+default
P: latency: 0.45 us (r:0, l:None, u:None)
[ OK ] ( 8/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_latency %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node /6672deda @BotBuildTests:aarch64_generic+default
P: latency: 0.44 us (r:0, l:None, u:None)
[ OK ] ( 9/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_bw %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node /2a9a47b1 @BotBuildTests:aarch64_generic+default
P: bandwidth: 20690.98 MB/s (r:0, l:None, u:None)
[ OK ] (10/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_bw %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node /1b24ab8e @BotBuildTests:aarch64_generic+default
P: bandwidth: 20851.54 MB/s (r:0, l:None, u:None)
[ PASSED ] Ran 10/10 test case(s) from 10 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-100491.out
✅ no message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@casparvl
Copy link
Collaborator Author

Wrong version...

bot: build repo:eessi.io-2025.06-software instance:eessi-bot-mc-aws for:arch=aarch64/generic

@eessi-bot-surf
Copy link

eessi-bot-surf bot commented Nov 4, 2025

New job on instance eessi-bot-surf for repository eessi.io-2025.06-software
Building on: amd-zen4 and accelerator nvidia/cc90
Building for: x86_64/amd/zen4 and accelerator nvidia/cc90
Job dir: /projects/eessibot/eessi-bot-surf/jobs/2025.11/pr_1278/15756355

date job status comment
Nov 04 16:44:26 UTC 2025 submitted job id 15756355 will be eligible to start in about 20 seconds
Nov 04 16:44:38 UTC 2025 received job awaits launch by Slurm scheduler
Nov 04 16:45:09 UTC 2025 running job 15756355 is running
Nov 04 17:05:24 UTC 2025 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-15756355.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2025.06-software-linux-x86_64-amd-zen4-accel-nvidia-cc90-17622750940.tar.gzsize: 6197 MiB (6498814027 bytes)
entries: 12593
modules under 2025.06/software/linux/x86_64/amd/zen4/accel/nvidia/cc90/modules/all
CUDA/12.6.0.lua
CUDA/12.8.0.lua
cuDNN/9.5.0.50-CUDA-12.6.0.lua
software under 2025.06/software/linux/x86_64/amd/zen4/accel/nvidia/cc90/software
CUDA/12.6.0
CUDA/12.8.0
cuDNN/9.5.0.50-CUDA-12.6.0
reprod directories under 2025.06/software/linux/x86_64/amd/zen4/accel/nvidia/cc90/reprod
CUDA/12.6.0/20251104_164737UTC
CUDA/12.8.0/20251104_164953UTC
cuDNN/9.5.0.50-CUDA-12.6.0/20251104_165124UTC
other under 2025.06/software/linux/x86_64/amd/zen4/accel/nvidia/cc90
no other files in tarball
Nov 04 17:05:24 UTC 2025 test result
😁 SUCCESS (click triangle for details)
ReFrame Summary
[ PASSED ] Ran 0/0 test case(s) from 0 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-15756355.out
✅ no message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@laraPPr
Copy link
Collaborator

laraPPr commented Nov 5, 2025

bot: help

@eessi-bot-aws
Copy link

eessi-bot-aws bot commented Nov 5, 2025

Updates by the bot instance eessi-bot-mc-aws (click for details)
  • received bot command help from laraPPr

    • expanded format: help
  • handling command help resulted in:
    How to send commands to bot instances

    • Commands must be sent with a new comment (edits of existing comments are ignored).
    • A comment may contain multiple commands, one per line.
    • Every command begins at the start of a line and has the syntax bot: COMMAND [ARGUMENTS]*
    • Currently supported COMMANDs are: help, build, show_config, status

    For more information, see https://www.eessi.io/docs/bot

@eessi-bot-surf
Copy link

eessi-bot-surf bot commented Nov 5, 2025

Updates by the bot instance eessi-bot-surf (click for details)
  • received bot command help from laraPPr

    • expanded format: help
  • handling command help resulted in:
    How to send commands to bot instances

    • Commands must be sent with a new comment (edits of existing comments are ignored).
    • A comment may contain multiple commands, one per line.
    • Every command begins at the start of a line and has the syntax bot: COMMAND [ARGUMENTS]*
    • Currently supported COMMANDs are: help, build, show_config, status

    For more information, see https://www.eessi.io/docs/bot

@eessi-bot-deucalion
Copy link

eessi-bot-deucalion bot commented Nov 5, 2025

Updates by the bot instance eessi-bot-deucalion (click for details)
  • received bot command help from laraPPr

    • expanded format: help
  • handling command help resulted in:
    How to send commands to bot instances

    • Commands must be sent with a new comment (edits of existing comments are ignored).
    • A comment may contain multiple commands, one per line.
    • Every command begins at the start of a line and has the syntax bot: COMMAND [ARGUMENTS]*
    • Currently supported COMMANDs are: help, build, show_config, status

    For more information, see https://www.eessi.io/docs/bot

@gpu-bot-ugent
Copy link

gpu-bot-ugent bot commented Nov 5, 2025

Updates by the bot instance eessi-bot-vsc-ugent (click for details)
  • received bot command help from laraPPr

    • expanded format: help
  • handling command help resulted in:
    How to send commands to bot instances

    • Commands must be sent with a new comment (edits of existing comments are ignored).
    • A comment may contain multiple commands, one per line.
    • Every command begins at the start of a line and has the syntax bot: COMMAND [ARGUMENTS]*
    • Currently supported COMMANDs are: help, build, show_config, status

    For more information, see https://www.eessi.io/docs/bot

@eessi-bot-jsc
Copy link

eessi-bot-jsc bot commented Nov 5, 2025

Updates by the bot instance eessi-bot-jsc (click for details)
  • received bot command help from laraPPr

    • expanded format: help
  • handling command help resulted in:
    How to send commands to bot instances

    • Commands must be sent with a new comment (edits of existing comments are ignored).
    • A comment may contain multiple commands, one per line.
    • Every command begins at the start of a line and has the syntax bot: COMMAND [ARGUMENTS]*
    • Currently supported COMMANDs are: help, build, show_config, status

    For more information, see https://www.eessi.io/docs/bot

@casparvl
Copy link
Collaborator Author

casparvl commented Nov 5, 2025

bot: build repo:eessi.io-2025.06-software instance:eessi-bot-vsc-ugent for:arch=x86_64/intel/cascadelake,accel=nvidia/cc70
bot: build repo:eessi.io-2025.06-software instance:eessi-bot-vsc-ugent for:arch=x86_64/amd/zen3,accel=nvidia/cc80

@laraPPr
Copy link
Collaborator

laraPPr commented Nov 5, 2025

bot: help

@eessi-bot-aws
Copy link

eessi-bot-aws bot commented Nov 5, 2025

Updates by the bot instance eessi-bot-mc-aws (click for details)
  • received bot command help from laraPPr

    • expanded format: help
  • handling command help resulted in:
    How to send commands to bot instances

    • Commands must be sent with a new comment (edits of existing comments are ignored).
    • A comment may contain multiple commands, one per line.
    • Every command begins at the start of a line and has the syntax bot: COMMAND [ARGUMENTS]*
    • Currently supported COMMANDs are: help, build, show_config, status

    For more information, see https://www.eessi.io/docs/bot

@gpu-bot-ugent
Copy link

gpu-bot-ugent bot commented Nov 5, 2025

Updates by the bot instance eessi-bot-vsc-ugent (click for details)
  • received bot command help from laraPPr

    • expanded format: help
  • handling command help resulted in:
    How to send commands to bot instances

    • Commands must be sent with a new comment (edits of existing comments are ignored).
    • A comment may contain multiple commands, one per line.
    • Every command begins at the start of a line and has the syntax bot: COMMAND [ARGUMENTS]*
    • Currently supported COMMANDs are: help, build, show_config, status

    For more information, see https://www.eessi.io/docs/bot

@eessi-bot-surf
Copy link

eessi-bot-surf bot commented Nov 5, 2025

Updates by the bot instance eessi-bot-surf (click for details)
  • received bot command help from laraPPr

    • expanded format: help
  • handling command help resulted in:
    How to send commands to bot instances

    • Commands must be sent with a new comment (edits of existing comments are ignored).
    • A comment may contain multiple commands, one per line.
    • Every command begins at the start of a line and has the syntax bot: COMMAND [ARGUMENTS]*
    • Currently supported COMMANDs are: help, build, show_config, status

    For more information, see https://www.eessi.io/docs/bot

@eessi-bot-deucalion
Copy link

eessi-bot-deucalion bot commented Nov 5, 2025

Updates by the bot instance eessi-bot-deucalion (click for details)
  • received bot command help from laraPPr

    • expanded format: help
  • handling command help resulted in:
    How to send commands to bot instances

    • Commands must be sent with a new comment (edits of existing comments are ignored).
    • A comment may contain multiple commands, one per line.
    • Every command begins at the start of a line and has the syntax bot: COMMAND [ARGUMENTS]*
    • Currently supported COMMANDs are: help, build, show_config, status

    For more information, see https://www.eessi.io/docs/bot

@eessi-bot-jsc
Copy link

eessi-bot-jsc bot commented Nov 5, 2025

Updates by the bot instance eessi-bot-jsc (click for details)
  • received bot command help from laraPPr

    • expanded format: help
  • handling command help resulted in:
    How to send commands to bot instances

    • Commands must be sent with a new comment (edits of existing comments are ignored).
    • A comment may contain multiple commands, one per line.
    • Every command begins at the start of a line and has the syntax bot: COMMAND [ARGUMENTS]*
    • Currently supported COMMANDs are: help, build, show_config, status

    For more information, see https://www.eessi.io/docs/bot

@laraPPr
Copy link
Collaborator

laraPPr commented Nov 5, 2025

bot: build repo:eessi.io-2025.06-software instance:eessi-bot-vsc-ugent for:arch=x86_64/intel/cascadelake,accel=nvidia/cc70
bot: build repo:eessi.io-2025.06-software instance:eessi-bot-vsc-ugent for:arch=x86_64/amd/zen3,accel=nvidia/cc80

@laraPPr
Copy link
Collaborator

laraPPr commented Nov 5, 2025

bot: show_config

@eessi-bot-aws
Copy link

eessi-bot-aws bot commented Nov 5, 2025

Instance eessi-bot-mc-aws is configured to build on:

  • Node type x86-64-generic:

    • OS: linux
    • CPU architecture: x86_64/generic
    • Repositories: ['eessi.io-2023.06-compat', 'eessi.io-2023.06-software', 'eessi.io-2025.06-compat', 'eessi.io-2025.06-software']
  • Node type x86-64-haswell:

    • OS: linux
    • CPU architecture: x86_64/intel/haswell
    • Repositories: ['eessi.io-2023.06-compat', 'eessi.io-2023.06-software', 'eessi.io-2025.06-compat', 'eessi.io-2025.06-software']
  • Node type x86-64-sapphirerapids:

    • OS: linux
    • CPU architecture: x86_64/intel/sapphirerapids
    • Repositories: ['eessi.io-2023.06-compat', 'eessi.io-2023.06-software', 'eessi.io-2025.06-compat', 'eessi.io-2025.06-software']
  • Node type x86-64-skylake:

    • OS: linux
    • CPU architecture: x86_64/intel/skylake_avx512
    • Repositories: ['eessi.io-2023.06-compat', 'eessi.io-2023.06-software', 'eessi.io-2025.06-compat', 'eessi.io-2025.06-software']
  • Node type x86-64-cascadelake:

    • OS: linux
    • CPU architecture: x86_64/intel/cascadelake
    • Repositories: ['eessi.io-2023.06-compat', 'eessi.io-2023.06-software', 'eessi.io-2025.06-compat', 'eessi.io-2025.06-software']
  • Node type x86-64-icelake:

    • OS: linux
    • CPU architecture: x86_64/intel/icelake
    • Repositories: ['eessi.io-2023.06-compat', 'eessi.io-2023.06-software', 'eessi.io-2025.06-compat', 'eessi.io-2025.06-software']
  • Node type x86-64-zen2:

    • OS: linux
    • CPU architecture: x86_64/amd/zen2
    • Repositories: ['eessi.io-2023.06-compat', 'eessi.io-2023.06-software', 'eessi.io-2025.06-compat', 'eessi.io-2025.06-software']
  • Node type x86-64-zen3:

    • OS: linux
    • CPU architecture: x86_64/amd/zen3
    • Repositories: ['eessi.io-2023.06-compat', 'eessi.io-2023.06-software', 'eessi.io-2025.06-compat', 'eessi.io-2025.06-software']
  • Node type x86-64-zen4:

    • OS: linux
    • CPU architecture: x86_64/amd/zen4
    • Repositories: ['eessi.io-2023.06-compat', 'eessi.io-2023.06-software', 'eessi.io-2025.06-compat', 'eessi.io-2025.06-software']
  • Node type aarch64-generic:

    • OS: linux
    • CPU architecture: aarch64/generic
    • Repositories: ['eessi.io-2023.06-compat', 'eessi.io-2023.06-software', 'eessi.io-2025.06-compat', 'eessi.io-2025.06-software']
  • Node type aarch64-neoverse_n1:

    • OS: linux
    • CPU architecture: aarch64/neoverse_n1
    • Repositories: ['eessi.io-2023.06-compat', 'eessi.io-2023.06-software', 'eessi.io-2025.06-compat', 'eessi.io-2025.06-software']
  • Node type aarch64-neoverse_v1:

    • OS: linux
    • CPU architecture: aarch64/neoverse_v1
    • Repositories: ['eessi.io-2023.06-compat', 'eessi.io-2023.06-software', 'eessi.io-2025.06-compat', 'eessi.io-2025.06-software']
  • Node type aarch64-graviton4:

    • OS: linux
    • CPU architecture: aarch64/aws/graviton4
    • Repositories: ['eessi.io-2023.06-compat', 'eessi.io-2023.06-software', 'eessi.io-2025.06-compat', 'eessi.io-2025.06-software']

@eessi-bot-deucalion
Copy link

Instance eessi-bot-deucalion is configured to build on:

  • Node type a64fx:
    • OS: linux
    • CPU architecture: aarch64/a64fx
    • Repositories: ['eessi.io-2023.06-compat', 'eessi.io-2023.06-software', 'eessi.io-2025.06-compat', 'eessi.io-2025.06-software']

@gpu-bot-ugent
Copy link

gpu-bot-ugent bot commented Nov 5, 2025

Instance eessi-bot-vsc-ugent is configured to build on:

  • Node type gpu_a100:

    • OS: linux
    • CPU architecture: x86_64/amd/zen3
    • Repositories: ['eessi.io-2023.06-compat', 'eessi.io-2023.06-software', 'eessi.io-2025.06-compat', 'eessi.io-2025.06-software']
    • Accelerators: nvidia/cc80
  • Node type gpu_v100:

    • OS: linux
    • CPU architecture: x86_64/intel/cascadelake
    • Repositories: ['eessi.io-2023.06-compat', 'eessi.io-2023.06-software', 'eessi.io-2025.06-compat', 'eessi.io-2025.06-software']
    • Accelerators: nvidia/cc70

@eessi-bot-jsc
Copy link

eessi-bot-jsc bot commented Nov 5, 2025

Instance eessi-bot-jsc is configured to build on:

  • Node type aarch64-nvidia-grace:

    • OS: linux
    • CPU architecture: aarch64/nvidia/grace
    • Repositories: ['eessi.io-2023.06-compat', 'eessi.io-2023.06-software', 'eessi.io-2025.06-compat', 'eessi.io-2025.06-software']
  • Node type aarch64-nvidia-gh200:

    • OS: linux
    • CPU architecture: aarch64/nvidia/grace
    • Repositories: ['eessi.io-2023.06-compat', 'eessi.io-2023.06-software', 'eessi.io-2025.06-compat', 'eessi.io-2025.06-software']
    • Accelerators: nvidia/cc90

@eessi-bot-surf
Copy link

eessi-bot-surf bot commented Nov 5, 2025

Instance eessi-bot-surf is configured to build on:

  • Node type cpu_zen2:

    • OS: linux
    • CPU architecture: x86_64/amd/zen2
    • Repositories: ['eessi.io-2023.06-compat', 'eessi.io-2023.06-software', 'eessi.io-2025.06-compat', 'eessi.io-2025.06-software']
  • Node type cpu_zen4:

    • OS: linux
    • CPU architecture: x86_64/amd/zen4
    • Repositories: ['eessi.io-2023.06-compat', 'eessi.io-2023.06-software', 'eessi.io-2025.06-compat', 'eessi.io-2025.06-software']
  • Node type gpu_a100:

    • OS: linux
    • CPU architecture: x86_64/intel/icelake
    • Repositories: ['eessi.io-2023.06-compat', 'eessi.io-2023.06-software', 'eessi.io-2025.06-compat', 'eessi.io-2025.06-software']
    • Accelerators: nvidia/cc80
  • Node type gpu_h100:

    • OS: linux
    • CPU architecture: x86_64/amd/zen4
    • Repositories: ['eessi.io-2023.06-compat', 'eessi.io-2023.06-software', 'eessi.io-2025.06-compat', 'eessi.io-2025.06-software']
    • Accelerators: nvidia/cc90

@laraPPr
Copy link
Collaborator

laraPPr commented Nov 5, 2025

bot: build repo:eessi.io-2025.06-software instance:eessi-bot-vsc-ugent for:arch=x86_64/intel/cascadelake,accel=nvidia/cc70
bot: build repo:eessi.io-2025.06-software instance:eessi-bot-vsc-ugent for:arch=x86_64/amd/zen3,accel=nvidia/cc80

@gpu-bot-ugent
Copy link

gpu-bot-ugent bot commented Nov 5, 2025

New job on instance eessi-bot-vsc-ugent for repository eessi.io-2025.06-software
Building on: intel-cascadelake and accelerator nvidia/cc70
Building for: x86_64/intel/cascadelake and accelerator nvidia/cc70
Job dir: /scratch/gent/vo/002/gvo00211/SHARED/jobs/2025.11/pr_1278/40737159

date job status comment
Nov 05 15:27:10 UTC 2025 submitted job id 40737159 awaits release by job manager
Nov 05 15:28:59 UTC 2025 released job awaits launch by Slurm scheduler
Nov 05 15:47:32 UTC 2025 running job 40737159 is running
Nov 05 16:18:15 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-40737159.out
✅ no message matching FATAL:
❌ found message matching ERROR:
❌ found message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2025.06-software-linux-x86_64-intel-cascadelake-accel-nvidia-cc70-17623587740.tar.gzsize: 6197 MiB (6498993494 bytes)
entries: 12593
modules under 2025.06/software/linux/x86_64/intel/cascadelake/accel/nvidia/cc70/modules/all
CUDA/12.6.0.lua
CUDA/12.8.0.lua
cuDNN/9.5.0.50-CUDA-12.6.0.lua
software under 2025.06/software/linux/x86_64/intel/cascadelake/accel/nvidia/cc70/software
CUDA/12.6.0
CUDA/12.8.0
cuDNN/9.5.0.50-CUDA-12.6.0
reprod directories under 2025.06/software/linux/x86_64/intel/cascadelake/accel/nvidia/cc70/reprod
CUDA/12.6.0/20251105_160013UTC
CUDA/12.8.0/20251105_160403UTC
cuDNN/9.5.0.50-CUDA-12.6.0/20251105_160558UTC
other under 2025.06/software/linux/x86_64/intel/cascadelake/accel/nvidia/cc70
no other files in tarball
Nov 05 16:18:15 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-40737159.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@gpu-bot-ugent
Copy link

gpu-bot-ugent bot commented Nov 5, 2025

New job on instance eessi-bot-vsc-ugent for repository eessi.io-2025.06-software
Building on: amd-zen3 and accelerator nvidia/cc80
Building for: x86_64/amd/zen3 and accelerator nvidia/cc80
Job dir: /scratch/gent/vo/002/gvo00211/SHARED/jobs/2025.11/pr_1278/15553067

date job status comment
Nov 05 15:27:15 UTC 2025 submitted job id 15553067 awaits release by job manager
Nov 05 15:28:54 UTC 2025 released job awaits launch by Slurm scheduler
Nov 05 15:33:04 UTC 2025 running job 15553067 is running
Nov 05 15:49:37 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-15553067.out
✅ no message matching FATAL:
❌ found message matching ERROR:
❌ found message matching FAILED:
❌ found message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2025.06-software-linux-x86_64-amd-zen3-accel-nvidia-cc80-17623577410.tar.gzsize: 0 MiB (45 bytes)
entries: 0
modules under 2025.06/software/linux/x86_64/amd/zen3/accel/nvidia/cc80/modules/all
no module files in tarball
software under 2025.06/software/linux/x86_64/amd/zen3/accel/nvidia/cc80/software
no software packages in tarball
reprod directories under 2025.06/software/linux/x86_64/amd/zen3/accel/nvidia/cc80/reprod
no reprod directories in tarball
other under 2025.06/software/linux/x86_64/amd/zen3/accel/nvidia/cc80
no other files in tarball
Nov 05 15:49:37 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-15553067.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@casparvl
Copy link
Collaborator Author

casparvl commented Nov 6, 2025

Hm, both tried to install in the same host_injections. The second one gave another of those strange, random permissions errors. Let me retry...

@casparvl
Copy link
Collaborator Author

casparvl commented Nov 6, 2025

bot: build repo:eessi.io-2025.06-software instance:eessi-bot-vsc-ugent for:arch=x86_64/intel/cascadelake,accel=nvidia/cc70
bot: build repo:eessi.io-2025.06-software instance:eessi-bot-vsc-ugent for:arch=x86_64/amd/zen3,accel=nvidia/cc80

@gpu-bot-ugent
Copy link

gpu-bot-ugent bot commented Nov 6, 2025

New job on instance eessi-bot-vsc-ugent for repository eessi.io-2025.06-software
Building on: intel-cascadelake and accelerator nvidia/cc70
Building for: x86_64/intel/cascadelake and accelerator nvidia/cc70
Job dir: /scratch/gent/vo/002/gvo00211/SHARED/jobs/2025.11/pr_1278/40737957

date job status comment
Nov 06 14:46:02 UTC 2025 submitted job id 40737957 awaits release by job manager
Nov 06 14:47:49 UTC 2025 released job awaits launch by Slurm scheduler
Nov 06 15:10:00 UTC 2025 running job 40737957 is running
Nov 06 15:31:03 UTC 2025 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-40737957.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2025.06-software-linux-x86_64-intel-cascadelake-accel-nvidia-cc70-17624422690.tar.gzsize: 6197 MiB (6498918470 bytes)
entries: 12593
modules under 2025.06/software/linux/x86_64/intel/cascadelake/accel/nvidia/cc70/modules/all
CUDA/12.6.0.lua
CUDA/12.8.0.lua
cuDNN/9.5.0.50-CUDA-12.6.0.lua
software under 2025.06/software/linux/x86_64/intel/cascadelake/accel/nvidia/cc70/software
CUDA/12.6.0
CUDA/12.8.0
cuDNN/9.5.0.50-CUDA-12.6.0
reprod directories under 2025.06/software/linux/x86_64/intel/cascadelake/accel/nvidia/cc70/reprod
CUDA/12.6.0/20251106_151246UTC
CUDA/12.8.0/20251106_151549UTC
cuDNN/9.5.0.50-CUDA-12.6.0/20251106_151735UTC
other under 2025.06/software/linux/x86_64/intel/cascadelake/accel/nvidia/cc70
no other files in tarball
Nov 06 15:31:03 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-40737957.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@gpu-bot-ugent
Copy link

gpu-bot-ugent bot commented Nov 6, 2025

New job on instance eessi-bot-vsc-ugent for repository eessi.io-2025.06-software
Building on: amd-zen3 and accelerator nvidia/cc80
Building for: x86_64/amd/zen3 and accelerator nvidia/cc80
Job dir: /scratch/gent/vo/002/gvo00211/SHARED/jobs/2025.11/pr_1278/15553665

date job status comment
Nov 06 14:46:06 UTC 2025 submitted job id 15553665 awaits release by job manager
Nov 06 14:47:44 UTC 2025 released job awaits launch by Slurm scheduler
Nov 06 15:09:55 UTC 2025 running job 15553665 is running
Nov 06 15:28:59 UTC 2025 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-15553665.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2025.06-software-linux-x86_64-amd-zen3-accel-nvidia-cc80-17624422710.tar.gzsize: 6197 MiB (6498950786 bytes)
entries: 12593
modules under 2025.06/software/linux/x86_64/amd/zen3/accel/nvidia/cc80/modules/all
CUDA/12.6.0.lua
CUDA/12.8.0.lua
cuDNN/9.5.0.50-CUDA-12.6.0.lua
software under 2025.06/software/linux/x86_64/amd/zen3/accel/nvidia/cc80/software
CUDA/12.6.0
CUDA/12.8.0
cuDNN/9.5.0.50-CUDA-12.6.0
reprod directories under 2025.06/software/linux/x86_64/amd/zen3/accel/nvidia/cc80/reprod
CUDA/12.6.0/20251106_151243UTC
CUDA/12.8.0/20251106_151550UTC
cuDNN/9.5.0.50-CUDA-12.6.0/20251106_151734UTC
other under 2025.06/software/linux/x86_64/amd/zen3/accel/nvidia/cc80
no other files in tarball
Nov 06 15:28:59 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-15553665.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@laraPPr
Copy link
Collaborator

laraPPr commented Nov 6, 2025

What causes this?
ERROR: Clone of the test suite /eessi_bot_job/EESSI-test-suite is not available!

@laraPPr
Copy link
Collaborator

laraPPr commented Nov 6, 2025

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

2025.06-software.eessi.io 2025.06 version of software.eessi.io accel:nvidia

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants