
@casparvl
Contributor

@casparvl casparvl commented Aug 25, 2025

Let's make sure EASYBUILD_INSTALLPATH is used for the disk space check. The install path changed in #59, so the disk space check was being done on a different folder than the installation prefix.

Also, make sure the install prefix exists, to avoid strange errors that say you're out of disk space when in fact the directory just didn't exist.
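As a rough illustration of the two points above, here is a minimal sketch of such a check. It is not the code from `install_cuda_and_libraries.sh`: the function name `check_disk_space` and the KiB-based threshold are assumptions made for this example.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not taken from the PR): verify that a directory
# exists and has at least the requested amount of free space.
# Usage: check_disk_space <dir> <required_kib>
check_disk_space() {
    local dir="$1"
    local required_kib="$2"

    # Create the directory first, so that a missing directory is not
    # misreported as a lack of disk space.
    mkdir -p "$dir" || { echo "Cannot create directory: $dir" >&2; return 2; }

    # df -P gives the portable output format, -k reports 1024-byte blocks;
    # the 4th column of the 2nd line is the available space.
    local avail_kib
    avail_kib=$(df -Pk "$dir" | awk 'NR==2 {print $4}')

    if [ "$avail_kib" -lt "$required_kib" ]; then
        echo "Need ${required_kib} KiB in $dir, only ${avail_kib} KiB available" >&2
        return 1
    fi
    return 0
}

# Example call (the 1 GiB threshold is an arbitrary assumption):
#   check_disk_space "${EASYBUILD_INSTALLPATH}" 1048576
```

Running the check against the same path that the installation uses keeps the check and the install prefix consistent, which is the mismatch described above.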

…check. Also, make sure that dir exists to avoid strange errors that say you're out of disk space, while actually, the dir just didn't exist
@casparvl
Contributor Author

bot: build repo:eessi.io-2023.06-software instance:eessi-bot-surf for:arch=x86_64/amd/zen4,accel=nvidia/cc90

@eessi-bot-surf

eessi-bot-surf bot commented Aug 25, 2025

New job on instance eessi-bot-surf for repository eessi.io-2023.06-software
Building on: amd-zen4 and accelerator nvidia/cc90
Building for: x86_64/amd/zen4 and accelerator nvidia/cc90
Job dir: /projects/eessibot/eessi-bot-surf/jobs/2025.08/pr_72/14283059

date job status comment
Aug 25 13:28:06 UTC 2025 submitted job id 14283059 will be eligible to start in about 20 seconds
Aug 25 13:28:17 UTC 2025 received job awaits launch by Slurm scheduler
Aug 25 13:28:30 UTC 2025 running job 14283059 is running
Aug 25 13:36:14 UTC 2025 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-14283059.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-amd-zen4-accel-nvidia-cc90-17561289160.tar.gz
size: 0 MiB (8164 bytes)
entries: 2
modules under 2023.06/software/linux/x86_64/amd/zen4/accel/nvidia/cc90/modules/all
no module files in tarball
software under 2023.06/software/linux/x86_64/amd/zen4/accel/nvidia/cc90/software
no software packages in tarball
reprod directories under 2023.06/software/linux/x86_64/amd/zen4/accel/nvidia/cc90/reprod
no reprod directories in tarball
other under 2023.06/software/linux/x86_64/amd/zen4/accel/nvidia/cc90
2023.06/scripts/gpu_support/nvidia/install_cuda_and_libraries.sh
2023.06/software/linux/x86_64/amd/zen4/.lmod/SitePackage.lua
Aug 25 13:36:14 UTC 2025 test result
😁 SUCCESS (click triangle for details)
ReFrame Summary
[ SKIP ] (1/8) Skipping GPU test : only 1 GPU available for this test case
[ SKIP ] (2/8) Skipping GPU test : only 1 GPU available for this test case
[ SKIP ] (3/8) Skipping GPU test : only 1 GPU available for this test case
[ SKIP ] (4/8) Skipping GPU test : only 1 GPU available for this test case
[ SKIP ] (5/8) Skipping test : 1 GPU(s) available for this test case, need exactly 2
[ SKIP ] (6/8) Skipping test : 1 GPU(s) available for this test case, need exactly 2
[ SKIP ] (7/8) Skipping test : 1 GPU(s) available for this test case, need exactly 2
[ SKIP ] (8/8) Skipping test : 1 GPU(s) available for this test case, need exactly 2
[ PASSED ] Ran 0/8 test case(s) from 8 check(s) (0 failure(s), 8 skipped, 0 aborted)
Details
✅ job output file slurm-14283059.out
✅ no message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case
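The build-step checks in the report above (messages that must be absent from the job output, and messages that must be present) can be sketched as follows. This is a hypothetical re-implementation for illustration only, not the bot's actual code; the function name `check_job_output` is an assumption, and the strings are matched literally.

```shell
#!/usr/bin/env bash
# Hypothetical sketch of the log checks listed in the bot report;
# the real bot may implement them differently.
# Usage: check_job_output <slurm-output-file>
check_job_output() {
    local logfile="$1"
    local status=0
    local pattern

    # Strings that must NOT appear in the job output.
    for pattern in "FATAL:" "ERROR:" "FAILED:" "required modules missing:"; do
        if grep -qF -- "$pattern" "$logfile"; then
            echo "found message matching $pattern"
            status=1
        else
            echo "no message matching $pattern"
        fi
    done

    # Strings that MUST appear in the job output.
    for pattern in "No missing installations" ".tar.gz created!"; do
        if grep -qF -- "$pattern" "$logfile"; then
            echo "found message matching $pattern"
        else
            echo "no message matching $pattern"
            status=1
        fi
    done

    return "$status"
}
```

A call such as `check_job_output slurm-14283059.out` would return 0 only if every check passes.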

@casparvl casparvl changed the title from "Fix error stating there is insufficient disk space for CUDA libs in host injections, while in fact the folder didn't exist" to "Update one more dir to reflect the new location of CUDA installations in host_injections" Aug 25, 2025
@casparvl
Contributor Author

I'm not sure why the SitePackage.lua keeps ending up in my tarballs. It did not change...
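One way to confirm whether a given file is in a tarball is to list its contents. The sketch below is only an illustration; the helper name `tarball_contains` is an assumption and not part of the EESSI tooling.

```shell
#!/usr/bin/env bash
# Hypothetical helper: succeed if the gzipped tarball contains an entry
# whose path matches exactly.
# Usage: tarball_contains <tarball.tar.gz> <path-inside-tarball>
tarball_contains() {
    local tarball="$1"
    local path="$2"
    # tar -tzf lists the entries of a gzipped tarball, one per line;
    # grep -x requires a whole-line match, -F treats the path literally.
    tar -tzf "$tarball" | grep -qxF -- "$path"
}

# Example call, using the entry reported in the first build above:
#   tarball_contains eessi-...tar.gz \
#       2023.06/software/linux/x86_64/amd/zen4/.lmod/SitePackage.lua
```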

…SI_ACCELERATOR_TARGET, as it should be. The changes in this commit were just forgotten, thus making this now inconsistent
@casparvl
Contributor Author

bot: build repo:eessi.io-2023.06-software instance:eessi-bot-surf for:arch=x86_64/amd/zen4,accel=nvidia/cc90

@eessi-bot-surf

eessi-bot-surf bot commented Aug 25, 2025

New job on instance eessi-bot-surf for repository eessi.io-2023.06-software
Building on: amd-zen4 and accelerator nvidia/cc90
Building for: x86_64/amd/zen4 and accelerator nvidia/cc90
Job dir: /projects/eessibot/eessi-bot-surf/jobs/2025.08/pr_72/14283488

date job status comment
Aug 25 13:56:04 UTC 2025 submitted job id 14283488 will be eligible to start in about 20 seconds
Aug 25 13:56:11 UTC 2025 received job awaits launch by Slurm scheduler
Aug 25 13:56:34 UTC 2025 running job 14283488 is running
Aug 25 13:58:07 UTC 2025 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-14283488.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-amd-zen4-accel-nvidia-cc90-17561302300.tar.gz
size: 0 MiB (4411 bytes)
entries: 1
modules under 2023.06/software/linux/x86_64/amd/zen4/accel/nvidia/cc90/modules/all
no module files in tarball
software under 2023.06/software/linux/x86_64/amd/zen4/accel/nvidia/cc90/software
no software packages in tarball
reprod directories under 2023.06/software/linux/x86_64/amd/zen4/accel/nvidia/cc90/reprod
no reprod directories in tarball
other under 2023.06/software/linux/x86_64/amd/zen4/accel/nvidia/cc90
2023.06/scripts/gpu_support/nvidia/install_cuda_and_libraries.sh
Aug 25 13:58:07 UTC 2025 test result
😁 SUCCESS (click triangle for details)
ReFrame Summary
[ SKIP ] (1/8) Skipping GPU test : only 1 GPU available for this test case
[ SKIP ] (2/8) Skipping GPU test : only 1 GPU available for this test case
[ SKIP ] (3/8) Skipping GPU test : only 1 GPU available for this test case
[ SKIP ] (4/8) Skipping GPU test : only 1 GPU available for this test case
[ SKIP ] (5/8) Skipping test : 1 GPU(s) available for this test case, need exactly 2
[ SKIP ] (6/8) Skipping test : 1 GPU(s) available for this test case, need exactly 2
[ SKIP ] (7/8) Skipping test : 1 GPU(s) available for this test case, need exactly 2
[ SKIP ] (8/8) Skipping test : 1 GPU(s) available for this test case, need exactly 2
[ PASSED ] Ran 0/8 test case(s) from 8 check(s) (0 failure(s), 8 skipped, 0 aborted)
Details
✅ job output file slurm-14283488.out
✅ no message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case
Aug 25 14:12:35 UTC 2025 uploaded transfer of eessi-2023.06-software-linux-x86_64-amd-zen4-accel-nvidia-cc90-17561302300.tar.gz to S3 bucket succeeded
Aug 25 14:36:16 UTC 2025 uploaded transfer of eessi-2023.06-software-linux-x86_64-amd-zen4-accel-nvidia-cc90-17561302300.tar.gz to S3 bucket succeeded

@casparvl casparvl marked this pull request as ready for review August 25, 2025 13:59
@casparvl
Contributor Author

bot: build repo:eessi.io-2025.06-software instance:eessi-bot-surf for:arch=x86_64/amd/zen4,accel=nvidia/cc90

@eessi-bot-surf

eessi-bot-surf bot commented Aug 25, 2025

New job on instance eessi-bot-surf for repository eessi.io-2025.06-software
Building on: amd-zen4 and accelerator nvidia/cc90
Building for: x86_64/amd/zen4 and accelerator nvidia/cc90
Job dir: /projects/eessibot/eessi-bot-surf/jobs/2025.08/pr_72/14283533

date job status comment
Aug 25 14:02:22 UTC 2025 submitted job id 14283533 will be eligible to start in about 20 seconds
Aug 25 14:02:32 UTC 2025 received job awaits launch by Slurm scheduler
Aug 25 14:02:45 UTC 2025 running job 14283533 is running
Aug 25 14:03:30 UTC 2025 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-14283533.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2025.06-software-linux-x86_64-amd-zen4-accel-nvidia-cc90-17561305900.tar.gz
size: 0 MiB (4409 bytes)
entries: 1
modules under 2025.06/software/linux/x86_64/amd/zen4/accel/nvidia/cc90/modules/all
no module files in tarball
software under 2025.06/software/linux/x86_64/amd/zen4/accel/nvidia/cc90/software
no software packages in tarball
reprod directories under 2025.06/software/linux/x86_64/amd/zen4/accel/nvidia/cc90/reprod
no reprod directories in tarball
other under 2025.06/software/linux/x86_64/amd/zen4/accel/nvidia/cc90
2025.06/scripts/gpu_support/nvidia/install_cuda_and_libraries.sh
Aug 25 14:03:30 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-14283533.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case
Aug 25 14:12:38 UTC 2025 not uploaded transfer of eessi-2025.06-software-linux-x86_64-amd-zen4-accel-nvidia-cc90-17561305900.tar.gz to S3 bucket failed (no bucket specified for eessi.io-2025.06-software)
Aug 25 14:36:19 UTC 2025 not uploaded transfer of eessi-2025.06-software-linux-x86_64-amd-zen4-accel-nvidia-cc90-17561305900.tar.gz to S3 bucket failed (no bucket specified for eessi.io-2025.06-software)

@casparvl
Contributor Author

bot: build repo:eessi.io-2025.06-software instance:eessi-bot-mc-aws arch:zen2

@casparvl
Contributor Author

bot: build repo:eessi.io-2025.06-software instance:eessi-bot-mc-aws for:arch=zen2

@eessi-bot-aws

eessi-bot-aws bot commented Aug 25, 2025

New job on instance eessi-bot-mc-aws for repository eessi.io-2025.06-software
Building on: amd-zen2
Building for: zen2
Job dir: /project/def-users/SHARED/jobs/2025.08/pr_72/85366

date job status comment
Aug 25 14:21:25 UTC 2025 submitted job id 85366 awaits release by job manager
Aug 25 14:22:19 UTC 2025 released job awaits launch by Slurm scheduler
Aug 25 14:27:22 UTC 2025 running job 85366 is running
Aug 25 14:28:23 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-85366.out
✅ no message matching FATAL:
❌ found message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2025.06-software-linux-zen2-17561320340.tar.gz
size: 0 MiB (4399 bytes)
entries: 1
modules under 2025.06/software/linux/zen2/modules/all
no module files in tarball
software under 2025.06/software/linux/zen2/software
no software packages in tarball
reprod directories under 2025.06/software/linux/zen2/reprod
no reprod directories in tarball
other under 2025.06/software/linux/zen2
2025.06/scripts/gpu_support/nvidia/install_cuda_and_libraries.sh
Aug 25 14:28:23 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-85366.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@casparvl
Contributor Author

bot: build repo:eessi.io-2025.06-software instance:eessi-bot-mc-aws for:arch=x86_64/amd/zen2

@eessi-bot-aws

eessi-bot-aws bot commented Aug 25, 2025

New job on instance eessi-bot-mc-aws for repository eessi.io-2025.06-software
Building on: amd-zen2
Building for: x86_64/amd/zen2
Job dir: /project/def-users/SHARED/jobs/2025.08/pr_72/85367

date job status comment
Aug 25 14:31:53 UTC 2025 submitted job id 85367 awaits release by job manager
Aug 25 14:32:26 UTC 2025 released job awaits launch by Slurm scheduler
Aug 25 14:33:27 UTC 2025 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-85367.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2025.06-software-linux-x86_64-amd-zen2-17561323550.tar.gz
size: 0 MiB (4397 bytes)
entries: 1
modules under 2025.06/software/linux/x86_64/amd/zen2/modules/all
no module files in tarball
software under 2025.06/software/linux/x86_64/amd/zen2/software
no software packages in tarball
reprod directories under 2025.06/software/linux/x86_64/amd/zen2/reprod
no reprod directories in tarball
other under 2025.06/software/linux/x86_64/amd/zen2
2025.06/scripts/gpu_support/nvidia/install_cuda_and_libraries.sh
Aug 25 14:33:27 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-85367.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case
Aug 25 14:35:53 UTC 2025 uploaded transfer of eessi-2025.06-software-linux-x86_64-amd-zen2-17561323550.tar.gz to S3 bucket succeeded

@bedroge
Contributor

bedroge commented Aug 25, 2025

Tarballs have been ingested.

@bedroge bedroge merged commit aa99eae into EESSI:main Aug 25, 2025
66 checks passed