Skip to content

Conversation

@trz42
Copy link
Collaborator

@trz42 trz42 commented Mar 22, 2025

Also adds dependency Java 11.0

@trz42 trz42 added 2023.06-software.eessi.io 2023.06 version of software.eessi.io grace NVIDIA Grace CPU labels Mar 22, 2025
@eessi-bot
Copy link

eessi-bot bot commented Mar 22, 2025

Instance eessi-bot-mc-aws is configured to build for:

  • architectures: x86_64/generic, x86_64/intel/haswell, x86_64/intel/sapphirerapids, x86_64/intel/skylake_avx512, x86_64/amd/zen2, x86_64/amd/zen3, aarch64/generic, aarch64/neoverse_n1, aarch64/neoverse_v1
  • repositories: eessi.io-2023.06-software, eessi.io-2023.06-compat

@eessi-bot
Copy link

eessi-bot bot commented Mar 22, 2025

Instance eessi-bot-mc-azure is configured to build for:

  • architectures: x86_64/amd/zen4
  • repositories: eessi.io-2023.06-compat, eessi.io-2023.06-software

@eessi-bot-toprichard
Copy link

Instance rt-Grace-jr is configured to build for:

  • architectures: aarch64/nvidia/grace
  • repositories: eessi.io-2023.06-software

@eessi-bot-trz42
Copy link

Instance trz42-GH200-jr is configured to build for:

  • architectures: aarch64/nvidia/grace
  • repositories: eessi.io-2023.06-software

@trz42
Copy link
Collaborator Author

trz42 commented Mar 22, 2025

bot: build instance:trz42-GH200-jr repository:eessi.io-2023.06-software architecture:aarch64/nvidia/grace

@eessi-bot
Copy link

eessi-bot bot commented Mar 22, 2025

Updates by the bot instance eessi-bot-mc-aws (click for details)
  • received bot command build instance:trz42-GH200-jr repository:eessi.io-2023.06-software architecture:aarch64/nvidia/grace from trz42

    • expanded format: build instance:trz42-GH200-jr repository:eessi.io-2023.06-software architecture:aarch64/nvidia/grace
  • handling command build instance:trz42-GH200-jr repository:eessi.io-2023.06-software architecture:aarch64/nvidia/grace resulted in:

    • no jobs were submitted

@eessi-bot
Copy link

eessi-bot bot commented Mar 22, 2025

Updates by the bot instance eessi-bot-mc-azure (click for details)
  • received bot command build instance:trz42-GH200-jr repository:eessi.io-2023.06-software architecture:aarch64/nvidia/grace from trz42

    • expanded format: build instance:trz42-GH200-jr repository:eessi.io-2023.06-software architecture:aarch64/nvidia/grace
  • handling command build instance:trz42-GH200-jr repository:eessi.io-2023.06-software architecture:aarch64/nvidia/grace resulted in:

    • no jobs were submitted

@eessi-bot-toprichard
Copy link

Updates by the bot instance rt-Grace-jr (click for details)
  • account trz42 has NO permission to send commands to the bot

@eessi-bot-trz42
Copy link

eessi-bot-trz42 bot commented Mar 22, 2025

Updates by the bot instance trz42-GH200-jr (click for details)
  • received bot command build instance:trz42-GH200-jr repository:eessi.io-2023.06-software architecture:aarch64/nvidia/grace from trz42

    • expanded format: build instance:trz42-GH200-jr repository:eessi.io-2023.06-software architecture:aarch64/nvidia/grace
  • handling command build instance:trz42-GH200-jr repository:eessi.io-2023.06-software architecture:aarch64/nvidia/grace resulted in:

@eessi-bot-trz42
Copy link

eessi-bot-trz42 bot commented Mar 22, 2025

New job on instance trz42-GH200-jr for CPU micro-architecture aarch64-nvidia-grace for repository eessi.io-2023.06-software in job dir /p/project1/ceasybuilders/bot-trz42/jobs/2025.03/pr_975/13529544

date job status comment
Mar 22 07:55:10 UTC 2025 submitted job id 13529544 awaits release by job manager
Mar 22 07:55:28 UTC 2025 released job awaits launch by Slurm scheduler
Mar 22 07:56:31 UTC 2025 running job 13529544 is running
Mar 22 08:16:02 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-13529544.out
✅ no message matching FATAL:
❌ found message matching ERROR:
❌ found message matching FAILED:
❌ found message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-aarch64-nvidia-grace-1742631217.tar.gzsize: 324 MiB (340548499 bytes)
entries: 9664
modules under 2023.06/software/linux/aarch64/nvidia/grace/modules/all
Java/.modulerc.lua
Java/11.0.20.lua
Nextflow/23.10.0.lua
ReFrame/4.6.2.lua
software under 2023.06/software/linux/aarch64/nvidia/grace/software
Java/11.0.20
Nextflow/23.10.0
ReFrame/4.6.2
other under 2023.06/software/linux/aarch64/nvidia/grace
no other files in tarball
Mar 22 08:16:02 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-13529544.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@trz42
Copy link
Collaborator Author

trz42 commented Mar 23, 2025

Building ReFrame-4.3.3.eb failed. It seems it tries to access setup.py from the directory where the eb command is run. However, changing the ReFrame-4.3.3.eb easyconfig as follows solves the issue

--- ReFrame-4.3.3.eb    2025-03-22 11:27:33.000000000 +0100
+++ _ReFrame-4.3.3.eb   2025-03-23 08:10:54.281700829 +0100
@@ -23,10 +23,12 @@
         'checksums': ['fd11ba3d0fdb4c07fbc5ecbba0b1b719809420f25038f8ee3cd913d3faa3033a'],
     }),
     ('wheel', '0.37.1', {
+        'use_pip': True,
         'source_tmpl': 'wheel-%(version)s-py2.py3-none-any.whl',
         'checksums': ['4bdcd7d840138086126cd09254dc6195fb4fc6f01c050a1d7236f2630db1d22a'],
     }),
     ('reframe', version, {
+        'use_pip': True,
         'preinstallopts': "export PATH=%(installdir)s/bin:$PATH && "
                           # use PyYAML 6.0.1 to solve Cython 3 incompatibility issues
                           "sed -i 's@PyYAML==6.0@PyYAML==6.0.1@' requirements.txt && "

Because we don't use ReFrame and there is a newer version of ReFrame installed through this PR, we rather not install ReFrame/4.3.3 for now until we find a solution.

@trz42
Copy link
Collaborator Author

trz42 commented Mar 23, 2025

bot: build instance:trz42-GH200-jr repository:eessi.io-2023.06-software architecture:aarch64/nvidia/grace

@eessi-bot
Copy link

eessi-bot bot commented Mar 23, 2025

Updates by the bot instance eessi-bot-mc-aws (click for details)
  • received bot command build instance:trz42-GH200-jr repository:eessi.io-2023.06-software architecture:aarch64/nvidia/grace from trz42

    • expanded format: build instance:trz42-GH200-jr repository:eessi.io-2023.06-software architecture:aarch64/nvidia/grace
  • handling command build instance:trz42-GH200-jr repository:eessi.io-2023.06-software architecture:aarch64/nvidia/grace resulted in:

    • no jobs were submitted

@eessi-bot
Copy link

eessi-bot bot commented Mar 23, 2025

Updates by the bot instance eessi-bot-mc-azure (click for details)
  • received bot command build instance:trz42-GH200-jr repository:eessi.io-2023.06-software architecture:aarch64/nvidia/grace from trz42

    • expanded format: build instance:trz42-GH200-jr repository:eessi.io-2023.06-software architecture:aarch64/nvidia/grace
  • handling command build instance:trz42-GH200-jr repository:eessi.io-2023.06-software architecture:aarch64/nvidia/grace resulted in:

    • no jobs were submitted

@eessi-bot-trz42
Copy link

eessi-bot-trz42 bot commented Mar 23, 2025

Updates by the bot instance trz42-GH200-jr (click for details)
  • received bot command build instance:trz42-GH200-jr repository:eessi.io-2023.06-software architecture:aarch64/nvidia/grace from trz42

    • expanded format: build instance:trz42-GH200-jr repository:eessi.io-2023.06-software architecture:aarch64/nvidia/grace
  • handling command build instance:trz42-GH200-jr repository:eessi.io-2023.06-software architecture:aarch64/nvidia/grace resulted in:

@eessi-bot-toprichard
Copy link

Updates by the bot instance rt-Grace-jr (click for details)
  • account trz42 has NO permission to send commands to the bot

@eessi-bot-trz42
Copy link

eessi-bot-trz42 bot commented Mar 23, 2025

New job on instance trz42-GH200-jr for CPU micro-architecture aarch64-nvidia-grace for repository eessi.io-2023.06-software in job dir /p/project1/ceasybuilders/bot-trz42/jobs/2025.03/pr_975/13530846

  • test step failed due to missing ReFrame site configuration file
date job status comment
Mar 23 07:25:48 UTC 2025 submitted job id 13530846 awaits release by job manager
Mar 23 07:25:53 UTC 2025 released job awaits launch by Slurm scheduler
Mar 23 07:26:55 UTC 2025 running job 13530846 is running
Mar 23 07:37:11 UTC 2025 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-13530846.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-aarch64-nvidia-grace-1742715312.tar.gzsize: 324 MiB (340545920 bytes)
entries: 9664
modules under 2023.06/software/linux/aarch64/nvidia/grace/modules/all
Java/.modulerc.lua
Java/11.0.20.lua
Nextflow/23.10.0.lua
ReFrame/4.6.2.lua
software under 2023.06/software/linux/aarch64/nvidia/grace/software
Java/11.0.20
Nextflow/23.10.0
ReFrame/4.6.2
other under 2023.06/software/linux/aarch64/nvidia/grace
no other files in tarball
Mar 23 07:37:11 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-13530846.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@trz42 trz42 changed the title {2023.06}[system,grace] NextFlow 23.10.0, ReFrame 4.3.3 + 4.6.2 {2023.06}[system,grace] NextFlow 23.10.0, ReFrame 4.6.2 Mar 23, 2025
@trz42
Copy link
Collaborator Author

trz42 commented Mar 23, 2025

Added a reframe_config.py for the bot instance. Let's build again to verify if the test step is run...
bot: build instance:trz42-GH200-jr repository:eessi.io-2023.06-software architecture:aarch64/nvidia/grace

@eessi-bot
Copy link

eessi-bot bot commented Mar 23, 2025

Updates by the bot instance eessi-bot-mc-aws (click for details)
  • received bot command build instance:trz42-GH200-jr repository:eessi.io-2023.06-software architecture:aarch64/nvidia/grace from trz42

    • expanded format: build instance:trz42-GH200-jr repository:eessi.io-2023.06-software architecture:aarch64/nvidia/grace
  • handling command build instance:trz42-GH200-jr repository:eessi.io-2023.06-software architecture:aarch64/nvidia/grace resulted in:

    • no jobs were submitted

@eessi-bot
Copy link

eessi-bot bot commented Mar 23, 2025

Updates by the bot instance eessi-bot-mc-azure (click for details)
  • received bot command build instance:trz42-GH200-jr repository:eessi.io-2023.06-software architecture:aarch64/nvidia/grace from trz42

    • expanded format: build instance:trz42-GH200-jr repository:eessi.io-2023.06-software architecture:aarch64/nvidia/grace
  • handling command build instance:trz42-GH200-jr repository:eessi.io-2023.06-software architecture:aarch64/nvidia/grace resulted in:

    • no jobs were submitted

@eessi-bot-trz42
Copy link

eessi-bot-trz42 bot commented Mar 23, 2025

Updates by the bot instance trz42-GH200-jr (click for details)
  • received bot command build instance:trz42-GH200-jr repository:eessi.io-2023.06-software architecture:aarch64/nvidia/grace from trz42

    • expanded format: build instance:trz42-GH200-jr repository:eessi.io-2023.06-software architecture:aarch64/nvidia/grace
  • handling command build instance:trz42-GH200-jr repository:eessi.io-2023.06-software architecture:aarch64/nvidia/grace resulted in:

@eessi-bot-toprichard
Copy link

Updates by the bot instance rt-Grace-jr (click for details)
  • account trz42 has NO permission to send commands to the bot

@eessi-bot-trz42
Copy link

eessi-bot-trz42 bot commented Mar 23, 2025

New job on instance trz42-GH200-jr for CPU micro-architecture aarch64-nvidia-grace for repository eessi.io-2023.06-software in job dir /p/project1/ceasybuilders/bot-trz42/jobs/2025.03/pr_975/13530873

date job status comment
Mar 23 09:09:23 UTC 2025 submitted job id 13530873 awaits release by job manager
Mar 23 09:10:23 UTC 2025 released job awaits launch by Slurm scheduler
Mar 23 09:11:26 UTC 2025 running job 13530873 is running
Mar 23 09:21:42 UTC 2025 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-13530873.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-aarch64-nvidia-grace-1742721547.tar.gzsize: 324 MiB (340549788 bytes)
entries: 9664
modules under 2023.06/software/linux/aarch64/nvidia/grace/modules/all
Java/.modulerc.lua
Java/11.0.20.lua
Nextflow/23.10.0.lua
ReFrame/4.6.2.lua
software under 2023.06/software/linux/aarch64/nvidia/grace/software
Java/11.0.20
Nextflow/23.10.0
ReFrame/4.6.2
other under 2023.06/software/linux/aarch64/nvidia/grace
no other files in tarball
Mar 23 09:21:42 UTC 2025 test result
😁 SUCCESS (click triangle for details)
ReFrame Summary
[ PASSED ] Ran 0/0 test case(s) from 0 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-13530873.out
✅ no message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case
Mar 23 12:37:41 UTC 2025 uploaded transfer of eessi-2023.06-software-linux-aarch64-nvidia-grace-1742721547.tar.gz to S3 bucket succeeded

@trz42 trz42 added the ready-to-deploy Mark a PR as ready to deploy label Mar 23, 2025
@trz42 trz42 added bot:deploy Ask bot to deploy missing software installations to EESSI and removed ready-to-deploy Mark a PR as ready to deploy labels Mar 23, 2025
@eessi-bot-toprichard
Copy link

Label bot:deploy has been set by user trz42, but this person does not have permission to trigger deployments

@bedroge
Copy link
Collaborator

bedroge commented Mar 23, 2025

Staging PR merged.

@bedroge
Copy link
Collaborator

bedroge commented Mar 23, 2025

@trz42 I guess we need #976 to fix the failing CI?

@trz42
Copy link
Collaborator Author

trz42 commented Mar 23, 2025

@trz42 I guess we need #976 to fix the failing CI?

That’s right.

@bedroge
Copy link
Collaborator

bedroge commented Mar 23, 2025

Ignoring the failing CI, as it's clearly unrelated to this PR, and the tarballs have already been ingested.

@bedroge bedroge merged commit 540a9e6 into EESSI:2023.06-software.eessi.io Mar 23, 2025
49 of 59 checks passed
@eessi-bot
Copy link

eessi-bot bot commented Mar 23, 2025

PR merged! Moved [] to /project/def-users/SHARED/trash_bin/EESSI/software-layer/2025.03.23

1 similar comment
@eessi-bot
Copy link

eessi-bot bot commented Mar 23, 2025

PR merged! Moved [] to /project/def-users/SHARED/trash_bin/EESSI/software-layer/2025.03.23

@eessi-bot-trz42
Copy link

PR merged! Moved ['/p/project1/ceasybuilders/bot-trz42/jobs/2025.03/pr_975/13530846', '/p/project1/ceasybuilders/bot-trz42/jobs/2025.03/pr_975/13529544', '/p/project1/ceasybuilders/bot-trz42/jobs/2025.03/pr_975/13530873'] to /p/project1/ceasybuilders/bot-trz42/trash_bin/EESSI/software-layer/2025.03.23

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

2023.06-software.eessi.io 2023.06 version of software.eessi.io bot:deploy Ask bot to deploy missing software installations to EESSI grace NVIDIA Grace CPU

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants