@trz42
Collaborator

@trz42 trz42 commented Mar 28, 2025

Based on #936

This PR has been reworked to use EasyBuild 4.9.4, so all of the [include-easyblocks-]from-{pr,commit} options used originally could be removed.

However, a new from-commit was added for scikit-build-core because it was rebuilt with an easyconfig that was merged after EB 4.9.4 was released. Also, we currently cannot rebuild for NVIDIA Grace.

Note: the ReFrame config for the bot instance has been updated to include 'processor' information. Hopefully that lets the tests run successfully.

trz42 added the labels 2023.06-software.eessi.io (2023.06 version of software.eessi.io) and grace (NVIDIA Grace CPU) on Mar 28, 2025
@eessi-bot

eessi-bot bot commented Mar 28, 2025

Instance eessi-bot-mc-aws is configured to build for:

  • architectures: x86_64/generic, x86_64/intel/haswell, x86_64/intel/sapphirerapids, x86_64/intel/skylake_avx512, x86_64/amd/zen2, x86_64/amd/zen3, aarch64/generic, aarch64/neoverse_n1, aarch64/neoverse_v1
  • repositories: eessi.io-2023.06-software, eessi.io-2023.06-compat

@eessi-bot

eessi-bot bot commented Mar 28, 2025

Instance eessi-bot-mc-azure is configured to build for:

  • architectures: x86_64/amd/zen4
  • repositories: eessi.io-2023.06-compat, eessi.io-2023.06-software

@eessi-bot-toprichard

Instance rt-Grace-jr is configured to build for:

  • architectures: aarch64/nvidia/grace
  • repositories: eessi.io-2023.06-software

@eessi-bot-trz42

Instance trz42-GH200-jr is configured to build for:

  • architectures: aarch64/nvidia/grace
  • repositories: eessi.io-2023.06-software

@trz42
Collaborator Author

trz42 commented Mar 28, 2025

bot: build instance:trz42-GH200-jr repository:eessi.io-2023.06-software architecture:aarch64/nvidia/grace
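
The command above is a list of key:value filters after the action word. As a rough illustration of why only one instance ends up submitting a job, here is a minimal Python sketch. It is not the eessi-bot implementation: `parse_bot_command` and `instance_matches` are hypothetical helpers, and the matching rule (every filter that is given must match) is an assumption. Permission checks, which the real bot also performs, are left out.

```python
def parse_bot_command(line):
    """Split 'bot: <action> key:value ...' into (action, filters)."""
    prefix = "bot:"
    if not line.startswith(prefix):
        raise ValueError("not a bot command")
    words = line[len(prefix):].split()
    if not words:
        raise ValueError("missing action")
    action, filters = words[0], {}
    for word in words[1:]:
        # Split on the first ':' only, so values may contain '/' or '.'.
        key, _, value = word.partition(":")
        filters[key] = value
    return action, filters


def instance_matches(filters, name, architectures, repositories):
    """Assumed rule: every filter present in the command must match."""
    if "instance" in filters and filters["instance"] != name:
        return False
    if "architecture" in filters and filters["architecture"] not in architectures:
        return False
    if "repository" in filters and filters["repository"] not in repositories:
        return False
    return True


command = ("bot: build instance:trz42-GH200-jr "
           "repository:eessi.io-2023.06-software "
           "architecture:aarch64/nvidia/grace")
action, filters = parse_bot_command(command)

# Instance settings copied from the configuration comments in this thread.
print(instance_matches(filters, "trz42-GH200-jr",
                       ["aarch64/nvidia/grace"],
                       ["eessi.io-2023.06-software"]))
print(instance_matches(filters, "eessi-bot-mc-azure",
                       ["x86_64/amd/zen4"],
                       ["eessi.io-2023.06-compat", "eessi.io-2023.06-software"]))
```

Under this assumed rule, the instance filter alone is enough to rule out the other bot instances, which would be consistent with them reporting that no jobs were submitted.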

@eessi-bot

eessi-bot bot commented Mar 28, 2025

Updates by the bot instance eessi-bot-mc-aws (click for details)
  • received bot command build instance:trz42-GH200-jr repository:eessi.io-2023.06-software architecture:aarch64/nvidia/grace from trz42

    • expanded format: build instance:trz42-GH200-jr repository:eessi.io-2023.06-software architecture:aarch64/nvidia/grace
  • handling command build instance:trz42-GH200-jr repository:eessi.io-2023.06-software architecture:aarch64/nvidia/grace resulted in:

    • no jobs were submitted

@eessi-bot

eessi-bot bot commented Mar 28, 2025

Updates by the bot instance eessi-bot-mc-azure (click for details)
  • received bot command build instance:trz42-GH200-jr repository:eessi.io-2023.06-software architecture:aarch64/nvidia/grace from trz42

    • expanded format: build instance:trz42-GH200-jr repository:eessi.io-2023.06-software architecture:aarch64/nvidia/grace
  • handling command build instance:trz42-GH200-jr repository:eessi.io-2023.06-software architecture:aarch64/nvidia/grace resulted in:

    • no jobs were submitted

@eessi-bot-trz42

eessi-bot-trz42 bot commented Mar 28, 2025

Updates by the bot instance trz42-GH200-jr (click for details)
  • received bot command build instance:trz42-GH200-jr repository:eessi.io-2023.06-software architecture:aarch64/nvidia/grace from trz42

    • expanded format: build instance:trz42-GH200-jr repository:eessi.io-2023.06-software architecture:aarch64/nvidia/grace
  • handling command build instance:trz42-GH200-jr repository:eessi.io-2023.06-software architecture:aarch64/nvidia/grace resulted in:

@eessi-bot-toprichard

Updates by the bot instance rt-Grace-jr (click for details)
  • account trz42 has NO permission to send commands to the bot

@eessi-bot-trz42

eessi-bot-trz42 bot commented Mar 28, 2025

New job on instance trz42-GH200-jr for CPU micro-architecture aarch64-nvidia-grace for repository eessi.io-2023.06-software in job dir /p/project1/ceasybuilders/bot-trz42/jobs/2025.03/pr_987/13544938

  • ReFrame tests now fail with
    .../stage/BotBuildTests/aarch64_nvidia_grace/default/EESSI_OSU_coll_775175bf/rfm_job.sh: line 7: mpirun: command not found
    
  • there are several other issues with ReFrame, plus some failing CI checks (updated apptainer version); all of these could be addressed in a separate PR
date | job status | comment
Mar 28 19:13:40 UTC 2025 | submitted | job id 13544938 awaits release by job manager
Mar 28 19:13:53 UTC 2025 | released | job awaits launch by Slurm scheduler
Mar 28 19:14:57 UTC 2025 | running | job 13544938 is running
Mar 28 20:54:40 UTC 2025 | finished |
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-13544938.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-aarch64-nvidia-grace-1743194868.tar.gz
size: 129 MiB (136046255 bytes)
entries: 5256
modules under 2023.06/software/linux/aarch64/nvidia/grace/modules/all
GROMACS/2024.1-foss-2023b.lua
NLTK/3.8.1-foss-2023b.lua
Valgrind/3.23.0-gompi-2023b.lua
mpi4py/3.1.5-gompi-2023b.lua
networkx/3.2.1-gfbf-2023b.lua
scikit-build-core/0.9.3-GCCcore-13.2.0.lua
scikit-learn/1.4.0-gfbf-2023b.lua
tqdm/4.66.2-GCCcore-13.2.0.lua
software under 2023.06/software/linux/aarch64/nvidia/grace/software
GROMACS/2024.1-foss-2023b
NLTK/3.8.1-foss-2023b
Valgrind/3.23.0-gompi-2023b
mpi4py/3.1.5-gompi-2023b
networkx/3.2.1-gfbf-2023b
scikit-build-core/0.9.3-GCCcore-13.2.0
scikit-learn/1.4.0-gfbf-2023b
tqdm/4.66.2-GCCcore-13.2.0
other under 2023.06/software/linux/aarch64/nvidia/grace
no other files in tarball
Mar 28 20:54:40 UTC 2025 | test result |
😢 FAILURE (click triangle for details)
Reason
EESSI test suite produced failures.
ReFrame Summary
[ FAILED ] Ran 9/9 test case(s) from 9 check(s) (9 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-13544938.out
❌ found message matching ERROR:
❌ found message matching [\s*FAILED\s*].*Ran .* test case
Mar 28 22:00:19 UTC 2025 | uploaded | transfer of eessi-2023.06-software-linux-aarch64-nvidia-grace-1743194868.tar.gz to S3 bucket succeeded
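
The build and test verdicts in the report above come from pattern checks on the job output. The following Python sketch shows roughly how such checks could be expressed. It is an illustration under assumptions, not the bot's actual code: the helper names `build_succeeded` and `parse_reframe_summary` are hypothetical, the patterns are treated as plain substrings, and the regular expression is written only to fit the summary line quoted in the report.

```python
import re

# Strings copied from the check list in the report above.
FORBIDDEN = ["FATAL:", "ERROR:", "FAILED:", "required modules missing:"]
REQUIRED = ["No missing installations", ".tar.gz created!"]

# Fits lines such as:
# [ FAILED ] Ran 9/9 test case(s) from 9 check(s) (9 failure(s), 0 skipped, 0 aborted)
SUMMARY_RE = re.compile(
    r"\[\s*(?P<status>\w+)\s*\]\s+Ran (?P<ran>\d+)/(?P<total>\d+) test case\(s\) "
    r"from (?P<checks>\d+) check\(s\) "
    r"\((?P<failures>\d+) failure\(s\), (?P<skipped>\d+) skipped, "
    r"(?P<aborted>\d+) aborted\)"
)


def build_succeeded(text):
    """True if no forbidden string occurs and every required string occurs."""
    no_forbidden = not any(pattern in text for pattern in FORBIDDEN)
    all_required = all(pattern in text for pattern in REQUIRED)
    return no_forbidden and all_required


def parse_reframe_summary(text):
    """Return the counters from a ReFrame summary line, or None if absent."""
    match = SUMMARY_RE.search(text)
    if match is None:
        return None
    result = {key: int(value)
              for key, value in match.groupdict().items()
              if key != "status"}
    result["status"] = match.group("status")
    return result


summary = parse_reframe_summary(
    "[ FAILED ] Ran 9/9 test case(s) from 9 check(s) "
    "(9 failure(s), 0 skipped, 0 aborted)"
)
print(summary)
```

Keeping the build check and the test-suite check separate mirrors the report, where the build is marked as a success while the test result is a failure.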

Collaborator

@TopRichard TopRichard left a comment

lgtm

TopRichard added the label ready-to-deploy (Mark a PR as ready to deploy) and removed the label ready-to-review on Mar 28, 2025
trz42 added the label bot:deploy (Ask bot to deploy missing software installations to EESSI) and removed the label ready-to-deploy (Mark a PR as ready to deploy) on Mar 28, 2025
@eessi-bot-toprichard

Label bot:deploy has been set by user trz42, but this person does not have permission to trigger deployments

@trz42
Collaborator Author

trz42 commented Mar 28, 2025

Tarball ingested and software packages available via /cvmfs
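
One way to double-check a statement like this is to look for the module files listed in the build report. The Python sketch below is only an illustration: `missing_modules` is a hypothetical helper, the module list is copied from the report in this thread, and the `/cvmfs/software.eessi.io/versions` prefix in the usage comment is an assumption about where the repository is mounted.

```python
from pathlib import Path

# Module files listed in the build report for this PR.
MODULES = [
    "GROMACS/2024.1-foss-2023b.lua",
    "NLTK/3.8.1-foss-2023b.lua",
    "Valgrind/3.23.0-gompi-2023b.lua",
    "mpi4py/3.1.5-gompi-2023b.lua",
    "networkx/3.2.1-gfbf-2023b.lua",
    "scikit-build-core/0.9.3-GCCcore-13.2.0.lua",
    "scikit-learn/1.4.0-gfbf-2023b.lua",
    "tqdm/4.66.2-GCCcore-13.2.0.lua",
]


def missing_modules(modules_root, modules=MODULES):
    """Return the module files that do not exist below modules_root."""
    root = Path(modules_root)
    return [name for name in modules if not (root / name).is_file()]


# Usage on a client with the repository mounted (the prefix is an assumption):
# missing_modules("/cvmfs/software.eessi.io/versions/2023.06/software/"
#                 "linux/aarch64/nvidia/grace/modules/all")
```

An empty list would mean that all eight module files from the tarball are visible at that location.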

@trz42 trz42 merged commit e5872d2 into EESSI:2023.06-software.eessi.io Mar 28, 2025
52 of 59 checks passed
@eessi-bot

eessi-bot bot commented Mar 28, 2025

PR merged! Moved [] to /project/def-users/SHARED/trash_bin/EESSI/software-layer/2025.03.28

@eessi-bot-trz42

PR merged! Moved ['/p/project1/ceasybuilders/bot-trz42/jobs/2025.03/pr_987/13544938'] to /p/project1/ceasybuilders/bot-trz42/trash_bin/EESSI/software-layer/2025.03.28
