@bedroge
Collaborator

bedroge commented Apr 30, 2024

Using this PR to debug the issue observed in #546.

@eessi-bot

eessi-bot bot commented Apr 30, 2024

Instance eessi-bot-mc-aws is configured to build:

  • arch x86_64/generic for repo eessi-hpc.org-2023.06-compat
  • arch x86_64/generic for repo eessi-hpc.org-2023.06-software
  • arch x86_64/generic for repo eessi.io-2023.06-compat
  • arch x86_64/generic for repo eessi.io-2023.06-software
  • arch x86_64/intel/haswell for repo eessi-hpc.org-2023.06-compat
  • arch x86_64/intel/haswell for repo eessi-hpc.org-2023.06-software
  • arch x86_64/intel/haswell for repo eessi.io-2023.06-compat
  • arch x86_64/intel/haswell for repo eessi.io-2023.06-software
  • arch x86_64/intel/skylake_avx512 for repo eessi-hpc.org-2023.06-compat
  • arch x86_64/intel/skylake_avx512 for repo eessi-hpc.org-2023.06-software
  • arch x86_64/intel/skylake_avx512 for repo eessi.io-2023.06-compat
  • arch x86_64/intel/skylake_avx512 for repo eessi.io-2023.06-software
  • arch x86_64/amd/zen2 for repo eessi-hpc.org-2023.06-compat
  • arch x86_64/amd/zen2 for repo eessi-hpc.org-2023.06-software
  • arch x86_64/amd/zen2 for repo eessi.io-2023.06-compat
  • arch x86_64/amd/zen2 for repo eessi.io-2023.06-software
  • arch x86_64/amd/zen3 for repo eessi-hpc.org-2023.06-compat
  • arch x86_64/amd/zen3 for repo eessi-hpc.org-2023.06-software
  • arch x86_64/amd/zen3 for repo eessi.io-2023.06-compat
  • arch x86_64/amd/zen3 for repo eessi.io-2023.06-software
  • arch aarch64/generic for repo eessi-hpc.org-2023.06-compat
  • arch aarch64/generic for repo eessi-hpc.org-2023.06-software
  • arch aarch64/generic for repo eessi.io-2023.06-compat
  • arch aarch64/generic for repo eessi.io-2023.06-software
  • arch aarch64/neoverse_n1 for repo eessi-hpc.org-2023.06-compat
  • arch aarch64/neoverse_n1 for repo eessi-hpc.org-2023.06-software
  • arch aarch64/neoverse_n1 for repo eessi.io-2023.06-compat
  • arch aarch64/neoverse_n1 for repo eessi.io-2023.06-software
  • arch aarch64/neoverse_v1 for repo eessi-hpc.org-2023.06-compat
  • arch aarch64/neoverse_v1 for repo eessi-hpc.org-2023.06-software
  • arch aarch64/neoverse_v1 for repo eessi.io-2023.06-compat
  • arch aarch64/neoverse_v1 for repo eessi.io-2023.06-software

@eessi-bot

eessi-bot bot commented Apr 30, 2024

Instance eessi-bot-mc-azure is configured to build:

  • arch x86_64/amd/zen4 for repo eessi-hpc.org-2023.06-compat
  • arch x86_64/amd/zen4 for repo eessi-hpc.org-2023.06-software
  • arch x86_64/amd/zen4 for repo eessi.io-2023.06-compat
  • arch x86_64/amd/zen4 for repo eessi.io-2023.06-software

@bedroge
Collaborator Author

bedroge commented Apr 30, 2024

bot: build repo:eessi.io-2023.06-software arch:x86_64/amd/zen3

@eessi-bot

eessi-bot bot commented Apr 30, 2024

Updates by the bot instance eessi-bot-mc-aws
  • received bot command build repo:eessi.io-2023.06-software arch:x86_64/amd/zen3 from bedroge

    • expanded format: build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen3
  • handling command build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen3 resulted in:

@eessi-bot

eessi-bot bot commented Apr 30, 2024

Updates by the bot instance eessi-bot-mc-azure
  • received bot command build repo:eessi.io-2023.06-software arch:x86_64/amd/zen3 from bedroge

    • expanded format: build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen3
  • handling command build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen3 resulted in:

    • no jobs were submitted
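
The two bot updates above show the short command being expanded (`repo` to `repository`, `arch` to `architecture`) and only one instance acting on it. A minimal sketch of that behaviour follows, assuming an instance submits a job only when the requested architecture/repository pair is in its configured list. The function names and the `INSTANCES` table are hypothetical and are not taken from the eessi-bot code; the table holds only a subset of the pairs listed earlier in this thread.

```python
# Hypothetical sketch; not the eessi-bot implementation.
ABBREVIATIONS = {"repo": "repository", "arch": "architecture"}

# Subset of the (architecture, repository) pairs listed by each instance above.
INSTANCES = {
    "eessi-bot-mc-aws": {
        ("x86_64/amd/zen2", "eessi.io-2023.06-software"),
        ("x86_64/amd/zen3", "eessi.io-2023.06-software"),
    },
    "eessi-bot-mc-azure": {
        ("x86_64/amd/zen4", "eessi.io-2023.06-software"),
    },
}


def parse_command(comment: str) -> tuple[str, dict[str, str]]:
    """Split 'bot: build repo:X arch:Y' into an action and expanded filters."""
    prefix, _, rest = comment.partition(":")
    if prefix.strip() != "bot":
        raise ValueError("not a bot command")
    action, *args = rest.split()
    filters = {}
    for arg in args:
        key, _, value = arg.partition(":")
        # Expand abbreviated keys; unknown keys are kept unchanged.
        filters[ABBREVIATIONS.get(key, key)] = value
    return action, filters


def expanded_format(action: str, filters: dict[str, str]) -> str:
    """Render the command with full key names, as in the bot's update comment."""
    return " ".join([action] + [f"{key}:{value}" for key, value in filters.items()])


def instances_that_build(filters: dict[str, str]) -> list[str]:
    """Return the instances whose configuration contains the requested pair."""
    wanted = (filters["architecture"], filters["repository"])
    return [name for name, pairs in INSTANCES.items() if wanted in pairs]


action, filters = parse_command(
    "bot: build repo:eessi.io-2023.06-software arch:x86_64/amd/zen3"
)
print(expanded_format(action, filters))
print(instances_that_build(filters))  # only the AWS instance lists zen3
```

Under that assumption, the Azure instance reports "no jobs were submitted" because it is configured only for x86_64/amd/zen4.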

@eessi-bot

eessi-bot bot commented Apr 30, 2024

New job on instance eessi-bot-mc-aws for architecture x86_64-amd-zen3 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2024.04/pr_555/10006

date | job status | comment
Apr 30 11:44:32 UTC 2024 | submitted | job id 10006 awaits release by job manager
Apr 30 11:45:08 UTC 2024 | released | job awaits launch by Slurm scheduler
Apr 30 11:46:10 UTC 2024 | running | job 10006 is running
Apr 30 11:57:25 UTC 2024 | finished | 😢 FAILURE
Details
✅ job output file slurm-10006.out
❌ found message matching ERROR:
❌ found message matching FAILED:
❌ found message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
No artefacts were created or found.
Apr 30 11:57:25 UTC 2024 | test result | 😁 SUCCESS
ReFrame Summary
[ PASSED ] Ran 9/9 test case(s) from 9 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-10006.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case
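
The "Details" list for the finished job reads like a set of presence/absence checks on the Slurm output file. A minimal sketch of such an evaluation is shown below, assuming the job counts as successful only when every check passes and that the messages are matched as plain substrings. `CHECKS` and `evaluate` are hypothetical names; the bot's actual logic may differ.

```python
# Hypothetical sketch of the checks listed in the bot comment above.
# Each entry is (message, must_be_present).
CHECKS = [
    ("ERROR:", False),
    ("FAILED:", False),
    ("required modules missing:", False),
    ("No missing installations", True),
    (".tar.gz created!", True),
]


def evaluate(output: str) -> tuple[bool, list[str]]:
    """Return overall success and one report line per check."""
    all_passed = True
    report = []
    for message, must_be_present in CHECKS:
        found = message in output  # assumption: literal substring match
        passed = found == must_be_present
        all_passed = all_passed and passed
        mark = "✅" if passed else "❌"
        state = "found" if found else "no"
        report.append(f"{mark} {state} message matching {message}")
    return all_passed, report


# Usage with a made-up log excerpt (not taken from slurm-10006.out).
sample = "ERROR: something went wrong\n/tmp/example.tar.gz created!\n"
ok, lines = evaluate(sample)
print("SUCCESS" if ok else "FAILURE")
print("\n".join(lines))
```

With these rules a job can report `.tar.gz created!` and still be marked as a failure, which matches the pattern seen in this comment.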

@bedroge
Collaborator Author

bedroge commented Apr 30, 2024

bot: build repo:eessi.io-2023.06-software arch:x86_64/amd/zen3

@eessi-bot

eessi-bot bot commented Apr 30, 2024

Updates by the bot instance eessi-bot-mc-aws
  • received bot command build repo:eessi.io-2023.06-software arch:x86_64/amd/zen3 from bedroge

    • expanded format: build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen3
  • handling command build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen3 resulted in:

@eessi-bot

eessi-bot bot commented Apr 30, 2024

Updates by the bot instance eessi-bot-mc-azure
  • received bot command build repo:eessi.io-2023.06-software arch:x86_64/amd/zen3 from bedroge

    • expanded format: build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen3
  • handling command build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen3 resulted in:

    • no jobs were submitted

@eessi-bot

eessi-bot bot commented Apr 30, 2024

New job on instance eessi-bot-mc-aws for architecture x86_64-amd-zen3 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2024.04/pr_555/10007

date | job status | comment
Apr 30 11:50:59 UTC 2024 | submitted | job id 10007 awaits release by job manager
Apr 30 11:51:16 UTC 2024 | released | job awaits launch by Slurm scheduler
Apr 30 11:56:24 UTC 2024 | running | job 10007 is running
Apr 30 12:08:47 UTC 2024 | finished | 😢 FAILURE
Details
✅ job output file slurm-10007.out
❌ found message matching ERROR:
❌ found message matching FAILED:
❌ found message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
No artefacts were created or found.
Apr 30 12:08:47 UTC 2024 | test result | 😁 SUCCESS
ReFrame Summary
[ PASSED ] Ran 9/9 test case(s) from 9 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-10007.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case
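
The ReFrame summary line in the test result can be broken into its counts with a regular expression. The sketch below parses the line exactly as it appears in this comment. The square brackets are escaped in the pattern, which is an assumption about what the displayed check `[\s*FAILED\s*].*Ran .* test case` is meant to express. `SUMMARY` and `parse_summary` are hypothetical names.

```python
import re

# Hypothetical pattern for the summary line shown in the bot comment.
SUMMARY = re.compile(
    r"\[\s*(?P<status>PASSED|FAILED)\s*\]\s*"
    r"Ran (?P<ran>\d+)/(?P<total>\d+) test case\(s\) "
    r"from (?P<checks>\d+) check\(s\) "
    r"\((?P<failures>\d+) failure\(s\), "
    r"(?P<skipped>\d+) skipped, (?P<aborted>\d+) aborted\)"
)


def parse_summary(line: str) -> dict:
    """Return the status and integer counts from a ReFrame summary line."""
    match = SUMMARY.search(line)
    if match is None:
        raise ValueError("no ReFrame summary found")
    fields = match.groupdict()
    status = fields.pop("status")
    return {"status": status, **{key: int(value) for key, value in fields.items()}}


line = (
    "[ PASSED ] Ran 9/9 test case(s) from 9 check(s) "
    "(0 failure(s), 0 skipped, 0 aborted)"
)
print(parse_summary(line))
```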

@bedroge
Collaborator Author

bedroge commented Apr 30, 2024

bot: build repo:eessi.io-2023.06-software arch:x86_64/amd/zen3

@eessi-bot

eessi-bot bot commented Apr 30, 2024

Updates by the bot instance eessi-bot-mc-aws
  • received bot command build repo:eessi.io-2023.06-software arch:x86_64/amd/zen3 from bedroge

    • expanded format: build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen3
  • handling command build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen3 resulted in:

@eessi-bot

eessi-bot bot commented Apr 30, 2024

Updates by the bot instance eessi-bot-mc-azure
  • received bot command build repo:eessi.io-2023.06-software arch:x86_64/amd/zen3 from bedroge

    • expanded format: build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen3
  • handling command build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen3 resulted in:

    • no jobs were submitted

@eessi-bot

eessi-bot bot commented Apr 30, 2024

New job on instance eessi-bot-mc-aws for architecture x86_64-amd-zen3 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2024.04/pr_555/10008

date | job status | comment
Apr 30 12:00:21 UTC 2024 | submitted | job id 10008 awaits release by job manager
Apr 30 12:00:30 UTC 2024 | released | job awaits launch by Slurm scheduler
Apr 30 12:01:34 UTC 2024 | running | job 10008 is running
Apr 30 12:23:23 UTC 2024 | finished | 😢 FAILURE
Details
✅ job output file slurm-10008.out
❌ found message matching ERROR:
❌ found message matching FAILED:
❌ found message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-amd-zen3-1714479127.tar.gz
size: 111 MiB (116574222 bytes)
entries: 10238
modules under 2023.06/software/linux/x86_64/amd/zen3/modules/all
Python/3.11.5-GCCcore-13.2.0.lua
software under 2023.06/software/linux/x86_64/amd/zen3/software
Python/3.11.5-GCCcore-13.2.0
other under 2023.06/software/linux/x86_64/amd/zen3
no other files in tarball
Apr 30 12:23:23 UTC 2024 | test result | 😁 SUCCESS
ReFrame Summary
[ PASSED ] Ran 9/9 test case(s) from 9 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-10008.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case
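
The "Artefacts" section above lists the entry count and the modules and software found under the architecture prefix in the tarball. A rough sketch of how such a summary could be derived with Python's standard `tarfile` module follows. `summarize_tarball` is a hypothetical helper; whether the bot counts directory entries the same way is an assumption.

```python
import tarfile
from pathlib import PurePosixPath


def summarize_tarball(path: str, prefix: str) -> dict:
    """Count entries and collect module files and software dirs under prefix."""
    modules, software = set(), set()
    entries = 0
    with tarfile.open(path, "r:gz") as tar:
        for member in tar:
            entries += 1  # assumption: every tar member counts as one entry
            try:
                rel = PurePosixPath(member.name).relative_to(prefix)
            except ValueError:
                continue  # member lives outside the architecture prefix
            parts = rel.parts
            if parts[:2] == ("modules", "all") and len(parts) == 4:
                # e.g. modules/all/Python/3.11.5-GCCcore-13.2.0.lua
                modules.add("/".join(parts[2:4]))
            elif parts[:1] == ("software",) and len(parts) >= 3:
                # e.g. software/Python/3.11.5-GCCcore-13.2.0/...
                software.add("/".join(parts[1:3]))
    return {
        "entries": entries,
        "modules": sorted(modules),
        "software": sorted(software),
    }
```

For the tarball reported here, the prefix would be `2023.06/software/linux/x86_64/amd/zen3`. A tarball that lists only Python, as this one does, indicates that the hatchling installation did not end up in it.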

@bedroge
Collaborator Author

bedroge commented Apr 30, 2024

bot: build repo:eessi.io-2023.06-software arch:x86_64/amd/zen3

@eessi-bot

eessi-bot bot commented Apr 30, 2024

Updates by the bot instance eessi-bot-mc-aws
  • received bot command build repo:eessi.io-2023.06-software arch:x86_64/amd/zen3 from bedroge

    • expanded format: build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen3
  • handling command build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen3 resulted in:

@eessi-bot

eessi-bot bot commented Apr 30, 2024

Updates by the bot instance eessi-bot-mc-azure
  • received bot command build repo:eessi.io-2023.06-software arch:x86_64/amd/zen3 from bedroge

    • expanded format: build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen3
  • handling command build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen3 resulted in:

    • no jobs were submitted

@eessi-bot

eessi-bot bot commented Apr 30, 2024

New job on instance eessi-bot-mc-aws for architecture x86_64-amd-zen3 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2024.04/pr_555/10009

date | job status | comment
Apr 30 12:08:46 UTC 2024 | submitted | job id 10009 awaits release by job manager
Apr 30 12:09:50 UTC 2024 | released | job awaits launch by Slurm scheduler
Apr 30 12:10:53 UTC 2024 | running | job 10009 is running
Apr 30 12:33:42 UTC 2024 | finished | 😢 FAILURE
Details
✅ job output file slurm-10009.out
❌ found message matching ERROR:
❌ found message matching FAILED:
❌ found message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-amd-zen3-1714479706.tar.gz
size: 111 MiB (116582197 bytes)
entries: 10238
modules under 2023.06/software/linux/x86_64/amd/zen3/modules/all
Python/3.11.5-GCCcore-13.2.0.lua
software under 2023.06/software/linux/x86_64/amd/zen3/software
Python/3.11.5-GCCcore-13.2.0
other under 2023.06/software/linux/x86_64/amd/zen3
no other files in tarball
Apr 30 12:33:42 UTC 2024 | test result | 😁 SUCCESS
ReFrame Summary
[ PASSED ] Ran 9/9 test case(s) from 9 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-10009.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case
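
The artefact file name appears to encode the EESSI version, the layer, the OS, the CPU target, and a trailing number that looks like a Unix timestamp. The sketch below splits the name on that assumption; the field names and the `parse_artefact_name` helper are hypothetical, and the naming scheme itself is inferred from the names in this thread rather than from documentation.

```python
import re
from datetime import datetime, timezone

# Hypothetical pattern inferred from the artefact names in this thread.
ARTEFACT = re.compile(
    r"eessi-(?P<version>[\d.]+)-(?P<layer>[a-z]+)-(?P<os>[a-z]+)-"
    r"(?P<target>.+)-(?P<timestamp>\d+)\.tar\.gz$"
)


def parse_artefact_name(name: str) -> dict:
    """Split an artefact name into fields; the timestamp becomes a UTC datetime."""
    match = ARTEFACT.match(name)
    if match is None:
        raise ValueError(f"unexpected artefact name: {name}")
    fields = match.groupdict()
    # Assumption: the trailing number is seconds since the Unix epoch.
    fields["created"] = datetime.fromtimestamp(int(fields.pop("timestamp")), tz=timezone.utc)
    return fields


info = parse_artefact_name(
    "eessi-2023.06-software-linux-x86_64-amd-zen3-1714479706.tar.gz"
)
print(info["target"])   # x86_64-amd-zen3
print(info["created"])  # 2024-04-30 12:21:46+00:00
```

Read this way, the timestamp falls inside the window in which job 10009 was running (12:10:53 to 12:33:42 UTC), which is consistent with the tarball being created during that job.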

@bedroge
Collaborator Author

bedroge commented Apr 30, 2024

bot: build repo:eessi.io-2023.06-software arch:x86_64/amd/zen3

@eessi-bot

eessi-bot bot commented Apr 30, 2024

Updates by the bot instance eessi-bot-mc-aws
  • received bot command build repo:eessi.io-2023.06-software arch:x86_64/amd/zen3 from bedroge

    • expanded format: build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen3
  • handling command build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen3 resulted in:

@eessi-bot

eessi-bot bot commented Apr 30, 2024

Updates by the bot instance eessi-bot-mc-azure
  • received bot command build repo:eessi.io-2023.06-software arch:x86_64/amd/zen3 from bedroge

    • expanded format: build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen3
  • handling command build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen3 resulted in:

    • no jobs were submitted

@eessi-bot

eessi-bot bot commented Apr 30, 2024

New job on instance eessi-bot-mc-aws for architecture x86_64-amd-zen3 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2024.04/pr_555/10010

date | job status | comment
Apr 30 12:15:06 UTC 2024 | submitted | job id 10010 awaits release by job manager
Apr 30 12:16:03 UTC 2024 | released | job awaits launch by Slurm scheduler
Apr 30 12:22:20 UTC 2024 | running | job 10010 is running
Apr 30 12:44:57 UTC 2024 | finished | 😢 FAILURE
Details
✅ job output file slurm-10010.out
❌ found message matching ERROR:
❌ found message matching FAILED:
❌ found message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-amd-zen3-1714480420.tar.gz
size: 111 MiB (116569691 bytes)
entries: 10238
modules under 2023.06/software/linux/x86_64/amd/zen3/modules/all
Python/3.11.5-GCCcore-13.2.0.lua
software under 2023.06/software/linux/x86_64/amd/zen3/software
Python/3.11.5-GCCcore-13.2.0
other under 2023.06/software/linux/x86_64/amd/zen3
no other files in tarball
Apr 30 12:44:57 UTC 2024 | test result | 😁 SUCCESS
ReFrame Summary
[ PASSED ] Ran 9/9 test case(s) from 9 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-10010.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@bedroge
Collaborator Author

bedroge commented Apr 30, 2024

I manually added write permissions to /cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen3/software/hatchling/1.18.0-GCCcore-13.2.0/ on the Stratum 0. Let's try again 🤞

@bedroge
Collaborator Author

bedroge commented Apr 30, 2024

bot: build repo:eessi.io-2023.06-software arch:x86_64/amd/zen3

@eessi-bot

eessi-bot bot commented Apr 30, 2024

Updates by the bot instance eessi-bot-mc-aws
  • received bot command build repo:eessi.io-2023.06-software arch:x86_64/amd/zen3 from bedroge

    • expanded format: build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen3
  • handling command build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen3 resulted in:

@eessi-bot

eessi-bot bot commented Apr 30, 2024

Updates by the bot instance eessi-bot-mc-azure
  • received bot command build repo:eessi.io-2023.06-software arch:x86_64/amd/zen3 from bedroge

    • expanded format: build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen3
  • handling command build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen3 resulted in:

    • no jobs were submitted

@eessi-bot

eessi-bot bot commented Apr 30, 2024

New job on instance eessi-bot-mc-aws for architecture x86_64-amd-zen3 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2024.04/pr_555/10013

date | job status | comment
Apr 30 12:51:56 UTC 2024 | submitted | job id 10013 awaits release by job manager
Apr 30 12:52:01 UTC 2024 | released | job awaits launch by Slurm scheduler
Apr 30 12:53:03 UTC 2024 | running | job 10013 is running
Apr 30 13:15:31 UTC 2024 | finished | 😁 SUCCESS
Details
✅ job output file slurm-10013.out
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-amd-zen3-1714482246.tar.gz
size: 111 MiB (117080049 bytes)
entries: 10590
modules under 2023.06/software/linux/x86_64/amd/zen3/modules/all
hatchling/1.18.0-GCCcore-13.2.0.lua
Python/3.11.5-GCCcore-13.2.0.lua
software under 2023.06/software/linux/x86_64/amd/zen3/software
hatchling/1.18.0-GCCcore-13.2.0
Python/3.11.5-GCCcore-13.2.0
other under 2023.06/software/linux/x86_64/amd/zen3
no other files in tarball
Apr 30 13:15:31 UTC 2024 | test result | 😁 SUCCESS
ReFrame Summary
[ PASSED ] Ran 9/9 test case(s) from 9 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-10013.out
✅ no message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@bedroge
Collaborator Author

bedroge commented Apr 30, 2024

I've tried several things here: purging the CVMFS cache between the removal and build steps, adding write permissions before removing the installation directories, removing only the contents of the installation directories (and not the directories themselves), and moving instead of removing. None of these solved the issue. In the end, I added write permissions to the hatchling directory on the Stratum 0, and that did solve it. So I'll do that for all CPU targets, and then try rebuilding the apps in #546.
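
For illustration only, adding the owner-write bit to a directory can be sketched as below. This is not the command that was run on the Stratum 0, which is not shown in this thread; `add_user_write` and its `recursive` flag are hypothetical, and whether the permissions were changed recursively is not stated.

```python
import os
import stat


def add_user_write(path: str, recursive: bool = False) -> None:
    """Add the owner-write bit to path, and optionally to everything below it."""
    targets = [path]
    if recursive:
        for root, dirs, files in os.walk(path):
            targets += [os.path.join(root, name) for name in dirs + files]
    for target in targets:
        if os.path.islink(target):
            continue  # leave symlinks alone; chmod would follow them
        mode = stat.S_IMODE(os.stat(target).st_mode)
        os.chmod(target, mode | stat.S_IWUSR)
```

A call such as `add_user_write("/path/to/hatchling/1.18.0-GCCcore-13.2.0")` would touch only the directory itself; the path here is a placeholder.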
