Skip to content

Conversation

@casparvl
Copy link

No description provided.

@casparvl
Copy link
Author

bot:show_config

1 similar comment
@casparvl
Copy link
Author

bot:show_config

@casparvl
Copy link
Author

bot:status

@surf-hpcv-bot
Copy link

surf-hpcv-bot bot commented Jul 30, 2025

This is the status of all the bot: build commands:

on for repo result date status url

@casparvl
Copy link
Author

bot:show_config

@casparvl
Copy link
Author

Since we don't have our own repo yet (and thus we don't have a repo config for it), let's, for now, build for eessi.io-2023.06-software

bot:build instance:surf-hpcv-bot repo:eessi.io-2023.06-software on:arch=zen2 for:arch=x86_64/amd/zen2,accel=nvidia/cc80

@casparvl
Copy link
Author

Hm, I don't know why it actually isn't doing anything...

[20250730-T09:58:06] [handle_issue_comment_event]: Comment in https://api.github.com/repos/SURF-hpcv/software-layer/issues/1 (owned by @casparvl) created by @casparvl
[20250730-T09:58:06] [handle_issue_comment_event]: comment action 'created' is handled
[20250730-T09:58:06] [handle_issue_comment_event]: account `casparvl` has permission to send commands to bot
[20250730-T09:58:06] [handle_issue_comment_event]: found bot command: 'build instance:surf-hpcv-bot repo:eessi.io-2023.06-software on:arch=zen2 for:arch=x86_64/amd/zen2,accel=nvidia/cc80'
[20250730-T09:58:06] [handle_issue_comment_event]: comment response: '
- received bot command `build instance:surf-hpcv-bot repo:eessi.io-2023.06-software on:arch=zen2 for:arch=x86_64/amd/zen2,accel=nvidia/cc80` from `casparvl`
  - expanded format: `build architecture:zen2 instance:surf-hpcv-bot repository:eessi.io-2023.06-software`'
[20250730-T09:58:06] [handle_bot_command]: Handling bot command build
[20250730-T09:58:07] [handle_bot_command_build]: repository: 'SURF-hpcv/software-layer'

@casparvl
Copy link
Author

Hm, I must have gotten my app.cfg wrong:

Traceback (most recent call last):
  File "/home/casparvl/EESSI/bot-instance/venv_eessibot/lib64/python3.6/site-packages/pyghee/lib.py", line 194, in process_event
    self.handle_event(event_info, log_file=log_file)
  File "/home/casparvl/EESSI/bot-instance/venv_eessibot/lib64/python3.6/site-packages/pyghee/lib.py", line 124, in handle_event
    handler(event_info, log_file=log_file)
  File "/home/casparvl/EESSI/bot-instance/eessi-bot-software-layer/eessi_bot_event_handler.py", line 310, in handle_issue_comment_event
    update = self.handle_bot_command(event_info, cmd)
  File "/home/casparvl/EESSI/bot-instance/eessi-bot-software-layer/eessi_bot_event_handler.py", line 489, in handle_bot_command
    return handler(event_info, bot_command)
  File "/home/casparvl/EESSI/bot-instance/eessi-bot-software-layer/eessi_bot_event_handler.py", line 535, in handle_bot_command_build
    submitted_jobs = submit_build_jobs(pr, event_info, bot_command.action_filters, bot_command.build_params)
  File "/home/casparvl/EESSI/bot-instance/eessi-bot-software-layer/tasks/build.py", line 1052, in submit_build_jobs
    jobs = prepare_jobs(pr, cfg, event_info, action_filter, build_params)
  File "/home/casparvl/EESSI/bot-instance/eessi-bot-software-layer/tasks/build.py", line 575, in prepare_jobs
    node_map = get_node_types(cfg)
  File "/home/casparvl/EESSI/bot-instance/eessi-bot-software-layer/tasks/build.py", line 200, in get_node_types
    node_type_map = json.loads(node_types.get(config.NODE_TYPE_MAP))
  File "/usr/lib64/python3.6/json/__init__.py", line 354, in loads
    return _default_decoder.decode(s)
  File "/usr/lib64/python3.6/json/decoder.py", line 339, in decode
    obj, end = self.raw_decode(s, idx=_w(s, 0).end())
  File "/usr/lib64/python3.6/json/decoder.py", line 355, in raw_decode
    obj, end = self.scan_once(s, idx)
json.decoder.JSONDecodeError: Expecting property name enclosed in double quotes: line 79 column 3 (char 3343)

@casparvl
Copy link
Author

Ah, I didn't close it properly. Retrying:

bot:show_config

@surf-hpcv-bot
Copy link

surf-hpcv-bot bot commented Jul 30, 2025

Instance SURF-hpcv-bot is configured to build on:

  • Node type x86-64-generic:

    • OS: linux
    • CPU architecture: x86_64/generic
    • Repositories: ['eessi.io-2023.06-compat', 'eessi.io-2023.06-software', 'eessi.io-2025.06-compat', 'eessi.io-2025.06-software']
  • Node type x86-64-haswell:

    • OS: linux
    • CPU architecture: x86_64/intel/haswell
    • Repositories: ['eessi.io-2023.06-compat', 'eessi.io-2023.06-software', 'eessi.io-2025.06-compat', 'eessi.io-2025.06-software']
  • Node type x86-64-sapphirerapids:

    • OS: linux
    • CPU architecture: x86_64/intel/sapphirerapids
    • Repositories: ['eessi.io-2023.06-compat', 'eessi.io-2023.06-software', 'eessi.io-2025.06-compat', 'eessi.io-2025.06-software']
  • Node type x86-64-skylake:

    • OS: linux
    • CPU architecture: x86_64/intel/skylake_avx512
    • Repositories: ['eessi.io-2023.06-compat', 'eessi.io-2023.06-software', 'eessi.io-2025.06-compat', 'eessi.io-2025.06-software']
  • Node type x86-64-cascadelake:

    • OS: linux
    • CPU architecture: x86_64/intel/cascadelake
    • Repositories: ['eessi.io-2023.06-compat', 'eessi.io-2023.06-software', 'eessi.io-2025.06-compat', 'eessi.io-2025.06-software']
  • Node type x86-64-icelake:

    • OS: linux
    • CPU architecture: x86_64/intel/icelake
    • Repositories: ['eessi.io-2023.06-compat', 'eessi.io-2023.06-software', 'eessi.io-2025.06-compat', 'eessi.io-2025.06-software']
  • Node type x86-64-zen2:

    • OS: linux
    • CPU architecture: x86_64/amd/zen2
    • Repositories: ['eessi.io-2023.06-compat', 'eessi.io-2023.06-software', 'eessi.io-2025.06-compat', 'eessi.io-2025.06-software']
  • Node type x86-64-zen3:

    • OS: linux
    • CPU architecture: x86_64/amd/zen3
    • Repositories: ['eessi.io-2023.06-compat', 'eessi.io-2023.06-software', 'eessi.io-2025.06-compat', 'eessi.io-2025.06-software']
  • Node type x86-64-zen4:

    • OS: linux
    • CPU architecture: x86_64/amd/zen4
    • Repositories: ['eessi.io-2023.06-compat', 'eessi.io-2023.06-software', 'eessi.io-2025.06-compat', 'eessi.io-2025.06-software']
  • Node type aarch64-generic:

    • OS: linux
    • CPU architecture: aarch64/generic
    • Repositories: ['eessi.io-2023.06-compat', 'eessi.io-2023.06-software', 'eessi.io-2025.06-compat', 'eessi.io-2025.06-software']
  • Node type aarch64-neoverse_n1:

    • OS: linux
    • CPU architecture: aarch64/neoverse_n1
    • Repositories: ['eessi.io-2023.06-compat', 'eessi.io-2023.06-software', 'eessi.io-2025.06-compat', 'eessi.io-2025.06-software']
  • Node type aarch64-neoverse_v1:

    • OS: linux
    • CPU architecture: aarch64/neoverse_v1
    • Repositories: ['eessi.io-2023.06-compat', 'eessi.io-2023.06-software', 'eessi.io-2025.06-compat', 'eessi.io-2025.06-software']
  • Node type aarch64-graviton4:

    • OS: linux
    • CPU architecture: aarch64/aws/graviton4
    • Repositories: ['eessi.io-2023.06-compat', 'eessi.io-2023.06-software', 'eessi.io-2025.06-compat', 'eessi.io-2025.06-software']

@casparvl
Copy link
Author

Now that's more like it! Lets try a build:

bot:build instance:surf-hpcv-bot repo:eessi.io-2023.06-software on:arch=zen2 for:arch=x86_64/amd/zen2,accel=nvidia/cc80

@casparvl
Copy link
Author

Ah, in my app.cfg I called the bot SURF-hpc-bot, with caps. I'll undo that to make it similar to the app name in github. Now retrying:

bot:build instance:surf-hpcv-bot repo:eessi.io-2023.06-software on:arch=zen2 for:arch=x86_64/amd/zen2,accel=nvidia/cc80

@surf-hpcv-bot
Copy link

surf-hpcv-bot bot commented Jul 30, 2025

New job on instance surf-hpcv-bot for repository eessi.io-2023.06-software
Building on: amd-zen2
Building for: x86_64/amd/zen2 and accelerator nvidia/cc80
Job dir: /project/def-users/casparvl/hpcv-bot-instance/jobs/2025.07/pr_1/79559

date job status comment
Jul 30 10:08:38 UTC 2025 submitted job id 79559 awaits release by job manager
Jul 30 10:08:59 UTC 2025 released job awaits launch by Slurm scheduler
Jul 30 10:17:02 UTC 2025 running job 79559 is running
Jul 30 10:24:09 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-79559.out
✅ no message matching FATAL:
❌ found message matching ERROR:
❌ found message matching FAILED:
❌ found message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-amd-zen2-17538709920.tar.gzsize: 0 MiB (45 bytes)
entries: 0
modules under 2023.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc80/modules/all
no module files in tarball
software under 2023.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc80/software
no software packages in tarball
other under 2023.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc80
no other files in tarball
Jul 30 10:24:09 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-79559.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@casparvl
Copy link
Author

bot:status

@surf-hpcv-bot
Copy link

surf-hpcv-bot bot commented Jul 30, 2025

This is the status of all the bot: build commands:

on for repo result date status url

@casparvl
Copy link
Author

bot:status

@surf-hpcv-bot
Copy link

surf-hpcv-bot bot commented Jul 30, 2025

This is the status of all the bot: build commands:

on for repo result date status url
amd-zen2 x86_64/amd/zen2, nvidia/cc80 eessi.io-2023.06-software 😢 FAILURE Jul 30 10:24:09 UTC 2025 finished #1 (comment)

@casparvl
Copy link
Author

Btw, failure is

== 2025-07-30 10:22:23,882 modules.py:835 DEBUG Output of module command '/cvmfs/software.eessi.io/versions/2023.06/compat/linux/x86_64/usr/share/Lmod/libexec/lmod python load CUDA/12.1.1': stdout: _mlstatus = False
; stderr: Lmod has detected the following error:
You requested to load CUDA  but while the module file exists, the actual
software is not entirely shipped with EESSI due to licencing. You will need to
install a full copy of the CUDA package where EESSI can find it.
For more information on how to do this, see
https://www.eessi.io/docs/site_specific_config/gpu/.

While processing the following module(s):
    Module fullname  Module Filename
    ---------------  ---------------
    CUDA/12.1.1      /cvmfs/software.eessi.io/host_injections/2023.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc80/.modules/all/CUDA/12.1.1.lua

doesn't seem to be related to this particular bot instance, but something wrong with our LMOD hooks for CUDA maybe?

@casparvl
Copy link
Author

bot:build instance:surf-hpcv-bot repo:eessi.io-2023.06-software on:arch=zen2 for:arch=x86_64/amd/zen2,accel=nvidia/cc80

@surf-hpcv-bot
Copy link

surf-hpcv-bot bot commented Jul 31, 2025

New job on instance surf-hpcv-bot for repository eessi.io-2023.06-software
Building on: amd-zen2
Building for: x86_64/amd/zen2 and accelerator nvidia/cc80
Job dir: /project/def-users/casparvl/hpcv-bot-instance/jobs/2025.07/pr_1/79807

date job status comment
Jul 31 14:52:40 UTC 2025 submitted job id 79807 awaits release by job manager
Jul 31 14:52:54 UTC 2025 released job awaits launch by Slurm scheduler
Jul 31 14:54:57 UTC 2025 finished
🤷 UNKNOWN (click triangle for detailed information)
  • Job results file _bot_job79807.result does not exist in job directory or reading it failed.
  • No artefacts were found/reported.
Jul 31 14:54:57 UTC 2025 test result
🤷 UNKNOWN (click triangle for detailed information)
  • Job test file _bot_job79807.test does not exist in job directory or reading it failed.

@casparvl
Copy link
Author

not where I wanted to test this... cancelling

boegel and others added 16 commits August 5, 2025 13:50
Install BWA-0.7.18-GCCcore-12.3.0.eb as part of the DoitNow test builds on Deucalin
…023b

{2023.06}[2023b,a64fx] Lit and Astropy
also check on EESSI 2025.06 w.r.t. missing installations and software layer being in sync across CPU targets
…CC-13.3.0-14.2.0

{2025.06} GCC 13.3.0 + 14.2.0
@casparvl
Copy link
Author

bot:show_config

@casparvl
Copy link
Author

bot:build instance:surf-hpcv-bot repo:eessi.io-2023.06-software on:arch=zen2 for:arch=x86_64/amd/zen2,accel=nvidia/cc80

@casparvl
Copy link
Author

casparvl commented Oct 6, 2025

bot:show_config

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

7 participants