Skip to content

Conversation

@casparvl
Copy link
Collaborator

@casparvl casparvl commented Oct 30, 2025

I think we should deploy the script from EESSI/software-layer-scripts#120 through this current PR, then change the build.sh back to it's original form. The issue is that EESSI/software-layer-scripts#120 can't be deployed there, because no software is built, and thus no "no missing installations" message is printed. This causes the bot to consider the build step a 'failure'.

@casparvl
Copy link
Collaborator Author

bot: build repo:eessi.io-2025.06-software instance:eessi-bot-surf for:arch=x86_64/intel/icelake,accel=nvidia/cc80

@eessi-bot-surf
Copy link

eessi-bot-surf bot commented Oct 30, 2025

New job on instance eessi-bot-surf for repository eessi.io-2025.06-software
Building on: intel-icelake and accelerator nvidia/cc80
Building for: x86_64/intel/icelake and accelerator nvidia/cc80
Job dir: /projects/eessibot/eessi-bot-surf/jobs/2025.10/pr_1278/15643733

date job status comment
Oct 30 13:02:23 UTC 2025 submitted job id 15643733 will be eligible to start in about 20 seconds
Oct 30 13:02:30 UTC 2025 received job awaits launch by Slurm scheduler
Oct 30 13:02:55 UTC 2025 running job 15643733 is running
Oct 30 13:04:05 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-15643733.out
✅ no message matching FATAL:
❌ found message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2025.06-software-linux-x86_64-intel-icelake-accel-nvidia-cc80-17618294030.tar.gzsize: 0 MiB (45 bytes)
entries: 0
modules under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/modules/all
no module files in tarball
software under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/software
no software packages in tarball
reprod directories under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/reprod
no reprod directories in tarball
other under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80
no other files in tarball
Oct 30 13:04:05 UTC 2025 test result
😁 SUCCESS (click triangle for details)
ReFrame Summary
[ PASSED ] Ran 0/0 test case(s) from 0 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-15643733.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@casparvl
Copy link
Collaborator Author

bot: build repo:eessi.io-2025.06-software instance:eessi-bot-surf for:arch=x86_64/intel/icelake,accel=nvidia/cc80

@eessi-bot-surf
Copy link

eessi-bot-surf bot commented Oct 30, 2025

New job on instance eessi-bot-surf for repository eessi.io-2025.06-software
Building on: intel-icelake and accelerator nvidia/cc80
Building for: x86_64/intel/icelake and accelerator nvidia/cc80
Job dir: /projects/eessibot/eessi-bot-surf/jobs/2025.10/pr_1278/15644119

date job status comment
Oct 30 13:16:00 UTC 2025 submitted job id 15644119 will be eligible to start in about 20 seconds
Oct 30 13:16:11 UTC 2025 received job awaits launch by Slurm scheduler
Oct 30 13:16:24 UTC 2025 running job 15644119 is running
Oct 30 13:17:58 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-15644119.out
✅ no message matching FATAL:
❌ found message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2025.06-software-linux-x86_64-intel-icelake-accel-nvidia-cc80-17618302260.tar.gzsize: 0 MiB (421 bytes)
entries: 1
modules under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/modules/all
no module files in tarball
software under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/software
no software packages in tarball
reprod directories under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/reprod
no reprod directories in tarball
other under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80
2025.06/scripts/gpu_support/nvidia/easystacks/eessi-2025.06-eb-5.1.2-CUDA-host-injections.yml
Oct 30 13:17:58 UTC 2025 test result
😁 SUCCESS (click triangle for details)
ReFrame Summary
[ PASSED ] Ran 0/0 test case(s) from 0 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-15644119.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@casparvl
Copy link
Collaborator Author

bot: build repo:eessi.io-2025.06-software instance:eessi-bot-surf for:arch=x86_64/intel/icelake,accel=nvidia/cc80

@eessi-bot-surf
Copy link

eessi-bot-surf bot commented Oct 30, 2025

New job on instance eessi-bot-surf for repository eessi.io-2025.06-software
Building on: intel-icelake and accelerator nvidia/cc80
Building for: x86_64/intel/icelake and accelerator nvidia/cc80
Job dir: /projects/eessibot/eessi-bot-surf/jobs/2025.10/pr_1278/15644178

date job status comment
Oct 30 13:19:57 UTC 2025 submitted job id 15644178 will be eligible to start in about 20 seconds
Oct 30 13:20:03 UTC 2025 received job awaits launch by Slurm scheduler
Oct 30 13:20:26 UTC 2025 running job 15644178 is running
Oct 30 13:46:57 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-15644178.out
✅ no message matching FATAL:
❌ found message matching ERROR:
❌ found message matching FAILED:
✅ no message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2025.06-software-linux-x86_64-intel-icelake-accel-nvidia-cc80-17618319710.tar.gzsize: 0 MiB (420 bytes)
entries: 1
modules under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/modules/all
no module files in tarball
software under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/software
no software packages in tarball
reprod directories under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/reprod
no reprod directories in tarball
other under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80
2025.06/scripts/gpu_support/nvidia/easystacks/eessi-2025.06-eb-5.1.2-CUDA-host-injections.yml
Oct 30 13:46:57 UTC 2025 test result
😁 SUCCESS (click triangle for details)
ReFrame Summary
[ PASSED ] Ran 0/0 test case(s) from 0 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-15644178.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@casparvl
Copy link
Collaborator Author

casparvl commented Oct 30, 2025

Failure in the cuDNN host injections installations because it doesn't contain ptx code (fixed in EESSI/software-layer-scripts@e25b625 en bf2fc9c)

Also, another failure:

ERROR: Failed to create directory /cvmfs/software.eessi.io/versions/2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/modules/all: [Errno 30] Read-only file system: '/cvmfs/software.eessi.io/versions/2025.06/software/linux/x86_64/intel/icelake/accel'

Not sure what's wrong here. We may be missing a mkdir -p, because this dir is not there yet since this is the first GPU software we install in this prefix. However, I thought we hit the same issue in 2023.06 and we fixed that - but it's been too long to remember. It might also be that we have a mkdir -p and that this is simply the error it hits when creating that dir...

@casparvl
Copy link
Collaborator Author

Added some extra verbosity EESSI/software-layer-scripts@54bd9ad , let's see

bot: build repo:eessi.io-2025.06-software instance:eessi-bot-surf for:arch=x86_64/intel/icelake,accel=nvidia/cc80

@eessi-bot-surf
Copy link

eessi-bot-surf bot commented Oct 30, 2025

New job on instance eessi-bot-surf for repository eessi.io-2025.06-software
Building on: intel-icelake and accelerator nvidia/cc80
Building for: x86_64/intel/icelake and accelerator nvidia/cc80
Job dir: /projects/eessibot/eessi-bot-surf/jobs/2025.10/pr_1278/15645493

date job status comment
Oct 30 14:26:09 UTC 2025 submitted job id 15645493 will be eligible to start in about 20 seconds
Oct 30 14:26:15 UTC 2025 received job awaits launch by Slurm scheduler
Oct 30 14:26:39 UTC 2025 running job 15645493 is running
Oct 30 15:05:14 UTC 2025 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-15645493.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2025.06-software-linux-x86_64-intel-icelake-accel-nvidia-cc80-17618356610.tar.gzsize: 6872 MiB (7206354023 bytes)
entries: 12679
modules under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/modules/all
CUDA/12.6.0.lua
CUDA/12.8.0.lua
cuDNN/9.10.1.4-CUDA-12.8.0.lua
cuDNN/9.5.0.50-CUDA-12.6.0.lua
software under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/software
CUDA/12.6.0
CUDA/12.8.0
cuDNN/9.10.1.4-CUDA-12.8.0
cuDNN/9.5.0.50-CUDA-12.6.0
reprod directories under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/reprod
CUDA/12.6.0/20251030_143907UTC
CUDA/12.8.0/20251030_144306UTC
cuDNN/9.10.1.4-CUDA-12.8.0/20251030_144727UTC
cuDNN/9.5.0.50-CUDA-12.6.0/20251030_144502UTC
other under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80
2025.06/scripts/gpu_support/nvidia/easystacks/eessi-2025.06-eb-5.1.2-CUDA-host-injections.yml
Oct 30 15:05:14 UTC 2025 test result
😁 SUCCESS (click triangle for details)
ReFrame Summary
[ PASSED ] Ran 0/0 test case(s) from 0 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-15645493.out
✅ no message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@casparvl
Copy link
Collaborator Author

casparvl commented Oct 30, 2025

Making it verbose seems to have solved the issue. That is, of course, impossible, but... things are working now:

mkdir: created directory '/cvmfs/software.eessi.io/versions/2025.06/software/linux/x86_64/intel/icelake/accel'
mkdir: created directory '/cvmfs/software.eessi.io/versions/2025.06/software/linux/x86_64/intel/icelake/accel/nvidia'
mkdir: created directory '/cvmfs/software.eessi.io/versions/2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80'
mkdir: created directory '/cvmfs/software.eessi.io/versions/2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/modules'
mkdir: created directory '/cvmfs/software.eessi.io/versions/2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/modules/all'
...
== COMPLETED: Installation ended successfully (took 3 mins 3 secs)
== Results of the build can be found in the log file(s) /cvmfs/software.eessi.io/versions/2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/software/CUDA/12.6.0/easybuild/easybuild-CUDA-12.6.0-20251030.153905.log.bz2

So maybe this was just one more of unionfs's hickups?

@casparvl
Copy link
Collaborator Author

Let's get all of those host-injections installed...

All bots that run native builds (one architecture per bot is sufficient)

bot: build repo:eessi.io-2025.06-software instance:eessi-bot-vsc-ugent for:arch=x86_64/intel/cascadelake,accel=nvidia/cc70
bot: build repo:eessi.io-2025.06-software instance:eessi-bot-jsc for:arch=aarch64/nvidia/grace,accel=nvidia/cc90

x86_64 and arm archs on AWS bot:

bot: build repo:eessi.io-2025.06-software instance:eessi-bot-mc-aws on:arch=zen2 for:arch=x86_64/amd/zen2,accel=nvidia/cc70
bot: build repo:eessi.io-2025.06-software instance:eessi-bot-mc-aws on:arch=x86_64/generic for:arch=x86_64/generic,accel=nvidia/cc70

@eessi-bot-jsc
Copy link

eessi-bot-jsc bot commented Oct 30, 2025

New job on instance eessi-bot-jsc for repository eessi.io-2025.06-software
Building on: nvidia-grace and accelerator nvidia/cc90
Building for: aarch64/nvidia/grace and accelerator nvidia/cc90
Job dir: /p/project1/ceasybuilders/eessibot/jobs/2025.10/pr_1278/14161581

date job status comment
Oct 30 15:39:49 UTC 2025 submitted job id 14161581 awaits release by job manager
Oct 30 15:40:07 UTC 2025 released job awaits launch by Slurm scheduler
Oct 30 15:41:11 UTC 2025 running job 14161581 is running
Oct 30 17:17:12 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-14161581.out
✅ no message matching FATAL:
❌ found message matching ERROR:
❌ found message matching FAILED:
❌ found message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2025.06-software-linux-aarch64-nvidia-grace-accel-nvidia-cc90-17618429730.tar.gzsize: 5980 MiB (6271255423 bytes)
entries: 8879
modules under 2025.06/software/linux/aarch64/nvidia/grace/accel/nvidia/cc90/modules/all
CUDA/12.6.0.lua
CUDA/12.8.0.lua
cuDNN/9.5.0.50-CUDA-12.6.0.lua
software under 2025.06/software/linux/aarch64/nvidia/grace/accel/nvidia/cc90/software
CUDA/12.6.0
CUDA/12.8.0
cuDNN/9.5.0.50-CUDA-12.6.0
reprod directories under 2025.06/software/linux/aarch64/nvidia/grace/accel/nvidia/cc90/reprod
CUDA/12.6.0/20251030_161626UTC
CUDA/12.8.0/20251030_163743UTC
cuDNN/9.5.0.50-CUDA-12.6.0/20251030_163905UTC
other under 2025.06/software/linux/aarch64/nvidia/grace/accel/nvidia/cc90
2025.06/scripts/gpu_support/nvidia/easystacks/eessi-2025.06-eb-5.1.2-CUDA-host-injections.yml
Oct 30 17:17:12 UTC 2025 test result
😁 SUCCESS (click triangle for details)
ReFrame Summary
[ PASSED ] Ran 0/0 test case(s) from 0 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-14161581.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@eessi-bot-aws
Copy link

eessi-bot-aws bot commented Oct 30, 2025

New job on instance eessi-bot-mc-aws for repository eessi.io-2025.06-software
Building on: amd-zen2
Building for: x86_64/amd/zen2 and accelerator nvidia/cc70
Job dir: /project/def-users/SHARED/jobs/2025.10/pr_1278/100486

date job status comment
Oct 30 15:39:50 UTC 2025 submitted job id 100486 awaits release by job manager
Oct 30 15:40:11 UTC 2025 released job awaits launch by Slurm scheduler
Oct 30 15:46:16 UTC 2025 running job 100486 is running
Oct 30 16:20:40 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-100486.out
✅ no message matching FATAL:
❌ found message matching ERROR:
❌ found message matching FAILED:
❌ found message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2025.06-software-linux-x86_64-amd-zen2-accel-nvidia-cc70-17618404130.tar.gzsize: 6197 MiB (6498851361 bytes)
entries: 12594
modules under 2025.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc70/modules/all
CUDA/12.6.0.lua
CUDA/12.8.0.lua
cuDNN/9.5.0.50-CUDA-12.6.0.lua
software under 2025.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc70/software
CUDA/12.6.0
CUDA/12.8.0
cuDNN/9.5.0.50-CUDA-12.6.0
reprod directories under 2025.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc70/reprod
CUDA/12.6.0/20251030_155004UTC
CUDA/12.8.0/20251030_155518UTC
cuDNN/9.5.0.50-CUDA-12.6.0/20251030_155800UTC
other under 2025.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc70
2025.06/scripts/gpu_support/nvidia/easystacks/eessi-2025.06-eb-5.1.2-CUDA-host-injections.yml
Oct 30 16:20:40 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-100486.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@eessi-bot-aws
Copy link

eessi-bot-aws bot commented Oct 30, 2025

New job on instance eessi-bot-mc-aws for repository eessi.io-2025.06-software
Building on: generic
Building for: x86_64/generic and accelerator nvidia/cc70
Job dir: /project/def-users/SHARED/jobs/2025.10/pr_1278/100487

date job status comment
Oct 30 15:39:56 UTC 2025 submitted job id 100487 awaits release by job manager
Oct 30 15:40:13 UTC 2025 released job awaits launch by Slurm scheduler
Oct 30 15:46:18 UTC 2025 running job 100487 is running
Oct 30 16:33:58 UTC 2025 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-100487.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2025.06-software-linux-x86_64-generic-accel-nvidia-cc70-17618411070.tar.gzsize: 6872 MiB (7206333085 bytes)
entries: 12679
modules under 2025.06/software/linux/x86_64/generic/accel/nvidia/cc70/modules/all
CUDA/12.6.0.lua
CUDA/12.8.0.lua
cuDNN/9.10.1.4-CUDA-12.8.0.lua
cuDNN/9.5.0.50-CUDA-12.6.0.lua
software under 2025.06/software/linux/x86_64/generic/accel/nvidia/cc70/software
CUDA/12.6.0
CUDA/12.8.0
cuDNN/9.10.1.4-CUDA-12.8.0
cuDNN/9.5.0.50-CUDA-12.6.0
reprod directories under 2025.06/software/linux/x86_64/generic/accel/nvidia/cc70/reprod
CUDA/12.6.0/20251030_160705UTC
CUDA/12.8.0/20251030_161220UTC
cuDNN/9.10.1.4-CUDA-12.8.0/20251030_161645UTC
cuDNN/9.5.0.50-CUDA-12.6.0/20251030_161420UTC
other under 2025.06/software/linux/x86_64/generic/accel/nvidia/cc70
2025.06/scripts/gpu_support/nvidia/easystacks/eessi-2025.06-eb-5.1.2-CUDA-host-injections.yml
Oct 30 16:33:58 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-100487.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@casparvl casparvl added 2025.06-software.eessi.io 2025.06 version of software.eessi.io accel:nvidia labels Oct 30, 2025
@casparvl
Copy link
Collaborator Author

casparvl commented Oct 30, 2025

Edit: not sure why the previous build failed. The installations in the host_injections failed with a message that the lock file was already present. That's very strange, there should not be a lock file in the host_injections...

bot: build repo:eessi.io-2025.06-software instance:eessi-bot-mc-aws on:arch=zen2 for:arch=x86_64/amd/zen2,accel=nvidia/cc70

@eessi-bot-aws
Copy link

eessi-bot-aws bot commented Oct 30, 2025

New job on instance eessi-bot-mc-aws for repository eessi.io-2025.06-software
Building on: amd-zen2
Building for: x86_64/amd/zen2 and accelerator nvidia/cc70
Job dir: /project/def-users/SHARED/jobs/2025.10/pr_1278/100489

date job status comment
Oct 30 21:20:43 UTC 2025 submitted job id 100489 awaits release by job manager
Oct 30 21:21:32 UTC 2025 released job awaits launch by Slurm scheduler
Oct 30 21:22:34 UTC 2025 running job 100489 is running
Oct 30 21:50:35 UTC 2025 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-100489.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2025.06-software-linux-x86_64-amd-zen2-accel-nvidia-cc70-17618601690.tar.gzsize: 6872 MiB (7206289626 bytes)
entries: 12679
modules under 2025.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc70/modules/all
CUDA/12.6.0.lua
CUDA/12.8.0.lua
cuDNN/9.10.1.4-CUDA-12.8.0.lua
cuDNN/9.5.0.50-CUDA-12.6.0.lua
software under 2025.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc70/software
CUDA/12.6.0
CUDA/12.8.0
cuDNN/9.10.1.4-CUDA-12.8.0
cuDNN/9.5.0.50-CUDA-12.6.0
reprod directories under 2025.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc70/reprod
CUDA/12.6.0/20251030_212519UTC
CUDA/12.8.0/20251030_212935UTC
cuDNN/9.10.1.4-CUDA-12.8.0/20251030_213418UTC
cuDNN/9.5.0.50-CUDA-12.6.0/20251030_213140UTC
other under 2025.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc70
2025.06/scripts/gpu_support/nvidia/easystacks/eessi-2025.06-eb-5.1.2-CUDA-host-injections.yml
Oct 30 21:50:35 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-100489.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@casparvl
Copy link
Collaborator Author

Oh crap, I see the issue, the other build was x86_64/generic, while intended to start it on ARM...

bot: build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws for:arch=aarch64/generic

@eessi-bot-aws
Copy link

eessi-bot-aws bot commented Oct 30, 2025

New job on instance eessi-bot-mc-aws for repository eessi.io-2023.06-software
Building on: generic
Building for: aarch64/generic
Job dir: /project/def-users/SHARED/jobs/2025.10/pr_1278/100490

date job status comment
Oct 30 21:24:47 UTC 2025 submitted job id 100490 awaits release by job manager
Oct 30 21:25:39 UTC 2025 released job awaits launch by Slurm scheduler
Oct 30 21:30:47 UTC 2025 running job 100490 is running
Oct 30 21:33:55 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-100490.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-aarch64-generic-17618598470.tar.gzsize: 0 MiB (45 bytes)
entries: 0
modules under 2023.06/software/linux/aarch64/generic/modules/all
no module files in tarball
software under 2023.06/software/linux/aarch64/generic/software
no software packages in tarball
reprod directories under 2023.06/software/linux/aarch64/generic/reprod
no reprod directories in tarball
other under 2023.06/software/linux/aarch64/generic
no other files in tarball
Oct 30 21:33:55 UTC 2025 test result
😁 SUCCESS (click triangle for details)
ReFrame Summary
[ OK ] ( 1/10) EESSI_LAMMPS_lj %device_type=cpu %module_name=LAMMPS/29Aug2024-foss-2023b-kokkos %scale=1_node /aeb2d9df @BotBuildTests:aarch64_generic+default
P: perf: 699.891 timesteps/s (r:0, l:None, u:None)
[ OK ] ( 2/10) EESSI_LAMMPS_lj %device_type=cpu %module_name=LAMMPS/2Aug2023_update2-foss-2023a-kokkos %scale=1_node /04ff9ece @BotBuildTests:aarch64_generic+default
P: perf: 697.918 timesteps/s (r:0, l:None, u:None)
[ OK ] ( 3/10) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_allreduce %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node %device_type=cpu /775175bf @BotBuildTests:aarch64_generic+default
P: latency: 3.24 us (r:0, l:None, u:None)
[ OK ] ( 4/10) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_allreduce %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node %device_type=cpu /52707c40 @BotBuildTests:aarch64_generic+default
P: latency: 3.47 us (r:0, l:None, u:None)
[ OK ] ( 5/10) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_alltoall %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node %device_type=cpu /b1aacda9 @BotBuildTests:aarch64_generic+default
P: latency: 5.51 us (r:0, l:None, u:None)
[ OK ] ( 6/10) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_alltoall %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node %device_type=cpu /c6bad193 @BotBuildTests:aarch64_generic+default
P: latency: 5.57 us (r:0, l:None, u:None)
[ OK ] ( 7/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_latency %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node /15cad6c4 @BotBuildTests:aarch64_generic+default
P: latency: 0.44 us (r:0, l:None, u:None)
[ OK ] ( 8/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_latency %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node /6672deda @BotBuildTests:aarch64_generic+default
P: latency: 0.46 us (r:0, l:None, u:None)
[ OK ] ( 9/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_bw %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node /2a9a47b1 @BotBuildTests:aarch64_generic+default
P: bandwidth: 20802.34 MB/s (r:0, l:None, u:None)
[ OK ] (10/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_bw %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node /1b24ab8e @BotBuildTests:aarch64_generic+default
P: bandwidth: 20541.79 MB/s (r:0, l:None, u:None)
[ PASSED ] Ran 10/10 test case(s) from 10 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-100490.out
✅ no message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@casparvl
Copy link
Collaborator Author

bot: build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws for:arch=aarch64/generic

@eessi-bot-aws
Copy link

eessi-bot-aws bot commented Oct 30, 2025

New job on instance eessi-bot-mc-aws for repository eessi.io-2023.06-software
Building on: generic
Building for: aarch64/generic
Job dir: /project/def-users/SHARED/jobs/2025.10/pr_1278/100491

date job status comment
Oct 30 21:37:29 UTC 2025 submitted job id 100491 awaits release by job manager
Oct 30 21:38:02 UTC 2025 released job awaits launch by Slurm scheduler
Oct 30 21:39:07 UTC 2025 running job 100491 is running
Oct 30 21:41:12 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-100491.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-aarch64-generic-17618603120.tar.gzsize: 0 MiB (45 bytes)
entries: 0
modules under 2023.06/software/linux/aarch64/generic/modules/all
no module files in tarball
software under 2023.06/software/linux/aarch64/generic/software
no software packages in tarball
reprod directories under 2023.06/software/linux/aarch64/generic/reprod
no reprod directories in tarball
other under 2023.06/software/linux/aarch64/generic
no other files in tarball
Oct 30 21:41:12 UTC 2025 test result
😁 SUCCESS (click triangle for details)
ReFrame Summary
[ OK ] ( 1/10) EESSI_LAMMPS_lj %device_type=cpu %module_name=LAMMPS/29Aug2024-foss-2023b-kokkos %scale=1_node /aeb2d9df @BotBuildTests:aarch64_generic+default
P: perf: 696.808 timesteps/s (r:0, l:None, u:None)
[ OK ] ( 2/10) EESSI_LAMMPS_lj %device_type=cpu %module_name=LAMMPS/2Aug2023_update2-foss-2023a-kokkos %scale=1_node /04ff9ece @BotBuildTests:aarch64_generic+default
P: perf: 706.728 timesteps/s (r:0, l:None, u:None)
[ OK ] ( 3/10) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_allreduce %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node %device_type=cpu /775175bf @BotBuildTests:aarch64_generic+default
P: latency: 3.51 us (r:0, l:None, u:None)
[ OK ] ( 4/10) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_allreduce %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node %device_type=cpu /52707c40 @BotBuildTests:aarch64_generic+default
P: latency: 3.5 us (r:0, l:None, u:None)
[ OK ] ( 5/10) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_alltoall %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node %device_type=cpu /b1aacda9 @BotBuildTests:aarch64_generic+default
P: latency: 5.45 us (r:0, l:None, u:None)
[ OK ] ( 6/10) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_alltoall %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node %device_type=cpu /c6bad193 @BotBuildTests:aarch64_generic+default
P: latency: 5.62 us (r:0, l:None, u:None)
[ OK ] ( 7/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_latency %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node /15cad6c4 @BotBuildTests:aarch64_generic+default
P: latency: 0.45 us (r:0, l:None, u:None)
[ OK ] ( 8/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_latency %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node /6672deda @BotBuildTests:aarch64_generic+default
P: latency: 0.44 us (r:0, l:None, u:None)
[ OK ] ( 9/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_bw %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node /2a9a47b1 @BotBuildTests:aarch64_generic+default
P: bandwidth: 20690.98 MB/s (r:0, l:None, u:None)
[ OK ] (10/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_bw %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node /1b24ab8e @BotBuildTests:aarch64_generic+default
P: bandwidth: 20851.54 MB/s (r:0, l:None, u:None)
[ PASSED ] Ran 10/10 test case(s) from 10 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-100491.out
✅ no message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@casparvl
Copy link
Collaborator Author

Wrong version...

bot: build repo:eessi.io-2025.06-software instance:eessi-bot-mc-aws for:arch=aarch64/generic

@eessi-bot-aws
Copy link

eessi-bot-aws bot commented Nov 12, 2025

New job on instance eessi-bot-mc-aws for repository eessi.io-2025.06-software
Building on: amd-zen3
Building for: x86_64/amd/zen3 and accelerator nvidia/cc100
Job dir: /project/def-users/SHARED/jobs/2025.11/pr_1278/103851

date job status comment
Nov 12 09:28:10 UTC 2025 submitted job id 103851 awaits release by job manager
Nov 12 09:28:55 UTC 2025 released job awaits launch by Slurm scheduler
Nov 12 09:30:56 UTC 2025 running job 103851 is running
Nov 12 10:38:56 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-103851.out
✅ no message matching FATAL:
❌ found message matching ERROR:
❌ found message matching FAILED:
❌ found message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.* created!
Artefacts
eessi-2025.06-software-linux-x86_64-amd-zen3-accel-nvidia-cc100-17629429940.tar.zstsize: 0 MiB (22 bytes)
entries: 0
modules under 2025.06/software/linux/x86_64/amd/zen3/accel/nvidia/cc100/modules/all
no module files in tarball
software under 2025.06/software/linux/x86_64/amd/zen3/accel/nvidia/cc100/software
no software packages in tarball
reprod directories under 2025.06/software/linux/x86_64/amd/zen3/accel/nvidia/cc100/reprod
no reprod directories in tarball
other under 2025.06/software/linux/x86_64/amd/zen3/accel/nvidia/cc100
no other files in tarball
Nov 12 10:38:58 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-103851.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@eessi-bot-aws
Copy link

eessi-bot-aws bot commented Nov 12, 2025

New job on instance eessi-bot-mc-aws for repository eessi.io-2025.06-software
Building on: amd-zen4
Building for: x86_64/amd/zen4 and accelerator nvidia/cc100
Job dir: /project/def-users/SHARED/jobs/2025.11/pr_1278/103852

date job status comment
Nov 12 09:28:15 UTC 2025 submitted job id 103852 awaits release by job manager
Nov 12 09:28:59 UTC 2025 released job awaits launch by Slurm scheduler
Nov 12 09:30:59 UTC 2025 running job 103852 is running
Nov 12 10:39:05 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-103852.out
✅ no message matching FATAL:
❌ found message matching ERROR:
❌ found message matching FAILED:
❌ found message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.* created!
Artefacts
eessi-2025.06-software-linux-x86_64-amd-zen4-accel-nvidia-cc100-17629431680.tar.zstsize: 0 MiB (22 bytes)
entries: 0
modules under 2025.06/software/linux/x86_64/amd/zen4/accel/nvidia/cc100/modules/all
no module files in tarball
software under 2025.06/software/linux/x86_64/amd/zen4/accel/nvidia/cc100/software
no software packages in tarball
reprod directories under 2025.06/software/linux/x86_64/amd/zen4/accel/nvidia/cc100/reprod
no reprod directories in tarball
other under 2025.06/software/linux/x86_64/amd/zen4/accel/nvidia/cc100
no other files in tarball
Nov 12 10:39:06 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-103852.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@eessi-bot-jsc
Copy link

eessi-bot-jsc bot commented Nov 12, 2025

New job on instance eessi-bot-jsc for repository eessi.io-2025.06-software
Building on: nvidia-grace
Building for: aarch64/nvidia/grace and accelerator nvidia/cc100
Job dir: /p/project1/ceasybuilders/eessibot/jobs/2025.11/pr_1278/14204554

date job status comment
Nov 12 09:28:17 UTC 2025 submitted job id 14204554 awaits release by job manager
Nov 12 09:29:10 UTC 2025 released job awaits launch by Slurm scheduler
Nov 12 09:30:16 UTC 2025 running job 14204554 is running

@eessi-bot-aws
Copy link

eessi-bot-aws bot commented Nov 12, 2025

New job on instance eessi-bot-mc-aws for repository eessi.io-2025.06-software
Building on: intel-skylake_avx512
Building for: x86_64/intel/skylake_avx512 and accelerator nvidia/cc100
Job dir: /project/def-users/SHARED/jobs/2025.11/pr_1278/103853

date job status comment
Nov 12 09:28:22 UTC 2025 submitted job id 103853 awaits release by job manager
Nov 12 09:29:14 UTC 2025 released job awaits launch by Slurm scheduler
Nov 12 09:31:24 UTC 2025 running job 103853 is running
Nov 12 10:29:13 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-103853.out
✅ no message matching FATAL:
❌ found message matching ERROR:
❌ found message matching FAILED:
❌ found message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.* created!
Artefacts
eessi-2025.06-software-linux-x86_64-intel-skylake_avx512-accel-nvidia-cc100-17629424990.tar.zstsize: 0 MiB (22 bytes)
entries: 0
modules under 2025.06/software/linux/x86_64/intel/skylake_avx512/accel/nvidia/cc100/modules/all
no module files in tarball
software under 2025.06/software/linux/x86_64/intel/skylake_avx512/accel/nvidia/cc100/software
no software packages in tarball
reprod directories under 2025.06/software/linux/x86_64/intel/skylake_avx512/accel/nvidia/cc100/reprod
no reprod directories in tarball
other under 2025.06/software/linux/x86_64/intel/skylake_avx512/accel/nvidia/cc100
no other files in tarball
Nov 12 10:29:16 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-103853.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@eessi-bot-aws
Copy link

eessi-bot-aws bot commented Nov 12, 2025

New job on instance eessi-bot-mc-aws for repository eessi.io-2025.06-software
Building on: intel-haswell
Building for: x86_64/intel/haswell and accelerator nvidia/cc100
Job dir: /project/def-users/SHARED/jobs/2025.11/pr_1278/103854

date job status comment
Nov 12 09:28:28 UTC 2025 submitted job id 103854 awaits release by job manager
Nov 12 09:29:07 UTC 2025 released job awaits launch by Slurm scheduler
Nov 12 09:31:11 UTC 2025 running job 103854 is running
Nov 12 10:35:48 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-103854.out
✅ no message matching FATAL:
❌ found message matching ERROR:
❌ found message matching FAILED:
❌ found message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.* created!
Artefacts
eessi-2025.06-software-linux-x86_64-intel-haswell-accel-nvidia-cc100-17629429780.tar.zstsize: 0 MiB (22 bytes)
entries: 0
modules under 2025.06/software/linux/x86_64/intel/haswell/accel/nvidia/cc100/modules/all
no module files in tarball
software under 2025.06/software/linux/x86_64/intel/haswell/accel/nvidia/cc100/software
no software packages in tarball
reprod directories under 2025.06/software/linux/x86_64/intel/haswell/accel/nvidia/cc100/reprod
no reprod directories in tarball
other under 2025.06/software/linux/x86_64/intel/haswell/accel/nvidia/cc100
no other files in tarball
Nov 12 10:35:58 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-103854.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@eessi-bot-aws
Copy link

eessi-bot-aws bot commented Nov 12, 2025

New job on instance eessi-bot-mc-aws for repository eessi.io-2025.06-software
Building on: intel-icelake
Building for: x86_64/intel/icelake and accelerator nvidia/cc100
Job dir: /project/def-users/SHARED/jobs/2025.11/pr_1278/103855

date job status comment
Nov 12 09:28:33 UTC 2025 submitted job id 103855 awaits release by job manager
Nov 12 09:29:11 UTC 2025 released job awaits launch by Slurm scheduler
Nov 12 09:31:16 UTC 2025 running job 103855 is running
Nov 12 10:36:21 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-103855.out
✅ no message matching FATAL:
❌ found message matching ERROR:
❌ found message matching FAILED:
❌ found message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.* created!
Artefacts
eessi-2025.06-software-linux-x86_64-intel-icelake-accel-nvidia-cc100-17629427470.tar.zstsize: 0 MiB (22 bytes)
entries: 0
modules under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc100/modules/all
no module files in tarball
software under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc100/software
no software packages in tarball
reprod directories under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc100/reprod
no reprod directories in tarball
other under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc100
no other files in tarball
Nov 12 10:36:23 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-103855.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@eessi-bot-aws
Copy link

eessi-bot-aws bot commented Nov 12, 2025

New job on instance eessi-bot-mc-aws for repository eessi.io-2025.06-software
Building on: intel-cascadelake
Building for: x86_64/intel/cascadelake and accelerator nvidia/cc100
Job dir: /project/def-users/SHARED/jobs/2025.11/pr_1278/103856

date job status comment
Nov 12 09:28:39 UTC 2025 submitted job id 103856 awaits release by job manager
Nov 12 09:29:03 UTC 2025 released job awaits launch by Slurm scheduler
Nov 12 09:31:04 UTC 2025 running job 103856 is running
Nov 12 10:29:04 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-103856.out
✅ no message matching FATAL:
❌ found message matching ERROR:
❌ found message matching FAILED:
❌ found message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.* created!
Artefacts
eessi-2025.06-software-linux-x86_64-intel-cascadelake-accel-nvidia-cc100-17629423320.tar.zstsize: 0 MiB (22 bytes)
entries: 0
modules under 2025.06/software/linux/x86_64/intel/cascadelake/accel/nvidia/cc100/modules/all
no module files in tarball
software under 2025.06/software/linux/x86_64/intel/cascadelake/accel/nvidia/cc100/software
no software packages in tarball
reprod directories under 2025.06/software/linux/x86_64/intel/cascadelake/accel/nvidia/cc100/reprod
no reprod directories in tarball
other under 2025.06/software/linux/x86_64/intel/cascadelake/accel/nvidia/cc100
no other files in tarball
Nov 12 10:29:05 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-103856.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@eessi-bot-aws
Copy link

eessi-bot-aws bot commented Nov 12, 2025

New job on instance eessi-bot-mc-aws for repository eessi.io-2025.06-software
Building on: intel-sapphirerapids
Building for: x86_64/intel/sapphirerapids and accelerator nvidia/cc100
Job dir: /project/def-users/SHARED/jobs/2025.11/pr_1278/103857

date job status comment
Nov 12 09:28:53 UTC 2025 submitted job id 103857 awaits release by job manager
Nov 12 09:30:40 UTC 2025 released job awaits launch by Slurm scheduler
Nov 12 09:34:51 UTC 2025 running job 103857 is running
Nov 12 10:41:12 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-103857.out
✅ no message matching FATAL:
❌ found message matching ERROR:
❌ found message matching FAILED:
❌ found message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.* created!
Artefacts
eessi-2025.06-software-linux-x86_64-intel-sapphirerapids-accel-nvidia-cc100-17629437050.tar.zstsize: 0 MiB (22 bytes)
entries: 0
modules under 2025.06/software/linux/x86_64/intel/sapphirerapids/accel/nvidia/cc100/modules/all
no module files in tarball
software under 2025.06/software/linux/x86_64/intel/sapphirerapids/accel/nvidia/cc100/software
no software packages in tarball
reprod directories under 2025.06/software/linux/x86_64/intel/sapphirerapids/accel/nvidia/cc100/reprod
no reprod directories in tarball
other under 2025.06/software/linux/x86_64/intel/sapphirerapids/accel/nvidia/cc100
no other files in tarball
Nov 12 10:41:12 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-103857.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@eessi-bot-aws
Copy link

eessi-bot-aws bot commented Nov 12, 2025

New job on instance eessi-bot-mc-aws for repository eessi.io-2025.06-software
Building on: generic
Building for: x86_64/generic and accelerator nvidia/cc100
Job dir: /project/def-users/SHARED/jobs/2025.11/pr_1278/103858

date job status comment
Nov 12 09:29:02 UTC 2025 submitted job id 103858 awaits release by job manager
Nov 12 09:30:34 UTC 2025 released job awaits launch by Slurm scheduler
Nov 12 09:33:49 UTC 2025 running job 103858 is running
Nov 12 10:39:15 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-103858.out
✅ no message matching FATAL:
❌ found message matching ERROR:
❌ found message matching FAILED:
❌ found message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.* created!
Artefacts
eessi-2025.06-software-linux-x86_64-generic-accel-nvidia-cc100-17629432470.tar.zstsize: 0 MiB (22 bytes)
entries: 0
modules under 2025.06/software/linux/x86_64/generic/accel/nvidia/cc100/modules/all
no module files in tarball
software under 2025.06/software/linux/x86_64/generic/accel/nvidia/cc100/software
no software packages in tarball
reprod directories under 2025.06/software/linux/x86_64/generic/accel/nvidia/cc100/reprod
no reprod directories in tarball
other under 2025.06/software/linux/x86_64/generic/accel/nvidia/cc100
no other files in tarball
Nov 12 10:39:17 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-103858.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@eessi-bot-aws
Copy link

eessi-bot-aws bot commented Nov 12, 2025

New job on instance eessi-bot-mc-aws for repository eessi.io-2025.06-software
Building on: generic
Building for: aarch64/generic and accelerator nvidia/cc100
Job dir: /project/def-users/SHARED/jobs/2025.11/pr_1278/103859

date job status comment
Nov 12 09:29:10 UTC 2025 submitted job id 103859 awaits release by job manager
Nov 12 09:30:19 UTC 2025 released job awaits launch by Slurm scheduler
Nov 12 09:32:57 UTC 2025 running job 103859 is running
Nov 12 10:32:49 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-103859.out
✅ no message matching FATAL:
❌ found message matching ERROR:
❌ found message matching FAILED:
❌ found message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.* created!
Artefacts
eessi-2025.06-software-linux-aarch64-generic-accel-nvidia-cc100-17629426840.tar.zstsize: 0 MiB (22 bytes)
entries: 0
modules under 2025.06/software/linux/aarch64/generic/accel/nvidia/cc100/modules/all
no module files in tarball
software under 2025.06/software/linux/aarch64/generic/accel/nvidia/cc100/software
no software packages in tarball
reprod directories under 2025.06/software/linux/aarch64/generic/accel/nvidia/cc100/reprod
no reprod directories in tarball
other under 2025.06/software/linux/aarch64/generic/accel/nvidia/cc100
no other files in tarball
Nov 12 10:32:49 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-103859.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@eessi-bot-aws
Copy link

eessi-bot-aws bot commented Nov 12, 2025

New job on instance eessi-bot-mc-aws for repository eessi.io-2025.06-software
Building on: neoverse_n1
Building for: aarch64/neoverse_n1 and accelerator nvidia/cc100
Job dir: /project/def-users/SHARED/jobs/2025.11/pr_1278/103860

date job status comment
Nov 12 09:29:17 UTC 2025 submitted job id 103860 awaits release by job manager
Nov 12 09:30:23 UTC 2025 released job awaits launch by Slurm scheduler
Nov 12 09:33:07 UTC 2025 running job 103860 is running
Nov 12 10:32:54 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-103860.out
✅ no message matching FATAL:
❌ found message matching ERROR:
❌ found message matching FAILED:
❌ found message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.* created!
Artefacts
eessi-2025.06-software-linux-aarch64-neoverse_n1-accel-nvidia-cc100-17629425650.tar.zstsize: 0 MiB (22 bytes)
entries: 0
modules under 2025.06/software/linux/aarch64/neoverse_n1/accel/nvidia/cc100/modules/all
no module files in tarball
software under 2025.06/software/linux/aarch64/neoverse_n1/accel/nvidia/cc100/software
no software packages in tarball
reprod directories under 2025.06/software/linux/aarch64/neoverse_n1/accel/nvidia/cc100/reprod
no reprod directories in tarball
other under 2025.06/software/linux/aarch64/neoverse_n1/accel/nvidia/cc100
no other files in tarball
Nov 12 10:32:55 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-103860.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@eessi-bot-aws
Copy link

eessi-bot-aws bot commented Nov 12, 2025

New job on instance eessi-bot-mc-aws for repository eessi.io-2025.06-software
Building on: neoverse_v1
Building for: aarch64/neoverse_v1 and accelerator nvidia/cc100
Job dir: /project/def-users/SHARED/jobs/2025.11/pr_1278/103861

date job status comment
Nov 12 09:29:25 UTC 2025 submitted job id 103861 awaits release by job manager
Nov 12 09:30:27 UTC 2025 released job awaits launch by Slurm scheduler
Nov 12 09:33:13 UTC 2025 running job 103861 is running
Nov 12 10:33:06 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-103861.out
✅ no message matching FATAL:
❌ found message matching ERROR:
❌ found message matching FAILED:
❌ found message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.* created!
Artefacts
eessi-2025.06-software-linux-aarch64-neoverse_v1-accel-nvidia-cc100-17629425980.tar.zstsize: 0 MiB (22 bytes)
entries: 0
modules under 2025.06/software/linux/aarch64/neoverse_v1/accel/nvidia/cc100/modules/all
no module files in tarball
software under 2025.06/software/linux/aarch64/neoverse_v1/accel/nvidia/cc100/software
no software packages in tarball
reprod directories under 2025.06/software/linux/aarch64/neoverse_v1/accel/nvidia/cc100/reprod
no reprod directories in tarball
other under 2025.06/software/linux/aarch64/neoverse_v1/accel/nvidia/cc100
no other files in tarball
Nov 12 10:33:06 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-103861.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@casparvl
Copy link
Collaborator Author

bot: build repo:eessi.io-2025.06-software instance:eessi-bot-mc-aws on:arch=zen2 for:arch=x86_64/amd/zen2,accel=nvidia/cc120
bot: build repo:eessi.io-2025.06-software instance:eessi-bot-mc-aws on:arch=zen3 for:arch=x86_64/amd/zen3,accel=nvidia/cc120
bot: build repo:eessi.io-2025.06-software instance:eessi-bot-mc-aws on:arch=zen4 for:arch=x86_64/amd/zen4,accel=nvidia/cc120
bot: build repo:eessi.io-2025.06-software instance:eessi-bot-mc-aws on:arch=skylake_avx512 for:arch=x86_64/intel/skylake_avx512,accel=nvidia/cc120
bot: build repo:eessi.io-2025.06-software instance:eessi-bot-mc-aws on:arch=haswell for:arch=x86_64/intel/haswell,accel=nvidia/cc120
bot: build repo:eessi.io-2025.06-software instance:eessi-bot-mc-aws on:arch=icelake for:arch=x86_64/intel/icelake,accel=nvidia/cc120
bot: build repo:eessi.io-2025.06-software instance:eessi-bot-mc-aws on:arch=cascadelake for:arch=x86_64/intel/cascadelake,accel=nvidia/cc120
bot: build repo:eessi.io-2025.06-software instance:eessi-bot-mc-aws on:arch=sapphirerapids for:arch=x86_64/intel/sapphirerapids,accel=nvidia/cc120
bot: build repo:eessi.io-2025.06-software instance:eessi-bot-mc-aws on:arch=x86_64/generic for:arch=x86_64/generic,accel=nvidia/cc120
bot: build repo:eessi.io-2025.06-software instance:eessi-bot-mc-aws on:arch=aarch64/generic for:arch=aarch64/generic,accel=nvidia/cc120
bot: build repo:eessi.io-2025.06-software instance:eessi-bot-mc-aws on:arch=neoverse_n1 for:arch=aarch64/neoverse_n1,accel=nvidia/cc120
bot: build repo:eessi.io-2025.06-software instance:eessi-bot-mc-aws on:arch=neoverse_v1 for:arch=aarch64/neoverse_v1,accel=nvidia/cc120
bot: build repo:eessi.io-2025.06-software instance:eessi-bot-jsc on:arch=aarch64/nvidia/grace for:arch=aarch64/nvidia/grace,accel=nvidia/cc120

@eessi-bot-jsc
Copy link

eessi-bot-jsc bot commented Nov 12, 2025

New job on instance eessi-bot-jsc for repository eessi.io-2025.06-software
Building on: nvidia-grace
Building for: aarch64/nvidia/grace and accelerator nvidia/cc120
Job dir: /p/project1/ceasybuilders/eessibot/jobs/2025.11/pr_1278/14204563

date job status comment
Nov 12 09:30:46 UTC 2025 submitted job id 14204563 awaits release by job manager
Nov 12 09:31:22 UTC 2025 released job awaits launch by Slurm scheduler
Nov 12 09:32:37 UTC 2025 running job 14204563 is running

@eessi-bot-aws
Copy link

eessi-bot-aws bot commented Nov 12, 2025

New job on instance eessi-bot-mc-aws for repository eessi.io-2025.06-software
Building on: amd-zen2
Building for: x86_64/amd/zen2 and accelerator nvidia/cc120
Job dir: /project/def-users/SHARED/jobs/2025.11/pr_1278/103862

date job status comment
Nov 12 09:31:30 UTC 2025 submitted job id 103862 awaits release by job manager
Nov 12 09:32:39 UTC 2025 released job awaits launch by Slurm scheduler
Nov 12 09:36:23 UTC 2025 running job 103862 is running
Nov 12 10:38:43 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-103862.out
✅ no message matching FATAL:
❌ found message matching ERROR:
❌ found message matching FAILED:
❌ found message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.* created!
Artefacts
eessi-2025.06-software-linux-x86_64-amd-zen2-accel-nvidia-cc120-17629432140.tar.zstsize: 0 MiB (22 bytes)
entries: 0
modules under 2025.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc120/modules/all
no module files in tarball
software under 2025.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc120/software
no software packages in tarball
reprod directories under 2025.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc120/reprod
no reprod directories in tarball
other under 2025.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc120
no other files in tarball
Nov 12 10:38:46 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-103862.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@eessi-bot-aws
Copy link

eessi-bot-aws bot commented Nov 12, 2025

New job on instance eessi-bot-mc-aws for repository eessi.io-2025.06-software
Building on: amd-zen3
Building for: x86_64/amd/zen3 and accelerator nvidia/cc120
Job dir: /project/def-users/SHARED/jobs/2025.11/pr_1278/103863

date job status comment
Nov 12 09:32:22 UTC 2025 submitted job id 103863 awaits release by job manager
Nov 12 09:32:52 UTC 2025 released job awaits launch by Slurm scheduler
Nov 12 09:36:36 UTC 2025 running job 103863 is running
Nov 12 10:41:04 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-103863.out
✅ no message matching FATAL:
❌ found message matching ERROR:
❌ found message matching FAILED:
❌ found message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.* created!
Artefacts
eessi-2025.06-software-linux-x86_64-amd-zen3-accel-nvidia-cc120-17629437800.tar.zstsize: 0 MiB (22 bytes)
entries: 0
modules under 2025.06/software/linux/x86_64/amd/zen3/accel/nvidia/cc120/modules/all
no module files in tarball
software under 2025.06/software/linux/x86_64/amd/zen3/accel/nvidia/cc120/software
no software packages in tarball
reprod directories under 2025.06/software/linux/x86_64/amd/zen3/accel/nvidia/cc120/reprod
no reprod directories in tarball
other under 2025.06/software/linux/x86_64/amd/zen3/accel/nvidia/cc120
no other files in tarball
Nov 12 10:41:04 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-103863.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@eessi-bot-aws
Copy link

eessi-bot-aws bot commented Nov 12, 2025

New job on instance eessi-bot-mc-aws for repository eessi.io-2025.06-software
Building on: amd-zen4
Building for: x86_64/amd/zen4 and accelerator nvidia/cc120
Job dir: /project/def-users/SHARED/jobs/2025.11/pr_1278/103866

date job status comment
Nov 12 09:36:38 UTC 2025 submitted job id 103866 awaits release by job manager
Nov 12 09:39:03 UTC 2025 released job awaits launch by Slurm scheduler
Nov 12 09:45:07 UTC 2025 running job 103866 is running
Nov 12 10:41:08 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-103866.out
✅ no message matching FATAL:
❌ found message matching ERROR:
❌ found message matching FAILED:
❌ found message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.* created!
Artefacts
eessi-2025.06-software-linux-x86_64-amd-zen4-accel-nvidia-cc120-17629436840.tar.zstsize: 0 MiB (22 bytes)
entries: 0
modules under 2025.06/software/linux/x86_64/amd/zen4/accel/nvidia/cc120/modules/all
no module files in tarball
software under 2025.06/software/linux/x86_64/amd/zen4/accel/nvidia/cc120/software
no software packages in tarball
reprod directories under 2025.06/software/linux/x86_64/amd/zen4/accel/nvidia/cc120/reprod
no reprod directories in tarball
other under 2025.06/software/linux/x86_64/amd/zen4/accel/nvidia/cc120
no other files in tarball
Nov 12 10:41:08 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-103866.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@eessi-bot-aws
Copy link

eessi-bot-aws bot commented Nov 12, 2025

New job on instance eessi-bot-mc-aws for repository eessi.io-2025.06-software
Building on: intel-skylake_avx512
Building for: x86_64/intel/skylake_avx512 and accelerator nvidia/cc120
Job dir: /project/def-users/SHARED/jobs/2025.11/pr_1278/103869

date job status comment
Nov 12 09:47:39 UTC 2025 submitted job id 103869 awaits release by job manager
Nov 12 09:48:10 UTC 2025 released job awaits launch by Slurm scheduler
Nov 12 09:53:58 UTC 2025 running job 103869 is running
Nov 12 10:51:14 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-103869.out
✅ no message matching FATAL:
❌ found message matching ERROR:
❌ found message matching FAILED:
❌ found message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.* created!
Artefacts
eessi-2025.06-software-linux-x86_64-intel-skylake_avx512-accel-nvidia-cc120-17629444090.tar.zstsize: 0 MiB (22 bytes)
entries: 0
modules under 2025.06/software/linux/x86_64/intel/skylake_avx512/accel/nvidia/cc120/modules/all
no module files in tarball
software under 2025.06/software/linux/x86_64/intel/skylake_avx512/accel/nvidia/cc120/software
no software packages in tarball
reprod directories under 2025.06/software/linux/x86_64/intel/skylake_avx512/accel/nvidia/cc120/reprod
no reprod directories in tarball
other under 2025.06/software/linux/x86_64/intel/skylake_avx512/accel/nvidia/cc120
no other files in tarball
Nov 12 10:51:14 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-103869.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@eessi-bot-aws
Copy link

eessi-bot-aws bot commented Nov 12, 2025

New job on instance eessi-bot-mc-aws for repository eessi.io-2025.06-software
Building on: intel-haswell
Building for: x86_64/intel/haswell and accelerator nvidia/cc120
Job dir: /project/def-users/SHARED/jobs/2025.11/pr_1278/103870

date job status comment
Nov 12 09:53:07 UTC 2025 submitted job id 103870 awaits release by job manager
Nov 12 09:55:13 UTC 2025 released job awaits launch by Slurm scheduler
Nov 12 09:59:51 UTC 2025 running job 103870 is running
Nov 12 11:01:35 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-103870.out
✅ no message matching FATAL:
❌ found message matching ERROR:
❌ found message matching FAILED:
❌ found message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.* created!
Artefacts
eessi-2025.06-software-linux-x86_64-intel-haswell-accel-nvidia-cc120-17629449310.tar.zstsize: 0 MiB (22 bytes)
entries: 0
modules under 2025.06/software/linux/x86_64/intel/haswell/accel/nvidia/cc120/modules/all
no module files in tarball
software under 2025.06/software/linux/x86_64/intel/haswell/accel/nvidia/cc120/software
no software packages in tarball
reprod directories under 2025.06/software/linux/x86_64/intel/haswell/accel/nvidia/cc120/reprod
no reprod directories in tarball
other under 2025.06/software/linux/x86_64/intel/haswell/accel/nvidia/cc120
no other files in tarball
Nov 12 11:01:37 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-103870.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@eessi-bot-aws
Copy link

eessi-bot-aws bot commented Nov 12, 2025

New job on instance eessi-bot-mc-aws for repository eessi.io-2025.06-software
Building on: intel-icelake
Building for: x86_64/intel/icelake and accelerator nvidia/cc120
Job dir: /project/def-users/SHARED/jobs/2025.11/pr_1278/103871

date job status comment
Nov 12 09:57:29 UTC 2025 submitted job id 103871 awaits release by job manager
Nov 12 09:58:24 UTC 2025 released job awaits launch by Slurm scheduler
Nov 12 10:03:27 UTC 2025 running job 103871 is running
Nov 12 11:07:14 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-103871.out
✅ no message matching FATAL:
❌ found message matching ERROR:
❌ found message matching FAILED:
❌ found message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.* created!
Artefacts
eessi-2025.06-software-linux-x86_64-intel-icelake-accel-nvidia-cc120-17629452120.tar.zstsize: 0 MiB (22 bytes)
entries: 0
modules under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc120/modules/all
no module files in tarball
software under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc120/software
no software packages in tarball
reprod directories under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc120/reprod
no reprod directories in tarball
other under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc120
no other files in tarball
Nov 12 11:07:15 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-103871.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@eessi-bot-aws
Copy link

eessi-bot-aws bot commented Nov 12, 2025

New job on instance eessi-bot-mc-aws for repository eessi.io-2025.06-software
Building on: intel-cascadelake
Building for: x86_64/intel/cascadelake and accelerator nvidia/cc120
Job dir: /project/def-users/SHARED/jobs/2025.11/pr_1278/103872

date job status comment
Nov 12 10:02:20 UTC 2025 submitted job id 103872 awaits release by job manager
Nov 12 10:04:51 UTC 2025 released job awaits launch by Slurm scheduler
Nov 12 10:08:33 UTC 2025 running job 103872 is running
Nov 12 11:07:02 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-103872.out
✅ no message matching FATAL:
❌ found message matching ERROR:
❌ found message matching FAILED:
❌ found message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.* created!
Artefacts
eessi-2025.06-software-linux-x86_64-intel-cascadelake-accel-nvidia-cc120-17629451710.tar.zstsize: 0 MiB (22 bytes)
entries: 0
modules under 2025.06/software/linux/x86_64/intel/cascadelake/accel/nvidia/cc120/modules/all
no module files in tarball
software under 2025.06/software/linux/x86_64/intel/cascadelake/accel/nvidia/cc120/software
no software packages in tarball
reprod directories under 2025.06/software/linux/x86_64/intel/cascadelake/accel/nvidia/cc120/reprod
no reprod directories in tarball
other under 2025.06/software/linux/x86_64/intel/cascadelake/accel/nvidia/cc120
no other files in tarball
Nov 12 11:07:03 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-103872.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@eessi-bot-aws
Copy link

eessi-bot-aws bot commented Nov 12, 2025

New job on instance eessi-bot-mc-aws for repository eessi.io-2025.06-software
Building on: intel-sapphirerapids
Building for: x86_64/intel/sapphirerapids and accelerator nvidia/cc120
Job dir: /project/def-users/SHARED/jobs/2025.11/pr_1278/103873

date job status comment
Nov 12 10:05:18 UTC 2025 submitted job id 103873 awaits release by job manager
Nov 12 10:07:13 UTC 2025 released job awaits launch by Slurm scheduler
Nov 12 10:16:11 UTC 2025 running job 103873 is running
Nov 12 11:13:54 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-103873.out
✅ no message matching FATAL:
❌ found message matching ERROR:
❌ found message matching FAILED:
❌ found message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.* created!
Artefacts
eessi-2025.06-software-linux-x86_64-intel-sapphirerapids-accel-nvidia-cc120-17629456470.tar.zstsize: 0 MiB (22 bytes)
entries: 0
modules under 2025.06/software/linux/x86_64/intel/sapphirerapids/accel/nvidia/cc120/modules/all
no module files in tarball
software under 2025.06/software/linux/x86_64/intel/sapphirerapids/accel/nvidia/cc120/software
no software packages in tarball
reprod directories under 2025.06/software/linux/x86_64/intel/sapphirerapids/accel/nvidia/cc120/reprod
no reprod directories in tarball
other under 2025.06/software/linux/x86_64/intel/sapphirerapids/accel/nvidia/cc120
no other files in tarball
Nov 12 11:13:54 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-103873.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@eessi-bot-aws
Copy link

eessi-bot-aws bot commented Nov 12, 2025

New job on instance eessi-bot-mc-aws for repository eessi.io-2025.06-software
Building on: generic
Building for: x86_64/generic and accelerator nvidia/cc120
Job dir: /project/def-users/SHARED/jobs/2025.11/pr_1278/103874

date job status comment
Nov 12 10:06:57 UTC 2025 submitted job id 103874 awaits release by job manager
Nov 12 10:07:07 UTC 2025 released job awaits launch by Slurm scheduler
Nov 12 10:12:42 UTC 2025 running job 103874 is running
Nov 12 11:12:35 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-103874.out
✅ no message matching FATAL:
❌ found message matching ERROR:
❌ found message matching FAILED:
❌ found message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.* created!
Artefacts
eessi-2025.06-software-linux-x86_64-generic-accel-nvidia-cc120-17629454030.tar.zstsize: 0 MiB (22 bytes)
entries: 0
modules under 2025.06/software/linux/x86_64/generic/accel/nvidia/cc120/modules/all
no module files in tarball
software under 2025.06/software/linux/x86_64/generic/accel/nvidia/cc120/software
no software packages in tarball
reprod directories under 2025.06/software/linux/x86_64/generic/accel/nvidia/cc120/reprod
no reprod directories in tarball
other under 2025.06/software/linux/x86_64/generic/accel/nvidia/cc120
no other files in tarball
Nov 12 11:12:40 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-103874.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@eessi-bot-aws
Copy link

eessi-bot-aws bot commented Nov 12, 2025

New job on instance eessi-bot-mc-aws for repository eessi.io-2025.06-software
Building on: generic
Building for: aarch64/generic and accelerator nvidia/cc120
Job dir: /project/def-users/SHARED/jobs/2025.11/pr_1278/103875

date job status comment
Nov 12 10:13:36 UTC 2025 submitted job id 103875 awaits release by job manager
Nov 12 10:18:03 UTC 2025 released job awaits launch by Slurm scheduler
Nov 12 10:25:21 UTC 2025 running job 103875 is running
Nov 12 11:13:47 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-103875.out
✅ no message matching FATAL:
❌ found message matching ERROR:
❌ found message matching FAILED:
❌ found message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.* created!
Artefacts
eessi-2025.06-software-linux-aarch64-generic-accel-nvidia-cc120-17629456120.tar.zstsize: 0 MiB (22 bytes)
entries: 0
modules under 2025.06/software/linux/aarch64/generic/accel/nvidia/cc120/modules/all
no module files in tarball
software under 2025.06/software/linux/aarch64/generic/accel/nvidia/cc120/software
no software packages in tarball
reprod directories under 2025.06/software/linux/aarch64/generic/accel/nvidia/cc120/reprod
no reprod directories in tarball
other under 2025.06/software/linux/aarch64/generic/accel/nvidia/cc120
no other files in tarball
Nov 12 11:13:48 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-103875.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@eessi-bot-aws
Copy link

eessi-bot-aws bot commented Nov 12, 2025

New job on instance eessi-bot-mc-aws for repository eessi.io-2025.06-software
Building on: neoverse_n1
Building for: aarch64/neoverse_n1 and accelerator nvidia/cc120
Job dir: /project/def-users/SHARED/jobs/2025.11/pr_1278/103876

date job status comment
Nov 12 10:32:13 UTC 2025 submitted job id 103876 awaits release by job manager
Nov 12 10:37:37 UTC 2025 released job awaits launch by Slurm scheduler
Nov 12 10:40:28 UTC 2025 running job 103876 is running
Nov 12 11:13:50 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-103876.out
✅ no message matching FATAL:
❌ found message matching ERROR:
❌ found message matching FAILED:
❌ found message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.* created!
Artefacts
eessi-2025.06-software-linux-aarch64-neoverse_n1-accel-nvidia-cc120-17629456620.tar.zstsize: 0 MiB (22 bytes)
entries: 0
modules under 2025.06/software/linux/aarch64/neoverse_n1/accel/nvidia/cc120/modules/all
no module files in tarball
software under 2025.06/software/linux/aarch64/neoverse_n1/accel/nvidia/cc120/software
no software packages in tarball
reprod directories under 2025.06/software/linux/aarch64/neoverse_n1/accel/nvidia/cc120/reprod
no reprod directories in tarball
other under 2025.06/software/linux/aarch64/neoverse_n1/accel/nvidia/cc120
no other files in tarball
Nov 12 11:13:50 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-103876.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@eessi-bot-aws
Copy link

eessi-bot-aws bot commented Nov 12, 2025

New job on instance eessi-bot-mc-aws for repository eessi.io-2025.06-software
Building on: neoverse_v1
Building for: aarch64/neoverse_v1 and accelerator nvidia/cc120
Job dir: /project/def-users/SHARED/jobs/2025.11/pr_1278/103877

date job status comment
Nov 12 10:37:15 UTC 2025 submitted job id 103877 awaits release by job manager
Nov 12 10:37:43 UTC 2025 released job awaits launch by Slurm scheduler
Nov 12 10:40:35 UTC 2025 running job 103877 is running
Nov 12 11:13:52 UTC 2025 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-103877.out
✅ no message matching FATAL:
❌ found message matching ERROR:
❌ found message matching FAILED:
❌ found message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.* created!
Artefacts
eessi-2025.06-software-linux-aarch64-neoverse_v1-accel-nvidia-cc120-17629457170.tar.zstsize: 0 MiB (22 bytes)
entries: 0
modules under 2025.06/software/linux/aarch64/neoverse_v1/accel/nvidia/cc120/modules/all
no module files in tarball
software under 2025.06/software/linux/aarch64/neoverse_v1/accel/nvidia/cc120/software
no software packages in tarball
reprod directories under 2025.06/software/linux/aarch64/neoverse_v1/accel/nvidia/cc120/reprod
no reprod directories in tarball
other under 2025.06/software/linux/aarch64/neoverse_v1/accel/nvidia/cc120
no other files in tarball
Nov 12 11:13:52 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-103877.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@casparvl
Copy link
Collaborator Author

casparvl commented Nov 12, 2025

Strange... CC 70/80/90 works, but 100 and 120 give me:

== FAILED: Installation ended unsuccessfully: No matching questions found for current command output, giving up after 1000 seconds! (took 45 mins 47 secs)
== Results of the build can be found in the log file(s) /tmp/eb-3xfzba78/eb-vct8_9wr/easybuild-CUDA-12.6.0-20251112.092936.pqdlx.log
== Summary:
   * [FAILED]  CUDA/12.6.0
ERROR: Installation of CUDA-12.6.0.eb failed: 'No matching questions found for current command output, giving up after 1000 seconds!'

And in the build log:

== 2025-11-12 09:58:39,857 easyblock.py:4718 INFO Running method install_step part of step install
== 2025-11-12 09:58:39,857 run.py:457 INFO run_shell_cmd: command environment of "export LANG=C &&    ./cuda-installer  --silent --samples --samplespath=/eessi_bot_job/.local/easybuild/build/CUDA/12.6.0/system-system --toolkit --toolkitpath=/cvmfs/software.eessi.io/versions/2025.06/sof
tware/linux/x86_64/amd/zen2/software/CUDA/12.6.0 --defaultroot=/cvmfs/software.eessi.io/versions/2025.06/software/linux/x86_64/amd/zen2/software/CUDA/12.6.0  --override" will be saved to /tmp/eb-3xfzba78/eb-vct8_9wr/run-shell-cmd-output/export-oncoxpko
== 2025-11-12 09:58:39,857 run.py:460 INFO run_shell_cmd: Output of "export LANG=C &&    ./cuda-installer  --silent --samples --samplespath=/eessi_bot_job/.local/easybuild/build/CUDA/12.6.0/system-system --toolkit --toolkitpath=/cvmfs/software.eessi.io/versions/2025.06/software/linux/x
86_64/amd/zen2/software/CUDA/12.6.0 --defaultroot=/cvmfs/software.eessi.io/versions/2025.06/software/linux/x86_64/amd/zen2/software/CUDA/12.6.0  --override" will be logged to /tmp/eb-3xfzba78/eb-vct8_9wr/run-shell-cmd-output/export-oncoxpko/out.txt
== 2025-11-12 09:58:39,934 run.py:494 INFO Path to bash that will be used to run shell commands: /cvmfs/software.eessi.io/versions/2025.06/compat/linux/x86_64/bin/bash
== 2025-11-12 09:58:39,934 run.py:508 INFO Running interactive shell command 'export LANG=C &&    ./cuda-installer  --silent --samples --samplespath=/eessi_bot_job/.local/easybuild/build/CUDA/12.6.0/system-system --toolkit --toolkitpath=/cvmfs/software.eessi.io/versions/2025.06/softwar
e/linux/x86_64/amd/zen2/software/CUDA/12.6.0 --defaultroot=/cvmfs/software.eessi.io/versions/2025.06/software/linux/x86_64/amd/zen2/software/CUDA/12.6.0  --override' in /eessi_bot_job/.local/easybuild/build/CUDA/12.6.0/system-system

I don't get why. I don't think the CUDA compute capability is even used by EasyBuild when installing CUDA itself, until the sanity check. Why would it be different between compute capabilities?

@casparvl
Copy link
Collaborator Author

Oh, I also see:

2025-11-12 11:10:45 [ERROR] Value of $EESSI_ACCELERATOR_TARGET_OVERRIDE should match 'accel/nvidia/cc[0-9[0-9]', but it does not: 'accel/nvidia/cc120'
archdetect could not detect any accelerators

I guess we need to update that to match any number of 0-9, and potentially a,f as well.

@casparvl
Copy link
Collaborator Author

Lmod has detected the following error: Incorrect value for
$EESSI_ACCELERATOR_TARGET: accel/nvidia/cc120

@casparvl
Copy link
Collaborator Author

Ah, it seems that because of this no cuda-compute-capability is set for EasyBuild. Maybe that is somehow problematic.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

2025.06-software.eessi.io 2025.06 version of software.eessi.io accel:nvidia

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants