It would be great to add something that can sanity check the bot configuration. For example, when creating a cluster with Magic Castle it would be quite easy to attach the wrong type of instance to a partition name (and it would not be easy to notice, the Zen2 nodes are called Standard_HB120-16rs_v2 and the Zen4 nodes are Standard_HB176-24rs_v4 so no obvious hints in the names).
One way to verify the architecture is
[ocaisa@login1 ~]$ srun -n 1 -p x86-64-amd-zen2-node /cvmfs/software.eessi.io/versions/2023.06/init/eessi_archdetect.sh cpupath
x86_64/amd/zen2
[ocaisa@login1 ~]$ srun -n 1 -p x86-64-amd-zen4-node /cvmfs/software.eessi.io/versions/2023.06/init/eessi_archdetect.sh cpupath
x86_64/amd/zen4
[ocaisa@login1 ~]$ srun -n 1 -p aarch64-neoverse-n1-node /cvmfs/software.eessi.io/versions/2023.06/init/eessi_archdetect.sh cpupath
aarch64/neoverse_n1
This would also check that the bot can successfully submit jobs.
You'd also want to verify that the bot can talk to the target repository with the correct permissions.
It would be great to add something that can sanity check the bot configuration. For example, when creating a cluster with Magic Castle it would be quite easy to attach the wrong type of instance to a partition name (and it would not be easy to notice, the Zen2 nodes are called
Standard_HB120-16rs_v2and the Zen4 nodes areStandard_HB176-24rs_v4so no obvious hints in the names).One way to verify the architecture is
This would also check that the bot can successfully submit jobs.
You'd also want to verify that the bot can talk to the target repository with the correct permissions.