Hi there,
I am trying to understand the Slurm allocation algorithm. In particular, I have submitted some jobs that require an a40 GPU, and they are shown as queued, waiting for resources. However, the output of nodeinfo shows that there are multiple (12?) idle GPUs of the type I’ve requested, so I was wondering what I am missing and why my allocations are not being fulfilled. I read on the CARC page that the limit on GPU usage is 36, and I am only allocated 3 right now.
Thanks!
Shushan