Hi, I keep running out of memory on my jobs. I wanted to know which partition has more than 100GB of memory. I tried using main at 100 GB limit and my job hit that amount. Which specifications should I give as well. This was the seff output from my most recent job:
Job ID: 926978
Cluster: discovery
User/Group: mohazzab/mohazzab
State: OUT_OF_MEMORY (exit code 0)
Cores: 1
CPU Utilized: 03:28:50
CPU Efficiency: 99.74% of 03:29:23 core-walltime
Job Wall-clock time: 03:29:23
Memory Utilized: 97.78 GB
Memory Efficiency: 97.78% of 100.00 GB
It wouldn’t let me run this:
#!/bin/bash
#SBATCH --time=18:30:00
#SBATCH --partition=epyc-64
#SBATCH --mem-per-cpu=256GB
#SBATCH --export=none
#SBATCH --mail-type=END,FAIL
#SBATCH --mail-user=mohazzab@usc.edu
#SBATCH --output=adult_cis.log # Standard output and error log
Please specify which partition I should use. Thanks