For the past two days, I am trying to create a carc-ondemand jupyterlab notebook.
This is for the gpu partition.
The session gets started and goes to complete state immediately.
I tried multiple options for gpus and I still face this issue.
Kindly help.
11117395 ood/jupyt+ gpu xiangren_+ 1 FAILED 1:0
11117395.ba+ batch xiangren_+ 1 FAILED 1:0
11117395.ex+ extern xiangren_+ 1 COMPLETED 0:0
11117431 ood/jupyt+ gpu jpujara_3+ 1 FAILED 1:0
11117431.ba+ batch jpujara_3+ 1 FAILED 1:0
11117431.ex+ extern jpujara_3+ 1 COMPLETED 0:0
11117445 ood/jupyt+ gpu xiangren_+ 1 FAILED 1:0
11117445.ba+ batch xiangren_+ 1 FAILED 1:0
11117445.ex+ extern xiangren_+ 1 COMPLETED 0:0
11117450 ood/jupyt+ gpu jpujara_3+ 1 FAILED 1:0
11117450.ba+ batch jpujara_3+ 1 FAILED 1:0
11117450.ex+ extern jpujara_3+ 1 COMPLETED 0:0
In the session output.log files, there are errors about module command not found. Looks related to your switch to zsh. Try adding the following in ~/.zshrc:
source /spack/apps/lmod/8.2/init/bash
export MODULEPATH=/spack/apps/lmod/linux-centos7-x86_64/Core
Hi, I’m getting a similar behavior trying to start ondemand notebooks on the qcbr or qcb queues on endeavour, that I have not encountered until today. echo $SHELL gives /bin/bash.
Error message is below, any suggestions for resolving this appreciated!!
==========================================
SLURM_JOB_ID = 8729528
SLURM_JOB_NODELIST = b01-12
TMPDIR = /tmp/SLURM_8729528
Script starting…
Waiting for JupyterLab server to open port 6611…
TIMING - Starting wait at: Wed Oct 5 13:00:57 PDT 2022
TIMING - Starting main script at: Wed Oct 5 13:00:57 PDT 2022
Lmod has detected the following error: These module(s) or extension(s) exist
but cannot be loaded as requested: “python/3.9.2”
Try: “module spider python/3.9.2” to see how to load the module(s).
No modules loaded
TIMING - Starting JupyterLab at: Wed Oct 5 13:00:57 PDT 2022
- jupyter-lab --config=/home1/fudenber/ondemand/data/sys/dashboard/batch_connect/sys/jupyter/output/db27a4ce-956a-4a63-8db4-72622d8e88fe/config.py
/home1/fudenber/ondemand/data/sys/dashboard/batch_connect/sys/jupyter/output/db27a4ce-956a-4a63-8db4-72622d8e88fe/script.sh: line 23: jupyter-lab: command not found
Timed out waiting for JupyterLab server to open port 6611!
TIMING - Wait ended at: Wed Oct 5 13:01:58 PDT 2022
Cleaning up…
I tried adding
module load gcc/8.3.0
module load python/3.9.2
to my .bashrc, but this did not help