Record hostname of jobs in Slurm logs & job DB
Sep 30, 2024
Thomas Kluyver authored 3b04ddba
Thomas Kluyver authored 80fa2f0d
Thomas Kluyver authored 1b552d94
Over the weekend there were problems on some specific nodes. It would occasionally be useful to see which node a job runs on, and it's easy to capture that information.
EDIT: this now also gets hostnames from squeue and stores them in the job database.
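As a rough illustration of the squeue part, here is a minimal sketch of reading hostnames from squeue and writing them to a database. It is not the code in this MR: the function names, the `slurm_jobs` table and its `hostname` column are assumptions made up for the example. Only the squeue format fields (`%i` job ID, `%T` state, `%N` node list) are standard Slurm.

```python
import sqlite3
import subprocess


def parse_squeue(text):
    """Parse output of `squeue --noheader --format '%i %T %N'`.

    Returns {job_id: (state, hostname)}. The hostname is '' for jobs
    which have not been allocated a node yet (e.g. still pending).
    """
    jobs = {}
    for line in text.splitlines():
        parts = line.split()
        if len(parts) < 2:
            continue  # Blank or malformed line
        hostname = parts[2] if len(parts) > 2 else ''
        jobs[parts[0]] = (parts[1], hostname)
    return jobs


def query_squeue(user):
    """Run squeue for one user and parse the result."""
    res = subprocess.run(
        ['squeue', '--noheader', '--user', user, '--format', '%i %T %N'],
        capture_output=True, text=True, check=True,
    )
    return parse_squeue(res.stdout)


def store_hostnames(conn, jobs):
    """Record known hostnames in a hypothetical slurm_jobs table."""
    with conn:  # Commit on success, roll back on error
        conn.executemany(
            "UPDATE slurm_jobs SET hostname = ? WHERE job_id = ?",
            [(host, job_id) for job_id, (_, host) in jobs.items() if host],
        )
```

Jobs without a node are skipped on update, so a hostname recorded earlier is never overwritten with an empty value.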
Tested on Maxwell:
xfel-calibrate jungfrau CORRECT \
--karabo-da JNGFR03 --receiver-template JNGFR03 \
--in-folder /gpfs/exfel/exp/FXE/202405/p006640/raw \
--karabo-id FXE_XAD_JF500K --run 133 \
--out-folder /gpfs/exfel/data/scratch/kluyvert/jf-corr-p6640-r133
Tested the job_monitor & serve-overview changes by deploying on max-exfl-cal002.