Skip to content

Record hostname of jobs in Slurm logs & job DB

Thomas Kluyver requested to merge feat/echo-hostname into master

Description

Over the weekend there were problems on some specific nodes. It would occasionally be useful to see which node a job runs on, and it's easy to capture that information.

EDIT: this now also gets hostnames from squeue and stores them in the job database.

How Has This Been Tested?

Tested on Maxwell:

xfel-calibrate jungfrau CORRECT \
  --karabo-da JNGFR03 --receiver-template JNGFR03 \
  --in-folder /gpfs/exfel/exp/FXE/202405/p006640/raw \
  --karabo-id FXE_XAD_JF500K --run 133 \
  --out-folder /gpfs/exfel/data/scratch/kluyvert/jf-corr-p6640-r133 

Tested the job_monitor & serve-overview changes by deploying on max-exfl-cal002.

Types of changes

  • New feature (non-breaking change which adds functionality)

Checklist:

  • My code follows the code style of this project.

Reviewers

@schmidtp @ahmedk

Edited by Thomas Kluyver

Merge request reports