Slurm Notes

Helper Script

Open XDMoD includes a helper script to pull data from Slurm’s sacct into Open XDMoD’s shredder system. This script can be used in place of the shredder to import data. To shred data for a Slurm cluster use this command with the name of a single cluster that would be used with sacct’s --clusters option:

$ xdmod-slurm-helper -r mycluster

The helper script doesn’t update the aggregate tables, so that must be done after the data has been shredded:

$ xdmod-ingestor

If your sacct executable isn’t in the PATH of the user that will be running the xdmod-slurm-helper command, you can specify the path by modifying the following section in your portal_settings.ini file.

[slurm]
sacct = "/path/to/sacct"

Use this command to display the help text for the Slurm helper script:

$ xdmod-slurm-helper -h

Input Format

If you’d prefer to not use the helper script, you can export data from Slurm into a file manually using the sacct command and then shred that file. The format must be the same as below. Also, the --parsable2, --noheader and --allocations are all required. Replace *cluster* with the name of your resource. It may also be possible to use other options that limit the output.

$ TZ=UTC sacct --clusters *cluster* --allusers \
    --parsable2 --noheader --allocations --duplicates \
    --format jobid,jobidraw,cluster,partition,account,group,gid,\
user,uid,submit,eligible,start,end,elapsed,exitcode,state,nnodes,\
ncpus,reqcpus,reqmem,reqgres,reqtres,timelimit,nodelist,jobname \
    --state CANCELLED,COMPLETED,FAILED,NODE_FAIL,PREEMPTED,TIMEOUT \
    --starttime 2013-01-01T00:00:00 --endtime 2013-01-01T23:59:59 \
    >/tmp/slurm.log

$ xdmod-shredder -r *cluster* -f slurm -i /tmp/slurm.log

NOTE: The time zone used in the output from sacct must be UTC to prevent ambiguities caused by clock changes due to daylight savings. The shredder will assume input times are in UTC regardless of your system time zone.

Unsupported Shredder Features

The xdmod-shredder -d/--dir option was designed to work with the accounting log naming convention used by PBS/TORQUE. If you are not using the same convention (files are named YYYYMMDD corresponding to the date jobs ended), do not use this option.