Monitoring and Accounting
Once a job is running, the next question is usually where it is, how much time it has left, and whether it finished the way you expected. Slurm gives you a few commands that cover most of that day-to-day checking.
Cluster status
See what partitions and nodes are available with:
sinfo
Active jobs
Check your own jobs with:
squeue -u $USER
That is usually the first command to run when something seems delayed or when you want to confirm which node your allocation landed on.
Completed jobs
List completed jobs for your account with:
sacct -u $USER
To focus on one job:
sacct -j <jobid>
If you want a more detailed accounting line, use:
sacct -j <jobid> --format=JobID,JobName,Account,Partition,QOS,State,Elapsed,AllocTRES,ExitCode
Job inspection
For a scheduler-level summary of a job, this command is usually the most useful:
scontrol show job <jobid>