I'm running jobs on our university cluster (regular user, no admin rights), which uses the SLURM scheduling system, and I'm interested in plotting the CPU and memory usage over time, i.e. while the job is running. I know about sacct and sstat, and I was thinking of including these commands in my submission script, e.g. something along the lines of
#!/bin/bash
#SBATCH <options>

# Run the actual job in the background
srun my_program input.in output.out &

# While loop that records resources
JobStatus="$(sacct -j $SLURM_JOB_ID | awk 'FNR == 3 {print $6}')"
FIRST=0
# Sleep time in seconds
STIME=15
while [ "$JobStatus" != "COMPLETED" ]; do
    # Update job status
    JobStatus="$(sacct -j $SLURM_JOB_ID | awk 'FNR == 3 {print $6}')"
    if [ "$JobStatus" == "RUNNING" ]; then
        if [ $FIRST -eq 0 ]; then
            sstat --format=AveCPU,AveRSS,MaxRSS -P -j ${SLURM_JOB_ID} >> usage.txt
            FIRST=1
        else
            sstat --format=AveCPU,AveRSS,MaxRSS -P --noheader -j ${SLURM_JOB_ID} >> usage.txt
        fi
        sleep $STIME
    elif [ "$JobStatus" == "PENDING" ]; then
        sleep $STIME
    else
        sacct -j ${SLURM_JOB_ID} --format=AllocCPUS,ReqMem,MaxRSS,AveRSS,AveDiskRead,AveDiskWrite,ReqCPUS,NTasks,Elapsed,State >> usage.txt
        JobStatus="COMPLETED"
        break
    fi
done
However, I'm not really convinced of this solution:

- sstat unfortunately doesn't show how many CPUs are used at the moment (only the average)
- MaxRSS is also not helpful if I try to record memory usage over time
- there still seems to be some error (the script doesn't stop after the job finishes)

Does anyone have an idea how to do this properly? Maybe even with top or htop instead of sstat? Any help is much appreciated.
Information on all running and pending batch jobs managed by SLURM can be obtained with squeue. Note that information on completed jobs is only retained for a limited period; information on jobs that ran in the past is available via sacct.
Slurm also provides a tool called seff to check the memory utilization and CPU efficiency of completed jobs. Note that for running and failed jobs, the efficiency numbers reported by seff are not reliable, so please use this tool only for successfully completed jobs.
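For reference, typical invocations of these tools look like the following (the job ID is just a placeholder):

squeue -u $USER    # running and pending jobs of the current user
sacct -j <jobid>   # accounting information for a past job
seff <jobid>       # CPU efficiency and memory utilization of a completed job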
Slurm offers a plugin to record a profile of a job (CPU usage, memory usage, even disk/network I/O for some technologies) into an HDF5 file. The file contains a time series for each measure tracked, and you can choose the time resolution.
You can activate it with
#SBATCH --profile=<all|none|[energy[,|task[,|filesystem[,|network]]]]>
See the Slurm documentation of the profiling plugin for details.
To check that this plugin is installed, run

scontrol show config | grep AcctGatherProfileType

It should output AcctGatherProfileType = acct_gather_profile/hdf5.
The files are created in the folder referred to by the ProfileHDF5Dir Slurm configuration parameter (in slurm.conf).
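As a minimal sketch, assuming the HDF5 profiling plugin is installed and configured on your cluster (my_program and the 30-second sampling interval are just placeholders), a submission script using it could look like:

#!/bin/bash
#SBATCH --ntasks=1
#SBATCH --time=01:00:00
#SBATCH --profile=task         # record a CPU/memory time series for each task
#SBATCH --acctg-freq=task=30   # sample every 30 seconds

# Run the program under srun so the accounting/profiling plugins attach to the step
srun my_program input.in output.out

After the job has finished, sh5util -j <jobid> should merge the per-node profile files into a single HDF5 file (something like job_<jobid>.h5) that you can then read and plot, e.g. with h5py.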
As for your script, you could try replacing sstat with an SSH connection to the compute nodes to run ps. Assuming pdsh or clush is installed, you could run something like:

pdsh -j $SLURM_JOB_ID ps -u $USER -o pid,state,cputime,%cpu,rssize,command --columns 100 >> usage.txt

This will give you CPU and memory usage per process.
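If you go that route, a small variation (just a sketch; it assumes pdsh was built with its SLURM module so that -j works) is to prefix each sample with a timestamp, so that usage.txt becomes a time series you can actually plot:

echo "### $(date +%s)" >> usage.txt
pdsh -j $SLURM_JOB_ID ps -u $USER -o pid,state,cputime,%cpu,rssize,command --columns 100 >> usage.txt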
As a final note, your job never terminates simply because it terminates when the while loop terminates, and the while loop terminates when the job terminates... The condition "$JobStatus" == "COMPLETED" will never be observed from within the script: when the job is completed, the script is killed.
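A rough sketch of a way around this (my_program, the field list and the 15-second interval are placeholders): run the monitoring loop in the background and srun in the foreground, then stop the monitor once srun returns, so the script never needs to wait for the COMPLETED state:

#!/bin/bash
#SBATCH <options>

# Background monitor: append a timestamped sample every 15 seconds
(
    while true; do
        echo "### $(date +%s)" >> usage.txt
        sstat --format=AveCPU,AveRSS,MaxRSS -P --noheader -j ${SLURM_JOB_ID} >> usage.txt
        sleep 15
    done
) &
MONITOR_PID=$!

# Foreground job: the script blocks here until the program has finished
srun my_program input.in output.out

# Stop the monitor and record the final accounting data for the step
kill $MONITOR_PID 2>/dev/null
sacct -j ${SLURM_JOB_ID} --format=Elapsed,State,MaxRSS,AveRSS >> usage.txt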