Submitting R code to SLURM
Last updated on 2025-09-30 | Edit this page
Overview
Questions
- How do you write a SLURM submission script?
Objectives
- Explain the general structure of a SLURM script
- Explain how and why to request different resources with SLURM
- Give sample template SLURM scripts for submitting R code
SLURM scripts
Now that we understand how to parallelize R code and run it on the command line, let’s put it all together. We’ll write a SLURM script to submit our code to run on compute nodes on the HPCC.
We’ve already seen an example of
a SLURM script in the parallelization section, where
future was submitting SLURM jobs for us. However, if we
want to have more control, we’ll have to understand how to write these
ourselves.
The basic structure of a SLURM script can be split into three parts:
- The #!/bin/bash --loginline
- Resources specifications
- Code you want to run (as if you’re inputting it on the command line)
The #!/bin/bash --login line always needs to be there.
It just tells the system to run your code with bash, which
is what you’re using on the command line.
The second section is where we specify resources. This is done using lines in the form
The options let you specify things like
- The time you need to run your code, e.g.,
#SBATCH --time=01:05:30for 1 hour, 5 minutes, and 30 seconds
- The number of cores you want to run your code on, e.g.,
#SBATCH --cpus-per-task=8for 8 cores
- The number of nodes you need to run your code on, e.g.,
#SBATCH --nodes=2for 2 nodes
- The amount of memory your code will need, e.g.,
#SBATCH --mem=10GBfor 10GB or--mem-per-cpu=750MBfor 750MB per core you ask for
- The node type, e.g., #SBATCH --constraint=intel16to run your code only on intel16 nodes
- The SLURM account you want to use (if you have a buy-in node), e.g.,
#SBATCH --account=my_buyinto activate the buy-in nodes associated to themy_buyinaccount
Finally, you will add your code below these #SBATCH.
This code is exactly what you would enter on the command line to run
your R scripts as we showed in the previous epsiode.
A SLURM script template
BASH
#!/bin/bash --login
#SBATCH --time=00:05:00  # 5 minutes
#SBATCH --cpus-per-task=1  # Use 1 core
#SBATCH --mem-per-cpu=500MB  # Use 500MB of memory per core requested
#SBATCH --nodes=1  # Use 1 node
#SBATCH --constraint=intel16  # Use only intel16 nodes
# Load the R module
module purge
module load R-bundle-CRAN/2023.12-foss-2023a
# Get to our project directory
cd ~/r_workshop
# Run the script
Rscript src/test_sqrt.R
# Bonus: write information about our job in the output file
scontrol show job $SLURM_JOB_IDA template like this will work for you 90% of the time, where all you need to do is set your resources correctly, load the right version of R, set the directory you want to work in, and choose your script.
SLURM script submission
Create a new directory in our R project directory called
slurm, and save the above file there as
single_core.sh. Then submit the script with
sbatch:
The script will sit in the queue until its resources are available. We can check on its status with
After the script has run, we can see its output in a file with a name
like slurm-<jobid>.out in the current working
directory.
OUTPUT
   user  system elapsed
  0.027   0.010   5.042
JobId=48169646 JobName=single_core.R
   UserId=k0068027(2238) GroupId=math(2011) MCS_label=N/A
   Priority=59669 Nice=0 Account=general QOS=k0068027
   JobState=RUNNING Reason=None Dependency=(null)
   Requeue=1 Restarts=0 BatchFlag=1 Reboot=0 ExitCode=0:0
   RunTime=00:00:17 TimeLimit=00:05:00 TimeMin=N/A
   SubmitTime=2025-01-07T17:33:34 EligibleTime=2025-01-07T17:33:34
   AccrueTime=2025-01-07T17:33:34
   StartTime=2025-01-07T17:33:50 EndTime=2025-01-07T17:38:50 Deadline=N/A
   SuspendTime=None SecsPreSuspend=0 LastSchedEval=2025-01-07T17:33:50 Scheduler=Main
   Partition=general-short AllocNode:Sid=dev-amd20:1882606
   ReqNodeList=(null) ExcNodeList=(null)
   NodeList=lac-335
   BatchHost=lac-335
   NumNodes=1 NumCPUs=1 NumTasks=1 CPUs/Task=1 ReqB:S:C:T=0:0:*:*
   ReqTRES=cpu=1,mem=500M,node=1,billing=76
   AllocTRES=cpu=1,mem=500M,node=1,billing=76
   Socks/Node=* NtasksPerN:B:S:C=0:0:*:* CoreSpec=*
   MinCPUsNode=1 MinMemoryCPU=500M MinTmpDiskNode=0
   Features=[intel16|intel18|(amr|acm)|nvf|nal|nif|amd24] DelayBoot=00:00:00
   OverSubscribe=OK Contiguous=0 Licenses=(null) Network=(null)
   Command=/mnt/ufs18/home-237/k0068027/r_workshop/single_core.R
   WorkDir=/mnt/ufs18/home-237/k0068027/r_workshop
   Comment=stdout=/mnt/ufs18/home-237/k0068027/r_workshop/slurm-48169646.out
   StdErr=/mnt/ufs18/home-237/k0068027/r_workshop/slurm-48169646.out
   StdIn=/dev/null
   StdOut=/mnt/ufs18/home-237/k0068027/r_workshop/slurm-48169646.out
   TresPerTask=cpu=1Congratulations! We’ve just completed the workflow for submitting any job on the HPCC.
Submitting a multicore job
Copy your submission script to slurm/multi_core.sh.
Adjust it to request five cores and change it so that you run a copy of
src/test_sqrt_multisession.R with a multicore
backend for future with five workers. Submit the job and
compare the time it took to run with the single core job.
slurm/multi_core.sh:
BASH
#!/bin/bash --login
#SBATCH --time=00:05:00  # 5 minutes
#SBATCH --cpus-per-task=5  # Use 5 cores
#SBATCH --mem-per-cpu=500MB  # Use 500MB of memory per core requested
#SBATCH --nodes=1  # Use 1 node
#SBATCH --constraint=intel16  # Use only intel16 nodes
# Load the R module
module purge
module load R-bundle-CRAN/2023.12-foss-2023a
# Get to our project directory
cd ~/r_workshop
# Run the script
Rscript src/test_sqrt_multicore.R
# Bonus: write information about our job in the output file
scontrol show job $SLURM_JOB_IDsrc/test_sqrt_multicore.R:
R
library(foreach)
library(doFuture)
plan(multicore, workers = 5)
t <- proc.time()
x <- 1:10
z <- foreach(xi = x) %dofuture% {
  Sys.sleep(0.5)
  sqrt(xi)
}
print(proc.time() - t)
Cleaning up the output
Leaving the output in the directory we run the script in will get messy. For the steps below, you will need the list of SLURM job specifications.
- Create the directory resultsin your project directory.
- For the previous job script, change the name for the job allocation
to multicore-sqrt.
- Change it so the output and error files are stored in your project
directory under results/<jobname>-<jobid>.outandresults/<jobname>-<jobid>.errwhere<jobname>is the name you set in the previous step and<jobid>is the number that SLURM assigns your job.
Hint: you can reference the job ID in #SBATCH
lines with %j and the job name with %x.
multi_core.sh:
BASH
#!/bin/bash --login
#SBATCH --time=00:05:00  # 5 minutes
#SBATCH --cpus-per-task=5  # Use 5 cores
#SBATCH --mem-per-cpu=500MB  # Use 500MB of memory per core requested
#SBATCH --nodes=1  # Use 1 node
#SBATCH --constraint=intel16  # Use only intel16 nodes
#SBATCH --job-name=multicore
#SBATCH --output=~/r_workshop/results/%x-%j.out
#SBATCH --error=~/r_workshop/results/%x-%j.err
# Load the R module
module purge
module load R-bundle-CRAN/2023.12-foss-2023a
# Get to our project directory
cd ~/r_workshop
# Run the script
Rscript src/test_sqrt_multicore.RSubmitting SLURM scripts for code that runs on multiple nodes
Using the future package
In the section on writing parallel code across multiple nodes using
the future package, we discussed two approaches: using the
cluster backend, and having future submit
SLURM jobs for you. In the latter case, you would only need to submit
your controller script to SLURM using a simple submission script like
the one above. The batchtools_slurm backend will submit
SLURM jobs for the hard work that you want parallelized (though, as
discussed, you may need to write a template script).
In the former case of using the cluster backend, we had
to tell future which nodes we want it to run on. Luckily
future has SLURM in mind and can query the nodes available
in your SLURM job with
parallelly:availableWorkers(methods = "Slurm"). You can
then use these as the workers when specifying the plan
like
For MPI jobs
MPI is a one of the most popular ways to write code in many languages
that runs on multiple nodes. You may write something that uses it
explicitly (e.g., the Rmpi
or pdbMPI
packages), or implicitly, by using a package that uses MPI behind the
scenes (e.g., doMPI).
Any code using MPI must be called in a special way that lets the code
know about the environment it’s running on. Practically, this means that
you need to preface the Rscript command with
srun with the option --cpus-per-task so MPI
knows how many CPUs it can use per task.
Here is an example script:
BASH
#!/bin/bash --login
#SBATCH --time=00:10:00  # 10 minutes
#SBATCH --tasks=6  # MPI will start 6 versions of the program
#SBATCH --cpus-per-task=8  # Use 8 cores per task (48 in total)
#SBATCH --mem-per-cpu=1GB  # Use 1GB of memory per core requested
#SBATCH --nodes=2  # Distribute the tasks across 2 nodes
#SBATCH --constraint=intel16  # Use only intel16 nodes
# Load the R module
module purge
module load R-bundle-CRAN/2023.12-foss-2023a
# Get to our project directory
cd ~/r_workshop
# Run the script
srun --cpus-per-task=$SLURM_CPUS_PER_TASK Rscript src/some_MPI_script.R
# Bonus: write information about our job in the output file
scontrol show job $SLURM_JOB_IDNotice that we added an #SBATCH line for
tasks and used the $SLURM_CPUS_PER_TASK
variable to set the option for srun. This ensures that
whatever we set in the #SBATCH --cpus-per-task line will be
used by srun.
- A SLURM script requests resources
- Generally, the only code you need in a SLURM script is loading the R
module, changing to the right directory and running your R code with
Rscript
- Check the status of your jobs with squeue --me
