Example job scripts
Warning
These scripts and some of the behaviour described here may differ from VSC5 and VSC4.
General remarks
Certain default settings are applied to job scripts automatically; they are intended to simplify job script configuration. If more control over the requested resources is desired, the special flag #ASC --vanilla is available, which prevents the Lua-based job script parsing (see Slurm modes).
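For orientation, a job script is submitted with sbatch and your pending or running jobs can be listed with squeue; the file name myjob.sh below is only a placeholder.
# submit the job script; Slurm prints the assigned job ID
sbatch myjob.sh
# list your own jobs in the queue
squeue -u $USER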
Standard use cases
- One or multiple exclusive CPU node(s) for 1 hour

#!/bin/bash
#SBATCH --job-name "myjob"
#SBATCH -N 1
#SBATCH -p zen4_0768
#SBATCH --qos zen4_0768
#SBATCH --time=01:00:00

- 1 Task/1 Core with 2 GB memory on a CPU node

#!/bin/bash
#SBATCH --job-name "myjob"
#SBATCH -n 1
#SBATCH --mem=2G
#SBATCH -p zen4_0768
#SBATCH --qos zen4_0768
#SBATCH --time=01:00:00

- One exclusive GPU node (with 4 GPUs) for 1 hour

#!/bin/bash
#SBATCH --job-name "myjob"
#SBATCH -N 1
#SBATCH --gres=gpu:4
#SBATCH -p zen4_0768_h100x4
#SBATCH --qos zen4_0768_h100x4
#SBATCH --time=01:00:00

- Two exclusive GPU nodes (with 4 GPUs/node) for 1 hour

#!/bin/bash
#SBATCH --job-name "myjob"
#SBATCH -N 2
#SBATCH --gres=gpu:4
#SBATCH -p zen4_0768_h100x4
#SBATCH --qos zen4_0768_h100x4
#SBATCH --time=01:00:00

- Three quarter GPU node (with 3 GPUs) for 1 hour

#!/bin/bash
#SBATCH --job-name "myjob"
#SBATCH --gres=gpu:3
#SBATCH -p zen4_0768_h100x4
#SBATCH --qos zen4_0768_h100x4
#SBATCH --time=01:00:00

- Half GPU node (with 2 GPUs) for 1 hour

#!/bin/bash
#SBATCH --job-name "myjob"
#SBATCH --gres=gpu:2
#SBATCH -p zen4_0768_h100x4
#SBATCH --qos zen4_0768_h100x4
#SBATCH --time=01:00:00

- Quarter GPU node (with 1 GPU) for 1 hour

#!/bin/bash
#SBATCH --job-name "myjob"
#SBATCH --gres=gpu:1
#SBATCH -p zen4_0768_h100x4
#SBATCH --qos zen4_0768_h100x4
#SBATCH --time=01:00:00
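As a quick sanity check, the resources Slurm actually granted can be printed from inside any of the scripts above. The lines below are only a sketch using standard Slurm environment variables; nvidia-smi is only available on the GPU nodes.
# show the node list and task count of this job
echo "Nodes: ${SLURM_JOB_NODELIST}"
echo "Tasks: ${SLURM_NTASKS:-1}"
# on GPU jobs: list the GPUs visible to the job
nvidia-smi -L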
Note
Only on GPU nodes (zen4_0768_h100x4), and for the time being: to get access to only the physical cores of the system, we advise specifying, in addition to the above:
#SBATCH --threads-per-core=1
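Combined with the requests above, a quarter GPU node restricted to physical cores could look like the following sketch (job name and run time are placeholders):
#!/bin/bash
#SBATCH --job-name "myjob"
#SBATCH --gres=gpu:1
#SBATCH --threads-per-core=1
#SBATCH -p zen4_0768_h100x4
#SBATCH --qos zen4_0768_h100x4
#SBATCH --time=01:00:00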
Info
When using srun on GPU nodes, processes are pinned to the NUMA domain(s) attached to the GPU(s) dedicated to the job. Depending on your application, however, you may have to use:
srun --gpu-bind=none ...
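To check which GPUs each task actually sees, a short diagnostic step can be run inside the job; with --gpu-bind=none every task sees all GPUs of the allocation (nvidia-smi -L simply lists them):
# list the GPUs visible to each task
srun --gpu-bind=none nvidia-smi -L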
Advanced usage (Vanilla SLURM)
If unaltered Slurm behaviour with more individual tuning is desired, the following flag can be used to bypass all modifications introduced by the Lua job script parsing:
#ASC --vanilla
This is an example script for requesting one full GPU node (4 GPUs) with 4 tasks per node and 22 cores per task, using only the physical cores of the NUMA nodes:
#!/bin/bash
#ASC --vanilla
#SBATCH --job-name "myjob"
#SBATCH -N 1
#SBATCH --gres=gpu:4
#SBATCH --ntasks-per-node=4
#SBATCH --cpus-per-task=22
#SBATCH --threads-per-core=1
#SBATCH --qos zen4_0768_h100x4
#SBATCH -p zen4_0768_h100x4
#SBATCH --time=01:00:00
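The script above only defines the allocation (4 × 22 = 88 physical cores in total); it does not contain a launch line. A launch inside this script could look like the sketch below, where ./my_gpu_app is a placeholder for your application:
# start 4 tasks, each with 22 physical cores (./my_gpu_app is a placeholder)
srun ./my_gpu_app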
Particular examples
Example for running LAMMPS on a CPU node:
#!/bin/bash
#SBATCH --job-name lammps_eessi_test
#SBATCH -N 1
#SBATCH -p zen4_0768
#SBATCH --qos zen4_0768
#SBATCH --time=00:05:00
module purge
module load LAMMPS/2Aug2023_update2-foss-2023a-kokkos
srun -n 4 lmp -i in.meam #in.meam is your input file
The same example, but with explicit CPU binding via srun:
#!/bin/bash
#SBATCH --job-name lammps_eessi_test
#SBATCH -N 1
#SBATCH --ntasks-per-node=4
#SBATCH -p zen4_0768
#SBATCH --qos zen4_0768
#SBATCH --time=00:05:00
module purge
module load LAMMPS/2Aug2023_update2-foss-2023a-kokkos
srun --cpu-bind=map_cpu:0,24,48,72 lmp -i in.meam #in.meam is your input file
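To verify where the tasks were pinned, srun's verbose cpu-bind option can be combined with the same map; this is only a diagnostic variant of the command above:
# print the actual CPU binding of each task before starting LAMMPS
srun --cpu-bind=verbose,map_cpu:0,24,48,72 lmp -i in.meam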
Example of running LAMMPS on GPUs:
#!/bin/bash
#SBATCH --job-name lammps_eessi_test
#SBATCH -N 1
#SBATCH -p zen4_0768_h100x4
#SBATCH --qos zen4_0768_h100x4
#SBATCH --gres=gpu:4
#SBATCH --time=00:05:00
module purge
ml LAMMPS/2Aug2023_update2-foss-2023a-kokkos-CUDA-12.1.1
export OMP_PROC_BIND=spread
export OMP_PLACES=threads
srun --gpu-bind=none lmp -k on g 4 -sf kk -in in.meam -pk kokkos newton on neigh half #in.meam is your input file
Example job script for running a singularity container:
#!/bin/bash
#SBATCH -N 2
#SBATCH -p zen4_0768_h100x4
#SBATCH --qos zen4_0768_h100x4
#SBATCH --job-name <NAME>
#SBATCH --gres=gpu:4
#SBATCH --time=24:00:00
srun --ntasks-per-node=4 --cpu-bind=none --mpi=pmix singularity exec --nv --writable-tmpfs -B <MOUNT> <SIF IMAGE> <EXECUTABLE>
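The SIF image referenced above has to exist before the job starts; it can, for example, be pulled in advance on a login node. The image name and registry URL below are purely illustrative:
# pull a container image and convert it to a SIF file (names are placeholders)
singularity pull my_image.sif docker://ubuntu:22.04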