MUSICA partitions and QoS¶
On MUSICA, nodes of the same type of hardware are grouped in to partitions. The quality of service (QoS), defines the maximum run time of a job and the number and type of allocate-able nodes.
Partitions¶
- AMD CPU nodes: The CPU nodes are comprised of 2x AMD Epyc 9654 (96 cores each) with 768GB DDR5 RAM and a local NVMe with 1.92TB.
- GPU nodes: The GPU nodes have 2x AMD Epyc 9654 (96 cores each) with 768GB DDR5 RAM, 4x Nvidia SXM5 H100-94G-700W GPUs, 4x NDR200 infiniband and a local NVMe with 7.68TB.
These are the partitions on MUSICA:
Partition | Nodes | Architecture | CPU | Cores per CPU (physical/with HT) | GPU | RAM | Use |
---|---|---|---|---|---|---|---|
zen4_0768 | 72 | AMD | 2x AMD 9654 | 192/384 | No | 768 GB | The default partition |
zen4_0768_h100x4 | 112 | AMD | 2x AMD 9654 | 192/384 | 4x Nvidia SXM5 H100 | 768 TB | GPU partition |
Tip
Type sinfo -o %P
on any node to see all the available partitions.
andsacctmgr show user $( id -u ) withassoc format=qos%200
for the available QoS.
Quality of service (QoS)¶
The QoS defines the maximum run time of a job and the number and type of allocate-able nodes.
The following QoS are available for all normal (=non private) projects:
QoS name | Gives access to Partition | Hard run time limits | Description |
---|---|---|---|
zen4_0768 | zen4_0768 | 72h (3 days) | Default |
zen4_0768_h100x4 | zen4_0768 | 72h (3 days) | No |
Tip
The QoS that are assigned to a specific user can be viewed with:
sacctmgr show user `id -u` withassoc format=user,defaultaccount,account,qos%40s,defaultqos%20s