... | ... | @@ -394,6 +394,7 @@ The following commands are useful: |
|
|
- sinfo
|
|
|
- sinfo -a
|
|
|
- sinfo -l
|
|
|
- sinfo -N -l
|
|
|
|
|
|
```
|
|
|
[user@res-hpc-lo01 ~]$ sinfo
|
... | ... | @@ -415,6 +416,15 @@ gpu up infinite 1-infinite no NO all 1 idl |
|
|
|
|
|
```
|
|
|
|
|
|
```
|
|
|
[user@res-hpc-lo01 ~]$ sinfo -N -l
|
|
|
Thu Jan 23 12:20:46 2020
|
|
|
NODELIST NODES PARTITION STATE CPUS S:C:T MEMORY TMP_DISK WEIGHT AVAIL_FE REASON
|
|
|
res-hpc-exe013 1 all* idle 16 2:8:1 128800 0 1 (null) none
|
|
|
res-hpc-exe014 1 all* idle 24 2:12:1 386800 0 1 (null) none
|
|
|
res-hpc-gpu01 1 gpu idle 48 2:24:1 515000 0 1 (null) none
|
|
|
```
|
|
|
|
|
|
If there are jobs running, you can see the following output:
|
|
|
```
|
|
|
[user@res-hpc-lo01 mpi-benchmarks]$ sinfo
|
... | ... | @@ -577,6 +587,60 @@ salloc: Relinquishing job allocation 267 |
|
|
|
|
|
[user@res-hpc-lo01 ~]$
|
|
|
```
|
|
|
In the example above, we won't run a command so we ended up in the bash environment.
|
|
|
With **exit** we leave the environment and we release the node.
|
|
|
|
|
|
```
|
|
|
[user@res-hpc-lo01 ~]$ salloc -N1 mpirun ./hello1
|
|
|
salloc: Granted job allocation 268
|
|
|
salloc: Waiting for resource configuration
|
|
|
salloc: Nodes res-hpc-exe013 are ready for job
|
|
|
Hello world from process 000 out of 001, processor name res-hpc-exe013.researchlumc.nl
|
|
|
salloc: Relinquishing job allocation 268
|
|
|
salloc: Job allocation 268 has been revoked.
|
|
|
```
|
|
|
Here we allocated 1 node with one core and ran the openmpi compiled "hello1" program.
|
|
|
|
|
|
Now the same with 2 nodes, 16 cores on each machine:
|
|
|
```
|
|
|
[user@res-hpc-lo01 ~]$ salloc -N2 --ntasks-per-node=16 mpirun ./hello1
|
|
|
salloc: Granted job allocation 270
|
|
|
salloc: Waiting for resource configuration
|
|
|
salloc: Nodes res-hpc-exe[013-014] are ready for job
|
|
|
Hello world from process 003 out of 032, processor name res-hpc-exe013.researchlumc.nl
|
|
|
Hello world from process 021 out of 032, processor name res-hpc-exe014.researchlumc.nl
|
|
|
Hello world from process 004 out of 032, processor name res-hpc-exe013.researchlumc.nl
|
|
|
Hello world from process 005 out of 032, processor name res-hpc-exe013.researchlumc.nl
|
|
|
Hello world from process 027 out of 032, processor name res-hpc-exe014.researchlumc.nl
|
|
|
Hello world from process 000 out of 032, processor name res-hpc-exe013.researchlumc.nl
|
|
|
Hello world from process 029 out of 032, processor name res-hpc-exe014.researchlumc.nl
|
|
|
Hello world from process 006 out of 032, processor name res-hpc-exe013.researchlumc.nl
|
|
|
Hello world from process 031 out of 032, processor name res-hpc-exe014.researchlumc.nl
|
|
|
Hello world from process 007 out of 032, processor name res-hpc-exe013.researchlumc.nl
|
|
|
Hello world from process 016 out of 032, processor name res-hpc-exe014.researchlumc.nl
|
|
|
Hello world from process 010 out of 032, processor name res-hpc-exe013.researchlumc.nl
|
|
|
Hello world from process 019 out of 032, processor name res-hpc-exe014.researchlumc.nl
|
|
|
Hello world from process 011 out of 032, processor name res-hpc-exe013.researchlumc.nl
|
|
|
Hello world from process 030 out of 032, processor name res-hpc-exe014.researchlumc.nl
|
|
|
Hello world from process 012 out of 032, processor name res-hpc-exe013.researchlumc.nl
|
|
|
Hello world from process 017 out of 032, processor name res-hpc-exe014.researchlumc.nl
|
|
|
Hello world from process 013 out of 032, processor name res-hpc-exe013.researchlumc.nl
|
|
|
Hello world from process 018 out of 032, processor name res-hpc-exe014.researchlumc.nl
|
|
|
Hello world from process 014 out of 032, processor name res-hpc-exe013.researchlumc.nl
|
|
|
Hello world from process 020 out of 032, processor name res-hpc-exe014.researchlumc.nl
|
|
|
Hello world from process 015 out of 032, processor name res-hpc-exe013.researchlumc.nl
|
|
|
Hello world from process 022 out of 032, processor name res-hpc-exe014.researchlumc.nl
|
|
|
Hello world from process 001 out of 032, processor name res-hpc-exe013.researchlumc.nl
|
|
|
Hello world from process 023 out of 032, processor name res-hpc-exe014.researchlumc.nl
|
|
|
Hello world from process 024 out of 032, processor name res-hpc-exe014.researchlumc.nl
|
|
|
Hello world from process 002 out of 032, processor name res-hpc-exe013.researchlumc.nl
|
|
|
Hello world from process 025 out of 032, processor name res-hpc-exe014.researchlumc.nl
|
|
|
Hello world from process 008 out of 032, processor name res-hpc-exe013.researchlumc.nl
|
|
|
Hello world from process 026 out of 032, processor name res-hpc-exe014.researchlumc.nl
|
|
|
Hello world from process 028 out of 032, processor name res-hpc-exe014.researchlumc.nl
|
|
|
Hello world from process 009 out of 032, processor name res-hpc-exe013.researchlumc.nl
|
|
|
salloc: Relinquishing job allocation 270
|
|
|
```
|
|
|
|
|
|
### Submitting jobs
|
|
|
|
... | ... | |