... | @@ -41,13 +41,31 @@ Here are some self-explanatory job script examples for several softwares/applica |
... | @@ -41,13 +41,31 @@ Here are some self-explanatory job script examples for several softwares/applica |
|
|
|
|
|
### Regular partitions: short, standard, long
|
|
### Regular partitions: short, standard, long
|
|
* Accessible to all users via the uppa account.
|
|
* Accessible to all users via the uppa account.
|
|
* The number of concurrently allocated cores per user cumulated over the three partitions is limited to 64. This means that if you have 64 cores running on short, standard and long (cumulated), your subsequent jobs will pend until some of your running jobs terminate.
|
|
* The number of concurrently allocated cores per user cumulated over the three partitions is limited to 64. This means that if you have 64 cores running on short, standard and long (cumulated), your subsequent jobs will pend until some of your running jobs end.
|
|
|
|
|
|
### Special partitions: free, bigmem, gpu
|
|
### Special partitions: free, bigmem, gpu
|
|
* Every user is member of the free and bigmem accounts. These accounts are needed to access the free and bigmem partitions, respectively.
|
|
* Every user is member of the free and bigmem accounts. These accounts are needed to access the free and bigmem partitions, respectively.
|
|
* The gpu partition is only accessible to the members of the gpu account.
|
|
* The gpu partition is only accessible to the members of the gpu account.
|
|
* The number of concurrently allocated cores per user on the free, bigmem and gpu partitions is not limited.
|
|
* The number of concurrently allocated cores per user on the free, bigmem and gpu partitions is not limited.
|
|
|
|
|
|
|
|
## Slurm user commands
|
|
|
|
Submit a job:
|
|
|
|
```
|
|
|
|
sbatch script.sh
|
|
|
|
```
|
|
|
|
Cancel a job *job_id* (*job_id* is the number provided by Slurm to identify the job):
|
|
|
|
```
|
|
|
|
scancel job_id
|
|
|
|
```
|
|
|
|
Know the state of all the jobs submitted on the cluster:
|
|
|
|
```
|
|
|
|
squeue -u user_name
|
|
|
|
```
|
|
|
|
Know the state of the jobs submitted by the user on the cluster:
|
|
|
|
```
|
|
|
|
squeue -u user_name
|
|
|
|
```
|
|
|
|
|
|
## Error analysis
|
|
## Error analysis
|
|
* QOSMaxCpuPerJobLimit
|
|
* QOSMaxCpuPerJobLimit
|
|
* BeginTime ?
|
|
* BeginTime ?
|
... | | ... | |