... | ... | @@ -92,7 +92,7 @@ The title of the last column displayed by `squeue` is "NODELIST(REASON)": |
* For pending jobs, displays the pending reason:
* **Resources**: the resources requested by the job are not currently available because they are in use by other jobs.
* **Priority**: the job priority is lower than the priority of other jobs.
* **QOSMaxCpuPerUserLimit**: *username* has reached the maximum number of cores a user is allowed to have allocated; the job is waiting for some of *username*'s running jobs to end.
* **BeginTime**: the job's earliest start time has not been reached yet. This can happen when Slurm requeues the job to fix an issue: in this case, Slurm sets a delayed start time for the job.
* **Held state**: the job *job_id* is held by Slurm. To release it, run `scontrol release job_id`.
* **QOSMaxCpuPerJobLimit**: if a job specifies a memory-per-CPU limit that exceeds the partition limit, that job's count of CPUs per task is automatically increased. This may cause the job to exceed the CPU count limit. ***In this case, cancel your job and resubmit it with the correct parameters, otherwise it will remain pending forever***.
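
As a minimal sketch, the pending reasons listed above could be looked up from the shell as follows. The helper name `explain_reason` is hypothetical and its hints only restate the list above. The reason names `JobHeldUser` and `JobHeldAdmin` are an assumption about how the held state is reported on your cluster. The `squeue` call uses the `%i` (job ID) and `%r` (reason) format fields.

```shell
# Hypothetical helper: map a pending reason reported by squeue to a short hint.
explain_reason() {
    case "$1" in
        Resources)             echo "waiting for resources used by other jobs" ;;
        Priority)              echo "other jobs have a higher priority" ;;
        QOSMaxCpuPerUserLimit) echo "per-user core limit reached; waiting for your running jobs to end" ;;
        BeginTime)             echo "earliest start time not reached yet" ;;
        # Assumption: held jobs show one of these two reason names.
        JobHeldUser|JobHeldAdmin) echo "job is held; release it with scontrol release" ;;
        QOSMaxCpuPerJobLimit)  echo "per-job CPU limit exceeded; cancel and resubmit" ;;
        *)                     echo "see the Slurm documentation for this reason" ;;
    esac
}

# List your pending jobs with their reason and a hint (skipped if Slurm is absent).
if command -v squeue >/dev/null 2>&1; then
    squeue -u "$USER" -t PENDING -h -o "%i %r" | while read -r job_id reason; do
        echo "$job_id: $reason ($(explain_reason "$reason"))"
    done
fi
```

Any reason not covered by the list falls through to the generic hint, so the loop never hides a job.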
... | ... | @@ -105,8 +105,8 @@ scontrol show job job_id |
The `sinfo` command displays the current state of compute nodes:
* **STATE=alloc**: the node is fully allocated.
* **STATE=mix**: the node is partly allocated.
* **STATE=idle**: the node is not allocated.
* **STATE=drain**: the node does not accept new jobs, but the jobs currently allocated on the node keep running.
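
A rough sketch of how the node states above could be summarized: the helper name `count_by_state` is hypothetical, and it simply sums the node counts that `sinfo` prints per state. The `%t` (compact state) and `%D` (node count) format fields are used. Nodes that belong to several partitions may be counted more than once, so treat the totals as indicative only.

```shell
# Hypothetical helper: sum node counts per state from "STATE COUNT" lines on stdin.
count_by_state() {
    awk '{ total[$1] += $2 } END { for (s in total) print s, total[s] }' | sort
}

# Print one line per state, e.g. alloc, mix, idle, drain (skipped if Slurm is absent).
if command -v sinfo >/dev/null 2>&1; then
    sinfo -h -o "%t %D" | count_by_state
fi
```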
## Accounting
Slurm is connected to a database that records job accounting data. The `sacct` and `sreport` commands give access to this accounting information.
... | ... | |