The DGX-2 machine can be accessed through the scheduler from Salomon login nodes `salomon.it4i.cz`.
The DGX-2 machine can be accessed through the scheduler from Barbora login nodes `barbora.it4i.cz` as a compute node cn202.
The NVIDIA DGX-2 has its own instance of the scheduler, it can be accessed by loading the `DGX-2` module. See [Resource Allocation and Job Execution][1].
## Storage
[1]:job_execution.md
There are three shared file systems on the DGX-2 system: HOME, SCRATCH (LSCRATCH), and PROJECT.
### HOME
The HOME filesystem is realized as an NFS filesystem. This is a shared home from the [Barbora cluster][1].
### SCRATCH
The SCRATCH is realized on an NVME storage. The SCRATCH filesystem is mounted in the `/scratch` directory.
Users may freely create subdirectories and files on the filesystem (`/scratch/user/$USER`).
Accessible capacity is 22TB, shared among all users.
!!! warning
Files on the SCRATCH filesystem that are not accessed for more than 60 days will be automatically deleted.
### PROJECT
The PROJECT data storage is IT4Innovations' central data storage accessible from all clusters.
For more information on accessing PROJECT, its quotas, etc., see the [PROJECT Data Storage][2] section.
To run a job, computational resources of DGX-2 must be allocated.
DGX-2 uses an independent PBS scheduler. To access the scheduler, load the DGX-2 module:
```console
$ml DGX-2
```
## Resources Allocation Policy
The resources are allocated to the job in a fair-share fashion, subject to constraints set by the queue. The queue provides prioritized and exclusive access to computational resources.
***qdgx**, the queue for DGX-2 machine
The queue for the DGX-2 machine is called **qdgx**.
!!! note
Maximum walltime of a job is **48** hours.
!!! note
The qdgx queue is configured to run one job and accept one job in a queue per user.
The qdgx queue is configured to run one job and accept one job in a queue per user with the maximum walltime of a job being **48** hours.
## Job Submission and Execution
...
...
@@ -28,9 +19,9 @@ The `qsub` submits the job into the queue. The command creates a request to the
When allocating computational resources for the job, specify:
1. a queue for your job (the default is **qdgx**)
1. the maximum wall time allocated to your calculation (default is **4 hour**, maximum is **48 hour**)
1. a Jobscript or interactive switch
1. a queue for your job (the default is **qdgx**);
1. the maximum wall time allocated to your calculation (default is **4 hour**, maximum is **48 hour**);
1. a jobscript or interactive switch.
!!! info
You can access the DGX PBS scheduler by loading the "DGX-2" module.
...
...
@@ -40,16 +31,14 @@ Submit the job using the `qsub` command: