Skip to content
Snippets Groups Projects
compute-nodes.md 4.37 KiB
Newer Older
  • Learn to ignore specific revisions
  • Lukáš Krupčík's avatar
    Lukáš Krupčík committed
    # Compute Nodes
    
    
    ## Node Configuration
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
    
    
    Jan Siwiec's avatar
    Jan Siwiec committed
    Anselm is a cluster of x86-64 Intel-based nodes built with the Bull Extreme Computing bullx technology. The cluster contains four types of compute nodes.
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
    
    
    ### Compute Nodes Without Accelerators
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
    * 180 nodes
    * 2880 cores in total
    * two Intel Sandy Bridge E5-2665, 8-core, 2.4GHz processors per node
    * 64 GB of physical memory per node
    * one 500GB SATA 2,5” 7,2 krpm HDD per node
    * bullx B510 blade servers
    * cn[1-180]
    
    ### Compute Nodes With a GPU Accelerator
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
    * 23 nodes
    * 368 cores in total
    * two Intel Sandy Bridge E5-2470, 8-core, 2.3GHz processors per node
    * 96 GB of physical memory per node
    * one 500GB SATA 2,5” 7,2 krpm HDD per node
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
    * GPU accelerator 1x NVIDIA Tesla Kepler K20m per node
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
    * bullx B515 blade servers
    * cn[181-203]
    
    ### Compute Nodes With a MIC Accelerator
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
    * 4 nodes
    * 64 cores in total
    * two Intel Sandy Bridge E5-2470, 8-core, 2.3GHz processors per node
    * 96 GB of physical memory per node
    * one 500GB SATA 2,5” 7,2 krpm HDD per node
    * MIC accelerator 1x Intel Phi 5110P per node
    * bullx B515 blade servers
    * cn[204-207]
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
    
    ### Fat Compute Nodes
    
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
    * 2 nodes
    * 32 cores in total
    * 2 Intel Sandy Bridge E5-2665, 8-core, 2.4GHz processors per node
    * 512 GB of physical memory per node
    
    * two 300GB SAS 3,5” 15krpm HDD (RAID1) per node
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
    * two 100GB SLC SSD per node
    * bullx R423-E3 servers
    * cn[208-209]
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
    
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
    ![](../img/bullxB510.png)
    
    **Anselm bullx B510 servers**
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
    
    
    ### Compute Node Summary
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
    
    
    David Hrbáč's avatar
    David Hrbáč committed
    | Node type                    | Count | Range       | Memory | Cores       | Queues                                    |
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
    | ---------------------------- | ----- | ----------- | ------ | ----------- | --------------------------------------    |
    | Nodes without an accelerator | 180   | cn[1-180]   | 64GB   | 16 @ 2.4GHz | qexp, qprod, qlong, qfree, qprace, qatlas |
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
    | Nodes with a GPU accelerator | 23    | cn[181-203] | 96GB   | 16 @ 2.3GHz | qnvidia, qexp                             |
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
    | Nodes with a MIC accelerator | 4     | cn[204-207] | 96GB   | 16 @ 2.3GHz | qmic, qexp                                |
    | Fat compute nodes            | 2     | cn[208-209] | 512GB  | 16 @ 2.4GHz | qfat, qexp                                |
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
    
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
    ## Processor Architecture
    
    
    Anselm is equipped with Intel Sandy Bridge processors Intel Xeon E5-2665 (nodes without accelerators and fat nodes) and Intel Xeon E5-2470 (nodes with accelerators). The processors support Advanced Vector Extensions (AVX) 256-bit instruction set.
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
    
    ### Intel Sandy Bridge E5-2665 Processor
    
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
    * eight-core
    * speed: 2.4 GHz, up to 3.1 GHz using Turbo Boost Technology
    * peak performance:  19.2 GFLOP/s per core
    * caches:
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
      * L2: 256 KB per core
      * L3: 20 MB per processor
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
    * memory bandwidth at the level of the processor: 51.2 GB/s
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
    
    ### Intel Sandy Bridge E5-2470 Processor
    
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
    * eight-core
    * speed: 2.3 GHz, up to 3.1 GHz using Turbo Boost Technology
    * peak performance:  18.4 GFLOP/s per core
    * caches:
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
      * L2: 256 KB per core
      * L3: 20 MB per processor
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
    * memory bandwidth at the level of the processor: 38.4 GB/s
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
    
    
    Nodes equipped with Intel Xeon E5-2665 CPU have a set PBS resource attribute cpu_freq = 24, nodes equipped with Intel Xeon E5-2470 CPU have set PBS resource attribute cpu_freq = 23.
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
    
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
    ```console
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
    $ qsub -A OPEN-0-0 -q qprod -l select=4:ncpus=16:cpu_freq=24 -I
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
    ```
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
    
    In this example, we allocate 4 nodes, 16 cores at 2.4GHhz per node.
    
    
    Jan Siwiec's avatar
    Jan Siwiec committed
    Intel Turbo Boost Technology is used by default, you can disable it for all nodes of job by using the cpu_turbo_boost resource attribute.
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
    
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
    ```console
    $ qsub -A OPEN-0-0 -q qprod -l select=4:ncpus=16 -l cpu_turbo_boost=0 -I
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
    ```
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
    
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
    ## Memmory Architecture
    
    The cluster contains three types of compute nodes.
    
    ### Compute Nodes Without Accelerators
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
    
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
    * 2 sockets
    * Memory Controllers are integrated into processors.
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
      * 8 DDR3 DIMMs per node
      * 4 DDR3 DIMMs per CPU
      * 1 DDR3 DIMMs per channel
      * Data rate support: up to 1600MT/s
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
    * Populated memory: 8 x 8 GB DDR3 DIMM 1600 MHz
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
    
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
    ### Compute Nodes With a GPU or MIC Accelerator
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
    
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
    * 2 sockets
    * Memory Controllers are integrated into processors.
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
      * 6 DDR3 DIMMs per node
      * 3 DDR3 DIMMs per CPU
      * 1 DDR3 DIMMs per channel
      * Data rate support: up to 1600MT/s
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
    * Populated memory: 6 x 16 GB DDR3 DIMM 1600 MHz
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
    
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
    ### Fat Compute Nodes
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
    
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
    * 2 sockets
    * Memory Controllers are integrated into processors.
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
      * 16 DDR3 DIMMs per node
      * 8 DDR3 DIMMs per CPU
      * 2 DDR3 DIMMs per channel
      * Data rate support: up to 1600MT/s
    
    Lukáš Krupčík's avatar
    Lukáš Krupčík committed
    * Populated memory: 16 x 32 GB DDR3 DIMM 1600 MHz