diff --git a/docs.it4i/cs/introduction.md b/docs.it4i/cs/introduction.md index 2798b938f93f6d94ffc96c90557a8eb3037e5ea7..4c59761d35e3e7abf4f03ff1dce0fdf44fa1a170 100644 --- a/docs.it4i/cs/introduction.md +++ b/docs.it4i/cs/introduction.md @@ -1,10 +1,12 @@ # Complementary Systems Complementary systems offer development environment for users -that need to port and optimize their codes and application +that need to port and optimize their code and applications for various hardware architectures and software technologies that are not available on standard clusters. +## Complementary Systems 1 + First stage of complementary systems implementation comprises of these partitions: - compute partition 0 – based on ARM technology - legacy @@ -16,6 +18,17 @@ First stage of complementary systems implementation comprises of these partition  +## Complementary Systems 2 + +Second stage of complementary systems implementation comprises of these partitions: + +- compute partition 6 - based on ARM technology + CUDA programmable GPGPU accelerators on ampere architecture + DPU network processing units +- compute partition 7 - based on IBM Power10 architecture +- compute partition 8 - modern CPU with a very high L3 cache capacity (over 750MB) +- compute partition 9 - virtual GPU accelerated workstations + + + ## Modules and Architecture Availability Complementary systems list available modules automatically based on the detected architecture. diff --git a/docs.it4i/cs/specifications.md b/docs.it4i/cs/specifications.md index bc257f62ad000348116681db0f97a9639f91da51..93ed2efe8ee48888c93a64ef36e25225999e8e1b 100644 --- a/docs.it4i/cs/specifications.md +++ b/docs.it4i/cs/specifications.md @@ -27,7 +27,7 @@ consists of 8 compute nodes with the following per-node parameters: - 1x Infiniband HDR100 interface - connected via 16x PCI-e Gen3 slot to the CPU -## Partition 02 - Intel (Ice Lake, NVDIMMs) <!--- + Bitware FPGAs) --> +## Partition 2 - Intel (Ice Lake, NVDIMMs) <!--- + Bitware FPGAs) --> The partition is based on the Intel Ice Lake x86 architecture. It contains two servers with Intel NVDIMM memories. @@ -64,7 +64,7 @@ FPGA boards support application development using following design flows: - High-Level Synthesis (C/C++) including support for OneAPI - Verilog and VHDL -## Partition 03 - AMD (Milan, MI100 GPUs + Xilinx FPGAs) +## Partition 3 - AMD (Milan, MI100 GPUs + Xilinx FPGAs) The partition is based on two servers equipped with AMD Milan x86 CPUs, AMD GPUs and Xilinx FPGAs architectures and represents an alternative @@ -96,7 +96,7 @@ FPGA boards support application development using following design flows: - Verilog and VHDL - developer tools and libraries for AMD GPUs. -## Partition 04 - Edge Server +## Partition 4 - Edge Server The partition provides overview of the so-called edge computing class of resources with solutions powerful enough to provide data analytic capabilities (both CPU and GPU) @@ -118,7 +118,7 @@ The partition consists of one edge computing server with following parameters: - WiFi 802.11 ac, - LTE connectivity -## Partition 05 - FPGA Synthesis Server +## Partition 5 - FPGA Synthesis Server FPGAs design tools usually run for several hours to one day to generate a final bitstream (logic design) of large FPGA chips. These tools are usually sequential, therefore part of the system is a dedicated server for this task. @@ -132,6 +132,73 @@ This server is used by development tools needed for FPGA boards installed in bot - NVMe local storage - 2x NVMe disks 3.2TB, configured RAID 1 +## Partition 6 - ARM + CUDA GPGU (Ampere) + DPU + +This partition is based on ARM architecture and is equipped with CUDA programmable GPGPU accelerators +based on Ampere architecture and DPU network processing units. +The partition consists of two nodes with the following per-node parameters: + +- Server Gigabyte G242-P36, Ampere Altra Q80-30 (80c, 3.0GHz) +- 512GB DIMM DDR4, 3200MHz, ECC, CL22 +- 2x Micron 7400 PRO 1920GB NVMe M.2 Non-SED Enterprise SSD +- 2x NVIDIA A30 GPU Accelerator +- 2x NVIDIA BlueField-2 E-Series DPU 25GbE Dual-Port SFP56, PCIe Gen4 x16, 16GB DDR + 64, 200Gb Ethernet +- Mellanox ConnectX-5 EN network interface card, 10/25GbE dual-port SFP28, PCIe3.0 x8 +- Mellanox ConnectX-6 VPI adapter card, 100Gb/s (HDR100, EDR IB and 100GbE), single-port QSFP56 + +## Partition 7 - IBM + +The IBM Power10 server is a single-node partition with the following parameters: + +- Server IBM POWER S1022 +- 2x Power10 12-CORE TYPICAL 2.90 TO 4.0 GHZ (MAX) PO +- 512GB DDIMMS, 3200 MHZ, 8GBIT DDR4 +- 2x ENTERPRISE 1.6 TB SSD PCIE4 NVME U.2 MOD +- 2x ENTERPRISE 6.4 TB SSD PCIE4 NVME U.2 MOD +- PCIE3 LP 2-PORT 25/10GB NIC&ROCE SR/CU A + +## Partition 8 - HPE Proliant + +This partition provides a modern CPU with a very large L3 cache. +The goal is to enable users to develop algorithms and libraries +that will efficiently utilize this technology. +The processor is very efficient, for example, for linear algebra on relatively small matrices. +This is a single-node partition with the following parameters: + +- Server HPE Proliant DL 385 Gen10 Plus v2 CTO +- 2x AMD EPYC 7773X Milan-X, 64 cores, 2.2GHz, 768 MB L3 cache +- 16x HPE 16GB (1x+16GB) x4 DDR4-3200 Registered Smart Memory Kit +- 2x 3.84TB NVMe RI SFF BC U.3ST MV SSD +- BCM 57412 10GbE 2p SFP+ OCP3 Adptr +- HPE IB HDR100/EN 100Gb 1p QSFP56 Adptr1 +- HPE Cray Programming Environment for x86 Systems 2 Seats + +## Partition 9 - Virtual GPU Accelerated Workstation + +This partition provides users with a remote/virtual workstation running MS Windows OS. +It offers rich graphical environment with a focus on 3D OpenGL +or RayTracing-based applications with the smallest possible degradation of user experience. +The partition consists of two nodes with the following per-node parameters: + +- Server HPE Proliant DL 385 Gen10 Plus v2 CTO +- 2x AMD EPYC 7413, 24 cores, 2.55GHz +- 16x HPE 32GB 2Rx4 PC4-3200AA-R Smart Kit +- 2x 3.84TB NVMe RI SFF BC U.3ST MV SSD +- BCM 57412 10GbE 2p SFP+ OCP3 Adptr +- 2x NVIDIA A40 48GB GPU Accelerator + +### Available Software + +The following is the list of software available on partiton 09: + +- Academic VMware Horizon 8 Enterprise Term Edition: 10 Concurrent User Pack for 4 year term license; includes SnS +- 8x NVIDIA RTX Virtual Workstation, per concurent user, EDU, perpetual license +- 32x NVIDIA RTX Virtual Workstation, per concurent user, EDU SUMS per year +- 7x Windows Server 2022 Standard - 16 Core License Pack +- 10x Windows Server 2022 - 1 User CAL +- 40x Windows 10/11 Enterprise E3 VDA (Microsoft) per year +- Hardware VMware Horizon management + [1]: https://www.bittware.com/fpga/520n-mx/ [2]: https://www.xilinx.com/products/boards-and-kits/alveo/u250.html#overview [3]: https://www.xilinx.com/products/boards-and-kits/alveo/u280.html#overview diff --git a/docs.it4i/img/cs2_1.png b/docs.it4i/img/cs2_1.png new file mode 100644 index 0000000000000000000000000000000000000000..2003ef2cd083ea3630f0373170216812f25bfd97 Binary files /dev/null and b/docs.it4i/img/cs2_1.png differ diff --git a/docs.it4i/img/cs2_2.png b/docs.it4i/img/cs2_2.png new file mode 100644 index 0000000000000000000000000000000000000000..c7a30d18947d93b9ad9ff01180b772b84172b719 Binary files /dev/null and b/docs.it4i/img/cs2_2.png differ