Hardware Model | Quantity | Label | Specification |
---|---|---|---|
HUAWEI Tecal RH1288 V2 | 40 | 192.168.2.100-139 | Link |
SUPERMICRO# 118-20 | 4 | 192.168.2.200-203 | Link |
Note:
- The SuperMicro machines are equipped with 4 Nvidia Tesla K40m GPUs.
Hardware Model | Quantity | Label | Specification |
---|---|---|---|
Mellanox SN2100 | 2 (1 lent to NUDT) | - | MLNX 40/100G switches, 16 ports |
Accton AS7712 | 4 | - | Cumulus Linux, 32ports |
Accton AS5712 | 1 | - | Not yet installed |
Wedge32-BF (Barefoot Tofino) | 1 | - | ONL, 32ports |
Facebook Wedge32 | 1 | - | ONL, 32ports |
We have 4 networks for different purposes.
- 4 AS7712 switches as a leaf-spine topology. This network is used to emulate the real data center networking. DSCP based PFC and ECN is enabled on serveral ports to support RoCEv2. The architecture is show as follows:
-
1 Mellanox SN2100 switch for testing the rack-scale function. We currently test RDMA enabled MXNet upon those switches. The switch is also acted as the Ceph storage backend network.
-
1 Barefoot Wedge32-BF switch for testing programmable data plane.
-
1 Facebook Wedge32 for monitoring/diagnostic tools development. Zhaoxiong is in charge of this switch.
MGT IP | NVMe | GPU | NIC | NIC Port 0 | NIC Port 1 | Task |
---|---|---|---|---|---|---|
192.168.2.100 | - | - | Mellanox ConnectX 5 100G | Connect to P4 switch port 1 via Mellanox 100G DAC Cable | X | Lent to Bojie |
192.168.2.101 | - | - | Mellanox ConnectX 5 100G | Connect to P4 switch port 2 via Mellanox 100G DAC Cable | X | Lent to Bojie |
192.168.2.102 | - | - | Mellanox ConnectX 4 100G | 10.200.1.102 Connect to Leaf 0 port 1 via Mellanox 100G DAC Cable | - | Networking Test - Junxue & Jiacheng |
192.168.2.103 | - | - | Mellanox ConnectX 4 100G | 10.200.1.103 Connect to Leaf 0 port 2 via Mellanox 100G DAC Cable | - | Networking Test - Junxue & Jiacheng |
192.168.2.104 | - | - | Mellanox ConnectX 3 | Connect to Leaf 0 port 9 via Mellanox 40G DAC Cable | Connect to Mellanox Switch port 1 via Mellanox 40G DAC Cable | Ceph Storage - Justin & Chaoliang |
192.168.2.105 | - | - | Mellanox ConnectX 3 | Connect to Leaf 0 port 10 via Mellanox 40G DAC Cable | Connect to Mellanox Switch port 2 via Mellanox 40G DAC Cable | Ceph Storage - Justin & Chaoliang |
192.168.2.106 | - | - | Mellanox ConnectX 4 100G | 10.200.1.106 Connect to Leaf 0 port 3 via Mellanox 100G DAC Cable | - | Networking Test - Junxue & Jiacheng |
192.168.2.107 | - | - | Mellanox ConnectX 4 100G | 10.200.2.107 Connect to Leaf 1 port 1 via Mellanox 100G DAC Cable | - | Networking Test - Junxue & Jiacheng |
192.168.2.108 | - | - | Mellanox ConnectX 3 | Connect to Leaf 1 port 9 via Mellanox 40G DAC Cable | Connect to Mellanox Switch port 3 via Mellanox 40G DAC Cable | Ceph Storage - Justin & Chaoliang |
192.168.2.109 | - | - | - | - | X | Ceph Storage - Justin & Chaoliang |
192.168.2.110 ~ 192.168.2.128 | - | - | - | - | - | You can find any available machines to use |
192.168.2.130 | - | - | Mellanox ConnectX 4 50G | Connect to Leaf 0 port 13 via Edge-core 40G DAC Cable | X | Ceph Storage - Justin & Chaoliang |
192.168.2.131 | - | - | Mellanox ConnectX 4 50G | Connect to Leaf 0 port 14 via Edge-core 40G DAC Cable | X | Ceph Storage - Justin & Chaoliang |
192.168.2.132 | - | - | Mellanox ConnectX 4 50G | Connect to Leaf 1 port 13 via Edge-core 40G DAC Cable | X | Ceph Storage - Justin & Chaoliang |
192.168.2.133 | - | - | Mellanox ConnectX 4 25G | Connect to Leaf 1 port 30 via 100G breakout 25G 3M AOC Cable | X | Ceph Storage - Justin & Chaoliang |
192.168.2.134 | - | - | Mellanox ConnectX 4 25G | Connect to Leaf 1 port 30 via 100G breakout 25G 3M AOC Cable | X | Ceph Storage - Justin & Chaoliang |
192.168.2.200 | - | 4 x Tesla K40m | Mellanox ConnectX 4 40/56G | - | Connect to Mellanox Switch port 10 via Mellanox 40G DAC Cable | ML benchmark - Jiacheng/Junxue, disconnected |
192.168.2.201 | - | 4 x Tesla K40m | Mellanox ConnectX 4 40/56G | - | Connect to Mellanox Switch port 11 via Mellanox 40G DAC Cable | ML benchmark - Jiacheng/Junxue, disconnected |
192.168.2.202 | - | 4 x Tesla K40m | Mellanox ConnectX 4 40/56G | - | Connect to Mellanox Switch port 12 via Mellanox 40G DAC Cable | ML benchmark - Jiacheng/Junxue, disconnected |
192.168.2.203 | - | 4 x Tesla K40m | Mellanox ConnectX 4 40/56G | - | Connect to Mellanox Switch port 13 via Mellanox 40G DAC Cable | ML benchmark - Jiacheng/Junxue, disconnected |
NIC | Quantity |
---|---|
Mellanox ConnectX-4 Lx 50GbE one port | 1 |
Mellanox ConnectX-4 Lx 25GbE dual port | 2 |
Mellanox ConnectX-4 Lx 10GbE one port | 4 |
Cable | Quantity |
---|---|
Mellanox 56GB QSFP AOC | 2 |
Mellanox 100GbE to 2x50GbE 3m DAC | 1 |
Mellanox 40Gb 3m DAC | 2 |
Edge-core 10GbE SFP 3m DAC | 4 |
Edge-core 100GbE breakout 25GbE 3m DAC | 1 |
Edge-core 40GbE QSFP 3m DAC | 2 |
Edge-core 100GbE QSFP 3m DAC | 4 |
User | Equipment (Label) | Date from | Purpose |
---|---|---|---|
Gateway | 192.168.2.254 | - | Gateway |
Yu's group | 192.168.2.120,122-125 | now-late Feb. | Lent to prof. Weichuan Yu, ECE |
GPU | 10.40.2.200-203 | - | IPs for internal usage. |
Note:
GPU IPs have the corresponding NICs connected to high speed switches.
- Please keep the usage list up to date.