Appendix. Comparison of old and new TSUBAME¶
Appendix.1. Architecture¶
TSUBAME3.0 | TSUBAME4.0 | |
---|---|---|
Theoretical peak double precision performance | 12.5PFLOPS | 66.8PFLOPS |
Theoretical peak half precision performance | 47.2PFLOPS | 952PFLOPS |
Total main memory | 135TiB | 180TiB |
Total HDD capacity | 15.9PB | 44.2PB |
Total SSD capacity | 327TB | |
Compute nodes | 540 | 240 |
Total cores | 15,120 | 46,080 |
Total GPUs | 2,160 | 960 |
Interconnect | Intel Omni-Path HFI 100Gbps | InfiniBand NDR200 200Gbps |
Interenet | SINET5 100Gbps | SINET6 100Gbps |
Appendix.2. Compute nodes¶
TSUBAME3.0 | TSUBAME4.0 | |
---|---|---|
Computing Unit | Compute node HPE SGI ICE-XA 540 nodes | Compute node HPE Cray XD665 240 nodes |
Components (per node) | ||
CPU | Intel Xeon E5-2680 v4 2.4GHz x 2 Socket | AMD EPYC 9654 2.4GHz x 2 Socket |
Cores/Threads | 14cores / 28threads x 2CPU | 96cores / 192threads x 2CPU |
Memory | 256GiB | 768GiB (DDR5-4800) |
GPU | NVIDIA TESLA P100 for NVlink-Optimized Servers x 4 | NVIDIA H100 SXM5 94GB HBM2e x 4 |
SSD | 2TB | 1.92TB NVMe U.2 SSD |
Interconnect | Intel Omni-Path HFI 100Gbps x 4 | InfiniBand NDR200 200Gbps x 4 |
Appendix.3. Software¶
Appendix.3.1 System software¶
TSUBAME3.0 (as of 2023.4.6) | TSUBAME4.0 | |
---|---|---|
OS | SUSE Linux Enterprise Server 12 SP5 | RedHat Enterprise Linux Server 9.3 |
Job scheduler | Univa Grid Engine 8.6.11 | Altair Grid Engine 2023.1.1 |
Compilers | GCC 4.8.5, 12.2.0 Intel Compiler 23.0.0 NVIDIA HPC SDK 22.2 |
GCC 11.4.1 Intel oneAPI compiler 2024.0 and MKL NVIDIA HPC SDK 24.1 AOCC 4.1.0 |
MPI | Intel MPI 21.8.0 SGI MPT 2.16 OpenMPI 3.1.4 |
Intel MPI 2021.11 OpeMPI 5.0.2 |
CUDA library | 11.0.3 (12.1.0) | 12.3.2 |
CUDA driver | 450.172.01 | 545.23.08 |
OmniPath driver (OPA) | 10.10.3.1.1 | |
InfiniBand driver (OFED) | 23.10-1.1.9 |
Appendix.3.2 Commercial application¶
Software | Description | TSUBAME3.0 | TSUBAME4.0 |
---|---|---|---|
ANSYS | Analysis Software | ○ | ○ |
ABAQUS | Analysis Software | ○ | ○ |
ABACUS CAE | Analysis Software | ○ | ○ |
MSC Nastran | Analysis Software | ○ | |
MSC Patran | Analysis Software | ○ | |
Gaussian | Quantum chemistry calculation program | ○ | ○ |
GaussView | Quantum chemistry calculation program Pre-Post tool | ○ | ○ |
AMBER | Molecular dynamics calculation program | ○ | ○ |
Materials Studio | Chemical Simulation Software | ○ | ○ |
Discovery Studio | Chemical Simulation Software | ○ | ○ |
Mathematica | Mathematical Processing Software | ○ | ○ |
Maple | Mathematical Processing Software | ○ | |
AVS/Express | Visualization | ○ | |
AVS/Express PCE | Visualization | ○ | |
LS-DYNA | Analysis Software | ○ | ○ Included in ANSYS |
COMSOL | Analysis Software | ○ | ○ |
Schrodinger | Chemical Simulation Software | ○ | ○ |
MATLAB | Numerical calculation software | ○ | ○ |
VASP | Quantum molecular dynamics calculation program | ○ | ○ |
Linaro forge(ex:Arm Forge) | Debugger | ○ | ○ |
Intel Compiler | Compiler | ○ | ○ oneAPI |
PGI Compiler | Compiler | ○ | ○ NVIDIA HPC SDK |
Appendix.3.3 Freeware¶
Software | Description | TSUBAME3.0 | TSUBAME4.0 |
---|---|---|---|
GAMESS | Solver Simulator | ○ | ○ |
Tinker | Solver Simulator | ○ | ○ |
GROMACS | Solver Simulator | ○ | ○ |
LAMMPS | Solver Simulator | ○ | ○ |
NAMMD | Solver Simulator | ○ | ○ |
QUANTUM ESPRESSO | Solver Simulator | ○ | ○ |
CP2K | Solver Simulator | ○ | ○ |
OpenFOAM | Solver Simulator, Visualization | ○ | ○ |
CuDNN | GPU library | ○ | ○ |
NCCL | GPU library | ○ | ○ |
Caffe | DeepLearning framework | ○ | |
Chainer | DeepLearning framework | ○ | |
TensorFlow | DeepLearning framework | ○ | ○ |
DeePMD-kit | MD DeepLearning framework | ○ | ○ |
R | Interpriter (Rmpi,rpud) | ○ | ○ |
clang | Compiler | ○ | ○ AOCC |
Apache Hadoop | Distributed Data Processing Tools | ○ | ○ |
POV-Ray | Visualization | ○ | ○ |
ParaView | Visualization | ○ | ○ |
VisIt | Visualization | ○ | ○ |
turbovnc | Remote GUI(X11) | ○ | ○ |
gnuplot | Data grapher/visualization | ○ | ○ |
Tgif | Image Viewing and Editing | ○ | ○ |
GIMP | Image Viewing and Editing | ○ | ○ |
ImageMagick | Image Viewing and Editing | ○ | ○ |
TeX Live | TeX distribution | ○ | ○ |
Java SDK | Development tool | ○ | ○ |
PETSc | Linear system solvers, libraries | ○ | ○ |
FFTW | Fast Fourier Transform Library | ○ | ○ |
DMTCP | Checkpoint Restart | ○ | ○ |
Singularity | Linux container for HPC | ○ | ○ Apptainer |
Open OnDemand | Web portal site for HPC | ○ |
Appendx.4. Storage¶
TSUBAME3.0 | Storage | Mount point | Capacity | Filesystem |
---|---|---|---|---|
Home directory Shared application deployment (HDD) |
/home /apps |
40TB | GPFS+cNFS | |
Hight-speed storage area 1 (HDD) | /gs/hs0 | 4.8PB | Lustre | |
Hight-speed storage area 2 (HDD) | /gs/hs1 | 4.8PB | Lustre | |
Hight-speed storage area 3 (HDD) | /gs/hs2 | 4.8PB | Lustre | |
Local scratch area (SSD) | /scr | 1.9TB/node | xfs | |
Shared scratch area | /beeond | Depends on compute node | BeeGFS |
TSUBAME4.0 | Storage | Mount point | Capacity | Filesystem |
---|---|---|---|---|
High-speed storage area Home directory (SSD) |
/gs/fs /home |
372TB | Lustre | |
Large-scale (Big) storage area Shared application deployment (HDD) |
/gs/bs /apps |
44.2PB | Lustre | |
Local scratch area (SSD) | /local | 1.92TB/node | xfs |