| MPI2007 license: | 6569 | Test date: | Sep-2024 |
|---|---|---|---|
| Test sponsor: | Supermicro | Hardware Availability: | Oct-2024 |
| Tested by: | Supermicro | Software Availability: | Apr-2024 |

Results appear in the order in which they were run. Bold underlined text indicates a median measurement.

| Benchmark | Base Ranks | Base Seconds (run 1) | Base Ratio (run 1) | Base Seconds (run 2) | Base Ratio (run 2) | Base Seconds (run 3) | Base Ratio (run 3) | Peak Ranks | Peak Seconds (run 1) | Peak Ratio (run 1) | Peak Seconds (run 2) | Peak Ratio (run 2) | Peak Seconds (run 3) | Peak Ratio (run 3) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 104.milc | 256 | 18.5 | 84.6 | 18.4 | 85.1 | 18.3 | 85.6 | 256 | 18.5 | 84.6 | 18.4 | 85.1 | 18.3 | 85.6 |
| 107.leslie3d | 256 | 61.4 | 85.0 | 61.5 | 84.9 | 61.9 | 84.4 | 256 | 61.4 | 85.0 | 61.5 | 84.9 | 61.9 | 84.4 |
| 113.GemsFDTD | 256 | 104 | 60.7 | 104 | 60.6 | 104 | 60.5 | 256 | 104 | 60.7 | 104 | 60.6 | 104 | 60.5 |
| 115.fds4 | 256 | 13.9 | 141 | 13.9 | 140 | 13.9 | 140 | 256 | 13.9 | 141 | 13.9 | 140 | 13.9 | 140 |
| 121.pop2 | 256 | 61.5 | 67.2 | 61.6 | 67.0 | 61.3 | 67.4 | 256 | 61.5 | 67.2 | 61.6 | 67.0 | 61.3 | 67.4 |
| 122.tachyon | 256 | 30.2 | 92.6 | 29.4 | 95.0 | 29.4 | 95.1 | 256 | 30.2 | 92.6 | 29.4 | 95.0 | 29.4 | 95.1 |
| 126.lammps | 256 | 64.2 | 45.4 | 64.2 | 45.4 | 64.1 | 45.5 | 256 | 64.2 | 45.4 | 64.2 | 45.4 | 64.1 | 45.5 |
| 127.wrf2 | 256 | 42.1 | 185 | 42.5 | 183 | 42.1 | 185 | 256 | 42.1 | 185 | 42.5 | 183 | 42.1 | 185 |
| 128.GAPgeofem | 256 | 14.1 | 147 | 14.0 | 147 | 14.0 | 147 | 256 | 14.1 | 147 | 14.0 | 147 | 14.0 | 147 |
| 129.tera_tf | 256 | 24.4 | 114 | 24.4 | 114 | 24.4 | 114 | 256 | 24.4 | 114 | 24.4 | 114 | 24.4 | 114 |
| 130.socorro | 256 | 37.6 | 102 | 37.6 | 101 | 38.5 | 99.1 | 256 | 37.6 | 102 | 37.6 | 101 | 38.5 | 99.1 |
| 132.zeusmp2 | 256 | 33.0 | 93.9 | 33.0 | 94.0 | 33.0 | 94.0 | 256 | 33.0 | 93.9 | 33.0 | 94.0 | 33.0 | 94.0 |
| 137.lu | 256 | 30.4 | 121 | 30.4 | 121 | 30.4 | 121 | 256 | 30.4 | 121 | 30.4 | 121 | 30.4 | 121 |
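
The table lists three runs per benchmark but no overall score. As a rough sketch, assuming the usual SPEC convention that the overall metric is the geometric mean of the per-benchmark median ratios, the base figure can be recomputed from the table. The dictionary below is a hand copy of the Base Ratio columns; the variable names are illustrative only, and because the table ratios are rounded the result is approximate.

```python
from statistics import geometric_mean, median

# Base ratios copied from the results table (three runs per benchmark).
base_ratios = {
    "104.milc": [84.6, 85.1, 85.6],
    "107.leslie3d": [85.0, 84.9, 84.4],
    "113.GemsFDTD": [60.7, 60.6, 60.5],
    "115.fds4": [141, 140, 140],
    "121.pop2": [67.2, 67.0, 67.4],
    "122.tachyon": [92.6, 95.0, 95.1],
    "126.lammps": [45.4, 45.4, 45.5],
    "127.wrf2": [185, 183, 185],
    "128.GAPgeofem": [147, 147, 147],
    "129.tera_tf": [114, 114, 114],
    "130.socorro": [102, 101, 99.1],
    "132.zeusmp2": [93.9, 94.0, 94.0],
    "137.lu": [121, 121, 121],
}

# Median of the three runs for each benchmark.
medians = {name: median(runs) for name, runs in base_ratios.items()}

# Assumed overall metric: geometric mean of the per-benchmark medians.
overall = geometric_mean(list(medians.values()))

for name, value in medians.items():
    print(f"{name:15s} median ratio {value}")
print(f"geometric mean of medians: {overall:.1f}")
```

Because every benchmark is marked basepeak = yes, the peak columns repeat the base columns, so the same calculation applies to the peak figure.
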

| Hardware Summary | |
|---|---|
| Type of System: | Homogeneous |
| Compute Node: | Hyper A+ Server AS -2126HS-TN |
| Total Compute Nodes: | 1 |
| Total Chips: | 2 |
| Total Cores: | 256 |
| Total Threads: | 256 |
| Total Memory: | 1536 GB |
| Base Ranks Run: | 256 |
| Minimum Peak Ranks: | 256 |
| Maximum Peak Ranks: | 256 |

| Software Summary | |
|---|---|
| C Compiler: | Intel oneAPI DPC++/C++ Compiler 2024.2.1 |
| C++ Compiler: | Intel oneAPI DPC++/C++ Compiler 2024.2.1 |
| Fortran Compiler: | Intel oneAPI DPC++/C++ Compiler 2024.2.1 |
| Base Pointers: | 64-bit |
| Peak Pointers: | 64-bit |
| MPI Library: | Intel MPI Version 2021.13 |
| Other MPI Info: | None |
| Pre-processors: | No |
| Other Software: | Jemalloc-5.3.0 |

| Hardware | |
|---|---|
| Number of nodes: | 1 |
| Uses of the node: | compute |
| Vendor: | Supermicro |
| Model: | Hyper A+ Server AS -2126HS-TN |
| CPU Name: | AMD EPYC 9755 |
| CPU(s) orderable: | 1,2 chips |
| Chips enabled: | 2 |
| Cores enabled: | 256 |
| Cores per chip: | 128 |
| Threads per core: | 1 |
| CPU Characteristics: | Max. Boost Clock up to 4.1 GHz |
| CPU MHz: | 2700 |
| Primary Cache: | 32 KB I + 48 KB D on chip per core |
| Secondary Cache: | 1 MB I+D on chip per core |
| L3 Cache: | 512 MB I+D on chip per chip, 32 MB shared / 8 cores |
| Other Cache: | None |
| Memory: | 1536 GB (24 x 64 GB 2Rx4 PC5-6400B-R, running at 6000) |
| Disk Subsystem: | 1 x 3.5 TB NVMe SSD |
| Other Hardware: | None |
| Adapter: | None |
| Number of Adapters: | 1 |
| Slot Type: | None |
| Data Rate: | None |
| Ports Used: | 0 |
| Interconnect Type: | None |
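
The hardware figures can be cross-checked against each other with a few lines of arithmetic. This is only a sanity-check sketch: every number is copied from the tables above, and the variable names are made up for illustration.

```python
# Values copied from the hardware tables.
chips = 2
cores_per_chip = 128
threads_per_core = 1
dimm_count = 24
dimm_size_gb = 64
base_ranks = 256

total_cores = chips * cores_per_chip             # 256, matches "Total Cores"
total_threads = total_cores * threads_per_core   # 256, matches "Total Threads"
total_memory_gb = dimm_count * dimm_size_gb      # 1536, matches "Total Memory"

# With one MPI rank per core, memory available per rank:
memory_per_rank_gb = total_memory_gb / base_ranks  # 6.0 GB

print(total_cores, total_threads, total_memory_gb, memory_per_rank_gb)
```

The rank count equals the core count, which is consistent with one thread per core in the hardware table.
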

| Software | |
|---|---|
| Adapter: | None |
| Adapter Driver: | None |
| Adapter Firmware: | None |
| Operating System: | Ubuntu 24.04 LTS 6.8.0-44-generic |
| Local File System: | ext4 |
| Shared File System: | None |
| System State: | Multi-user, run level 3 |
| Other Software: | None |

The config file option 'submit' was used:

mpiexec.hydra -bootstrap ssh -hosts localhost -genv I_MPI_COMPATIBILITY=3 -np $ranks -ppn $ranks $command
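
To make the submit line concrete, the sketch below expands its `$ranks` and `$command` placeholders with Python's `string.Template`. It only illustrates the substitution; it is not how the SPEC tools perform it, and `./benchmark_exe` is a hypothetical stand-in for the real benchmark command.

```python
import shlex
from string import Template

# Submit line copied from the report; $ranks and $command are its placeholders.
SUBMIT = Template(
    "mpiexec.hydra -bootstrap ssh -hosts localhost "
    "-genv I_MPI_COMPATIBILITY=3 -np $ranks -ppn $ranks $command"
)

def expand_submit(ranks: int, command: str) -> str:
    """Return the submit line with both placeholders filled in."""
    return SUBMIT.substitute(ranks=ranks, command=command)

# "./benchmark_exe" is a hypothetical executable name used for illustration.
line = expand_submit(256, "./benchmark_exe")
argv = shlex.split(line)  # argument vector a shell would pass to the launcher

print(line)
```

With 256 ranks, both `-np` and `-ppn` become 256, so all ranks are placed on the single node named by `-hosts localhost`.
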

MPI startup command: mpiexec.hydra command was used to start MPI jobs.

RAM configuration: Compute nodes have 1 x 64 GB RDIMM on each memory channel.

| BIOS settings: | |
|---|---|
| SMT | Disabled |
| NUMA nodes per socket | NPS4 |
| ACPI SRAT L3 Cache as NUMA Domain | Enabled |
| Determinism Control | Manual |
| Determinism Enable | Power |
| xGMI Link Configuration | 4 xGMI Links |
| 4 Link xGMI max speed | 32Gbps |
| TDP Control | Manual |
| TDP | 500 |
| Package Power Limit Control | Manual |
| Package Power Limit | 500 |

NA: The test sponsor attests, as of date of publication, that CVE-2017-5754 (Meltdown) is mitigated in the system as tested and documented.

Yes: The test sponsor attests, as of date of publication, that CVE-2017-5753 (Spectre variant 1) is mitigated in the system as tested and documented.

Yes: The test sponsor attests, as of date of publication, that CVE-2017-5715 (Spectre variant 2) is mitigated in the system as tested and documented.
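
The BIOS setting "ACPI SRAT L3 Cache as NUMA Domain = Enabled" interacts with the L3 layout in the hardware table (512 MB per chip, 32 MB shared per 8 cores). The sketch below works out the resulting layout under the assumption that each 32 MB L3 slice is exposed to the operating system as its own NUMA domain; the report does not state the NUMA node count, so treat the figures as a derived estimate.

```python
# Values copied from the hardware table and the results table.
chips = 2
cores_per_chip = 128
l3_per_chip_mb = 512
l3_slice_mb = 32
cores_per_l3_slice = 8
ranks = 256

# Number of 32 MB L3 slices on each chip.
l3_slices_per_chip = l3_per_chip_mb // l3_slice_mb  # 16

# Cross-check: the slices must account for every core on the chip.
assert l3_slices_per_chip * cores_per_l3_slice == cores_per_chip

# Assumption: one NUMA domain per L3 slice when the BIOS option is enabled.
numa_domains = chips * l3_slices_per_chip   # 32
ranks_per_domain = ranks // numa_domains    # 8

print(l3_slices_per_chip, numa_domains, ranks_per_domain)
```

Under that assumption, each domain hosts eight ranks sharing one 32 MB L3 slice.
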
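
| Base Compiler Invocation | |
|---|---|
| C benchmarks: | mpiicc -cc=icx |
| C++ benchmarks (126.lammps): | mpiicpc -cxx=icpx |
| Fortran benchmarks: | mpiifort -fc=ifx |
| Benchmarks using both Fortran and C: | mpiicc -cc=icx mpiifort -fc=ifx |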

| Base Portability Flags | |
|---|---|
| 104.milc: | -DSPEC_MPI_LP64 |
| 115.fds4: | -DSPEC_MPI_LP64 |
| 121.pop2: | -DSPEC_MPI_CASE_FLAG -DSPEC_MPI_LP64 |
| 122.tachyon: | -DSPEC_MPI_LP64 |
| 126.lammps: | -DMPICH_IGNORE_CXX_SEEK |
| 127.wrf2: | -DSPEC_MPI_CASE_FLAG -DSPEC_MPI_LINUX -DSPEC_MPI_LP64 |
| 128.GAPgeofem: | -DSPEC_MPI_LP64 |
| 130.socorro: | -DSPEC_MPI_LP64 |
| 132.zeusmp2: | -DSPEC_MPI_LP64 |

| Base Optimization Flags | |
|---|---|
| C benchmarks: | -Ofast -ipo -march=skylake-avx512 -mtune=skylake-avx512 -ansi-alias |
| C++ benchmarks (126.lammps): | -Ofast -ipo -march=skylake-avx512 -mtune=skylake-avx512 -ansi-alias |
| Fortran benchmarks: | -Ofast -ipo -march=skylake-avx512 -mtune=skylake-avx512 -nostandard-realloc-lhs -align array64byte |
| Benchmarks using both Fortran and C: | -Ofast -ipo -march=skylake-avx512 -mtune=skylake-avx512 -ansi-alias -nostandard-realloc-lhs -align array64byte |

| Base Other Flags | |
|---|---|
| 104.milc: | -Wno-implicit-function-declaration -Wno-implicit-int -limf -Wl,--rpath=/usr/local/lib -ljemalloc |
| 122.tachyon: | -limf -Wl,--rpath=/usr/local/lib -ljemalloc |
| 126.lammps: | -Wno-register -limf -Wl,--rpath=/usr/local/lib -ljemalloc |
| | -limf -Wl,--rpath=/usr/local/lib -ljemalloc |

| Peak Optimization Flags | |
|---|---|
| 104.milc: | basepeak = yes |
| 122.tachyon: | basepeak = yes |
| 126.lammps: | basepeak = yes |
| 107.leslie3d: | basepeak = yes |
| 113.GemsFDTD: | basepeak = yes |
| 129.tera_tf: | basepeak = yes |
| 137.lu: | basepeak = yes |
| 115.fds4: | basepeak = yes |
| 121.pop2: | basepeak = yes |
| 127.wrf2: | basepeak = yes |
| 128.GAPgeofem: | basepeak = yes |
| 130.socorro: | basepeak = yes |
| 132.zeusmp2: | basepeak = yes |