SPEC® CFP2006 Result

Copyright 2006-2017 Standard Performance Evaluation Corporation

Huawei

Huawei 1288H V5 (Intel Xeon Platinum 8170)

CPU2006 license: 3175 Test date: Jun-2017
Test sponsor: Huawei Hardware Availability: Aug-2017
Tested by: Huawei Software Availability: Nov-2016
Benchmark results graph
Hardware
CPU Name: Intel Xeon Platinum 8170
CPU Characteristics: Intel Turbo Boost Technology up to 3.70 GHz
CPU MHz: 2100
FPU: Integrated
CPU(s) enabled: 52 cores, 2 chips, 26 cores/chip, 2 threads/core
CPU(s) orderable: 1,2 chip
Primary Cache: 32 KB I + 32 KB D on chip per core
Secondary Cache: 1 MB I+D on chip per core
L3 Cache: 35.75 MB I+D on chip per chip
Other Cache: None
Memory: 768 GB (24 x 32 GB 2Rx4 PC4-2666V-R)
Disk Subsystem: 1 x 1200 GB SAS, 10000 RPM
Other Hardware: None
Software
Operating System: Red Hat Enterprise Linux Server release 7.3
(Maipo)
3.10.0-514.el7.x86_64
Compiler: C/C++: Version 17.0.0.098 of Intel C/C++
Compiler for Linux;
Fortran: Version 17.0.0.098 of Intel Fortran
Compiler for Linux
Auto Parallel: No
File System: xfs
System State: Run level 3 (multi-user)
Base Pointers: 32/64-bit
Peak Pointers: 32/64-bit
Other Software: None

Results Table

Benchmark Base Peak
Copies Seconds Ratio Seconds Ratio Seconds Ratio Copies Seconds Ratio Seconds Ratio Seconds Ratio
Results appear in the order in which they were run. Bold underlined text indicates a median measurement.
410.bwaves 104 1269 1110 1270 1110 1270 1110 104 1270 1110 1271 1110 1270 1110
416.gamess 104 1074 1900 1073 1900 1117 1820 104 1074 1900 1073 1900 1117 1820
433.milc 104 894 1070 894 1070 895 1070 104 894 1070 894 1070 895 1070
434.zeusmp 104 509 1860 503 1880 506 1870 104 509 1860 503 1880 506 1870
435.gromacs 104 303 2450 304 2440 305 2430 104 298 2490 298 2490 299 2480
436.cactusADM 104 588 2110 589 2110 587 2120 104 588 2110 589 2110 587 2120
437.leslie3d 104 1194 819 1194 818 1194 819 104 1194 819 1194 818 1194 819
444.namd 104 488 1710 485 1720 483 1730 104 478 1740 480 1740 480 1740
447.dealII 104 374 3180 371 3210 367 3240 104 374 3180 371 3210 367 3240
450.soplex 104 1021 850 1020 851 1018 852 104 959 905 959 904 958 905
453.povray 104 192 2880 193 2870 193 2870 104 165 3360 165 3350 164 3370
454.calculix 104 264 3250 266 3220 266 3220 104 264 3250 266 3220 266 3220
459.GemsFDTD 104 1446 763 1445 764 1445 764 104 1444 764 1442 765 1442 765
465.tonto 104 553 1850 546 1870 551 1860 104 512 2000 516 1980 507 2020
470.lbm 104 928 1540 928 1540 928 1540 104 928 1540 928 1540 928 1540
481.wrf 104 864 1350 862 1350 867 1340 104 864 1350 862 1350 867 1340
482.sphinx3 104 1319 1540 1316 1540 1321 1530 104 1319 1540 1316 1540 1321 1530

Submit Notes

 The numactl mechanism was used to bind copies to processors. The config file option 'submit'
 was used to generate numactl commands to bind each copy to a specific processor.
 For details, please see the config file.

Operating System Notes

 Stack size set to unlimited using "ulimit -s unlimited"

Platform Notes

 BIOS configuration:
 Set Power Efficiency Mode to Performance
 Set SNC to Enable
 Set IMC Interleaving to 1 way
 Set Patrol Scrub to Disable
 Sysinfo program /spec17/config/sysinfo.rev6993
 Revision 6993 of 2015-11-06 (b5e8d4b4eb51ed28d7f98696cbe290c1)
 running on localhost.localdomain Mon Jun 19 21:04:47 2017

 This section contains SUT (System Under Test) info as seen by
 some common utilities.  To remove or add to this section, see:
   http://www.spec.org/cpu2006/Docs/config.html#sysinfo

 From /proc/cpuinfo
    model name : Intel(R) Xeon(R) Platinum 8170 CPU @ 2.10GHz
       2 "physical id"s (chips)
       104 "processors"
    cores, siblings (Caution: counting these is hw and system dependent.  The
    following excerpts from /proc/cpuinfo might not be reliable.  Use with
    caution.)
       cpu cores : 26
       siblings  : 52
       physical 0: cores 0 1 2 3 4 5 6 8 9 10 11 12 13 16 17 18 19 20 21 22 24 25
       26 27 28 29
       physical 1: cores 0 1 2 3 4 5 6 8 9 10 11 12 13 16 17 18 19 20 21 22 24 25
       26 27 28 29
    cache size : 36608 KB

 From /proc/meminfo
    MemTotal:       790481628 kB
    HugePages_Total:       0
    Hugepagesize:       2048 kB

 From /etc/*release* /etc/*version*
    os-release:
       NAME="Red Hat Enterprise Linux Server"
       VERSION="7.3 (Maipo)"
       ID="rhel"
       ID_LIKE="fedora"
       VERSION_ID="7.3"
       PRETTY_NAME="Red Hat Enterprise Linux Server 7.3 (Maipo)"
       ANSI_COLOR="0;31"
       CPE_NAME="cpe:/o:redhat:enterprise_linux:7.3:GA:server"
    redhat-release: Red Hat Enterprise Linux Server release 7.3 (Maipo)
    system-release: Red Hat Enterprise Linux Server release 7.3 (Maipo)
    system-release-cpe: cpe:/o:redhat:enterprise_linux:7.3:ga:server

 uname -a:
    Linux localhost.localdomain 3.10.0-514.el7.x86_64 #1 SMP Wed Oct 19 11:24:13
    EDT 2016 x86_64 x86_64 x86_64 GNU/Linux

 run-level 3 Jun 15 18:29

 SPEC is set to: /spec17
    Filesystem     Type  Size  Used Avail Use% Mounted on
    /dev/sda2      xfs   898G   17G  882G   2% /
 Additional information from dmidecode:

    Warning: Use caution when you interpret this section. The 'dmidecode' program
    reads system data which is "intended to allow hardware to be accurately
    determined", but the intent may not be met, as there are frequent changes to
    hardware, firmware, and the "DMTF SMBIOS" standard.

   BIOS INSYDE Corp. 0.13 04/11/2017
   Memory:
    24x Samsung M393A4K40BB2-CTD 32 GB 2 rank 2666 MHz

 (End of data from sysinfo program)

General Notes

Environment variables set by runspec before the start of the run:
LD_LIBRARY_PATH = "/spec17/libs/32:/spec17/libs/64:/spec17/sh10.2"

 Binaries compiled on a system with 1x Intel Core i7-4790 CPU + 32GB RAM
 memory using Redhat Enterprise Linux 7.2
 Transparent Huge Pages enabled with:
 echo always > /sys/kernel/mm/transparent_hugepage/enabled
 Filesystem page cache cleared with:
 echo 1>       /proc/sys/vm/drop_caches
 runspec command invoked through numactl i.e.:
 numactl --interleave=all runspec <etc>

Base Compiler Invocation

C benchmarks:

 icc -m64 

C++ benchmarks:

 icpc -m64 

Fortran benchmarks:

 ifort -m64 

Benchmarks using both Fortran and C:

 icc -m64   ifort -m64 

Base Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64   -nofor_main 
436.cactusADM:  -DSPEC_CPU_LP64   -nofor_main 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
447.dealII:  -DSPEC_CPU_LP64 
450.soplex:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64   -nofor_main 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_CASE_FLAG   -DSPEC_CPU_LINUX 
482.sphinx3:  -DSPEC_CPU_LP64 

Base Optimization Flags

C benchmarks:

 -xCORE-AVX2   -ipo   -O3   -no-prec-div   -qopt-prefetch   -auto-p32   -qopt-mem-layout-trans=3 

C++ benchmarks:

 -xCORE-AVX2   -ipo   -O3   -no-prec-div   -qopt-prefetch   -auto-p32   -qopt-mem-layout-trans=3 

Fortran benchmarks:

 -xCORE-AVX2   -ipo   -O3   -no-prec-div   -qopt-prefetch 

Benchmarks using both Fortran and C:

 -xCORE-AVX2   -ipo   -O3   -no-prec-div   -qopt-prefetch   -auto-p32   -qopt-mem-layout-trans=3 

Peak Compiler Invocation

C benchmarks:

 icc -m64 

C++ benchmarks (except as noted below):

 icpc -m64 
450.soplex:  icpc -m32 -L/opt/intel/compilers_and_libraries_2017/linux/lib/ia32 

Fortran benchmarks:

 ifort -m64 

Benchmarks using both Fortran and C:

 icc -m64   ifort -m64 

Peak Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64   -nofor_main 
436.cactusADM:  -DSPEC_CPU_LP64   -nofor_main 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
447.dealII:  -DSPEC_CPU_LP64 
450.soplex:  -D_FILE_OFFSET_BITS=64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64   -nofor_main 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_CASE_FLAG   -DSPEC_CPU_LINUX 
482.sphinx3:  -DSPEC_CPU_LP64 

Peak Optimization Flags

C benchmarks:

433.milc:  basepeak = yes 
470.lbm:  basepeak = yes 
482.sphinx3:  basepeak = yes 

C++ benchmarks:

444.namd:  -prof-gen(pass 1)   -prof-use(pass 2)   -xCORE-AVX2(pass 2)   -par-num-threads=1(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -fno-alias   -auto-ilp32   -qopt-mem-layout-trans=3 
447.dealII:  basepeak = yes 
450.soplex:  -prof-gen(pass 1)   -prof-use(pass 2)   -xCORE-AVX2(pass 2)   -par-num-threads=1(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -qopt-malloc-options=3   -qopt-mem-layout-trans=3 
453.povray:  -prof-gen(pass 1)   -prof-use(pass 2)   -xCORE-AVX2(pass 2)   -par-num-threads=1(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -unroll4   -qopt-mem-layout-trans=3 

Fortran benchmarks:

410.bwaves:  -xCORE-AVX2   -ipo   -O3   -no-prec-div   -qopt-prefetch 
416.gamess:  basepeak = yes 
434.zeusmp:  basepeak = yes 
437.leslie3d:  basepeak = yes 
459.GemsFDTD:  Same as 410.bwaves 
465.tonto:  -prof-gen(pass 1)   -prof-use(pass 2)   -xCORE-AVX2(pass 2)   -par-num-threads=1(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -unroll4   -auto   -inline-calloc   -qopt-malloc-options=3 

Benchmarks using both Fortran and C:

435.gromacs:  -prof-gen(pass 1)   -prof-use(pass 2)   -xCORE-AVX2(pass 2)   -par-num-threads=1(pass 1)   -qopt-prefetch   -auto-ilp32   -qopt-mem-layout-trans=3 
436.cactusADM:  basepeak = yes 
454.calculix:  basepeak = yes 
481.wrf:  basepeak = yes 

The flags files that were used to format this result can be browsed at
http://www.spec.org/cpu2006/flags/Intel-ic17.0-official-linux64.html,
http://www.spec.org/cpu2006/flags/Huawei-Platform-Settings-SKL-V1.6.html.

You can also download the XML flags sources by saving the following links:
http://www.spec.org/cpu2006/flags/Intel-ic17.0-official-linux64.xml,
http://www.spec.org/cpu2006/flags/Huawei-Platform-Settings-SKL-V1.6.xml.