SPEC® CFP2006 Result

Copyright 2006-2017 Standard Performance Evaluation Corporation

Huawei

Kunlun 9008 (Intel Xeon E7-8867 v3)

CPU2006 license: 3175 Test date: Dec-2016
Test sponsor: Huawei Hardware Availability: Jan-2016
Tested by: Huawei Software Availability: Sep-2014
Benchmark results graph
Hardware
CPU Name: Intel Xeon E7-8867 v3
CPU Characteristics: Intel Turbo Boost Technology up to 3.30 GHz
CPU MHz: 2500
FPU: Integrated
CPU(s) enabled: 128 cores, 8 chips, 16 cores/chip, 2 threads/core
CPU(s) orderable: 4,8 chips
Primary Cache: 32 KB I + 32 KB D on chip per core
Secondary Cache: 256 KB I+D on chip per core
L3 Cache: 45 MB I+D on chip per chip
Other Cache: None
Memory: 1 TB (64 x 16 GB 2Rx4 PC4-2133P-R,
running at 1600 MHz)
Disk Subsystem: 2 x 600 GB SAS, 10K RPM
Other Hardware: None
Software
Operating System: Red Hat Enterprise Linux Server release 7.1
(Maipo)
3.10.0-229.el7.x86_64
Compiler: C/C++: Version 16.0.0.101 of Intel C++ Studio XE
for Linux;
Fortran: Version 16.0.0.101 of Intel Fortran
Studio XE for Linux
Auto Parallel: No
File System: xfs
System State: Run level 3 (multi-user)
Base Pointers: 32/64-bit
Peak Pointers: 32/64-bit
Other Software: None

Results Table

Benchmark Base Peak
Copies Seconds Ratio Seconds Ratio Seconds Ratio Copies Seconds Ratio Seconds Ratio Seconds Ratio
Results appear in the order in which they were run. Bold underlined text indicates a median measurement.
410.bwaves 256 1117 3120 1110 3140 1110 3130 256 1117 3120 1110 3140 1110 3130
416.gamess 256 1116 4490 1121 4470 1118 4480 256 1072 4670 1070 4680 1073 4670
433.milc 256 833 2820 832 2820 832 2820 256 833 2820 832 2820 832 2820
434.zeusmp 256 555 4200 549 4250 554 4200 256 555 4200 549 4250 554 4200
435.gromacs 256 353 5170 354 5160 355 5150 256 344 5310 340 5370 341 5360
436.cactusADM 256 667 4580 670 4570 668 4580 256 667 4580 670 4570 668 4580
437.leslie3d 256 1151 2090 1145 2100 1143 2110 256 1151 2090 1145 2100 1143 2110
444.namd 256 553 3710 558 3680 555 3700 256 549 3740 551 3730 553 3710
447.dealII 256 411 7120 413 7090 414 7080 256 411 7120 413 7090 414 7080
450.soplex 256 1004 2130 1007 2120 1007 2120 128 473 2260 472 2260 472 2260
453.povray 256 242 5630 244 5570 239 5700 256 208 6550 206 6620 205 6650
454.calculix 256 336 6290 331 6380 338 6240 256 336 6290 331 6380 338 6240
459.GemsFDTD 256 1397 1940 1393 1950 1396 1950 256 1397 1940 1393 1950 1396 1950
465.tonto 256 581 4330 582 4330 581 4340 256 549 4590 548 4600 550 4580
470.lbm 256 942 3730 944 3730 942 3740 256 942 3730 944 3730 942 3740
481.wrf 256 793 3610 793 3610 793 3610 256 793 3610 793 3610 793 3610
482.sphinx3 256 1395 3580 1391 3590 1385 3600 256 1395 3580 1391 3590 1385 3600

Submit Notes

 The numactl mechanism was used to bind copies to processors. The config file option 'submit'
 was used to generate numactl commands to bind each copy to a specific processor.
 For details, please see the config file.

Operating System Notes

 Stack size set to unlimited using "ulimit -s unlimited"
 Turbo mode set with:
 	cpupower -c all frequency-set -g performance

Platform Notes

 BIOS configuration:
 Set Power Efficiency Mode to Performance
 Set Lock_step to disabled
 Baseboard Management Controller used to adjust the fan speed to 100%
 Set C-State to C0/C1
 Sysinfo program /home/spec/config/sysinfo.rev6914
 $Rev: 6914 $ $Date:: 2014-06-25 #$ e3fbb8667b5a285932ceab81e28219e1
 running on localhost.localdomain Wed Dec 21 09:09:26 2016

 This section contains SUT (System Under Test) info as seen by
 some common utilities.  To remove or add to this section, see:
   http://www.spec.org/cpu2006/Docs/config.html#sysinfo

 From /proc/cpuinfo
    model name : Intel(R) Xeon(R) CPU E7-8867 v3 @ 2.50GHz
       8 "physical id"s (chips)
       256 "processors"
    cores, siblings (Caution: counting these is hw and system dependent.  The
    following excerpts from /proc/cpuinfo might not be reliable.  Use with
    caution.)
       cpu cores : 16
       siblings  : 32
       physical 0: cores 0 1 2 3 4 8 9 10 11 16 17 18 20 24 25 27
       physical 1: cores 0 1 2 3 4 8 9 10 11 16 17 18 20 24 25 27
       physical 2: cores 0 1 2 3 4 8 9 10 11 16 17 18 20 24 25 27
       physical 3: cores 0 1 2 3 4 8 9 10 11 16 17 18 20 24 25 27
       physical 4: cores 0 1 2 3 4 8 9 10 11 16 17 18 20 24 25 27
       physical 5: cores 0 1 2 3 4 8 9 10 11 16 17 18 20 24 25 27
       physical 6: cores 0 1 2 3 4 8 9 10 11 16 17 18 20 24 25 27
       physical 7: cores 0 1 2 3 4 8 9 10 11 16 17 18 20 24 25 27
    cache size : 46080 KB

 From /proc/meminfo
    MemTotal:       1055990848 kB
    HugePages_Total:       0
    Hugepagesize:       2048 kB

 From /etc/*release* /etc/*version*
    os-release:
       NAME="Red Hat Enterprise Linux Server"
       VERSION="7.1 (Maipo)"
       ID="rhel"
       ID_LIKE="fedora"
       VERSION_ID="7.1"
       PRETTY_NAME="Red Hat Enterprise Linux Server 7.1 (Maipo)"
       ANSI_COLOR="0;31"
       CPE_NAME="cpe:/o:redhat:enterprise_linux:7.1:GA:server"
    redhat-release: Red Hat Enterprise Linux Server release 7.1 (Maipo)
    system-release: Red Hat Enterprise Linux Server release 7.1 (Maipo)
    system-release-cpe: cpe:/o:redhat:enterprise_linux:7.1:ga:server

 uname -a:
    Linux localhost.localdomain 3.10.0-229.el7.x86_64 #1 SMP Thu Jan 29 18:37:38
    EST 2015 x86_64 x86_64 x86_64 GNU/Linux

 run-level 3 Dec 20 11:03

 SPEC is set to: /home/spec
    Filesystem            Type  Size  Used Avail Use% Mounted on
    /dev/mapper/rhel-home xfs   1.1T   52G 1009G   5% /home
 Additional information from dmidecode:

    Warning: Use caution when you interpret this section. The 'dmidecode' program
    reads system data which is "intended to allow hardware to be accurately
    determined", but the intent may not be met, as there are frequent changes to
    hardware, firmware, and the "DMTF SMBIOS" standard.

   BIOS American Megatrends Inc. BLXSV106 08/13/2016
   Memory:
    1x Hynix HMA42GR7MFR4N-TFTD 16 GB 2 rank 2133 MHz, configured at 1600 MHz
    63x Micron 36ASF2G72PZ-2G1A2 16 GB 2 rank 2133 MHz, configured at 1600 MHz
    128x NO DIMM NO DIMM

 (End of data from sysinfo program)

General Notes

Environment variables set by runspec before the start of the run:
LD_LIBRARY_PATH = "/home/spec/libs/32:/home/spec/libs/64:/home/spec/sh"

 Binaries compiled on a system with 1x Intel Core i5-4670K CPU + 32GB
 memory using RedHat EL 7.1
 Transparent Huge Pages enabled with:
 echo always > /sys/kernel/mm/transparent_hugepage/enabled
 Filesystem page cache cleared with:
 echo 1>       /proc/sys/vm/drop_caches
 runspec command invoked through numactl i.e.:
 numactl --interleave=all runspec <etc>

Base Compiler Invocation

C benchmarks:

 icc -m64 

C++ benchmarks:

 icpc -m64 

Fortran benchmarks:

 ifort -m64 

Benchmarks using both Fortran and C:

 icc -m64   ifort -m64 

Base Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64   -nofor_main 
436.cactusADM:  -DSPEC_CPU_LP64   -nofor_main 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
447.dealII:  -DSPEC_CPU_LP64 
450.soplex:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64   -nofor_main 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_CASE_FLAG   -DSPEC_CPU_LINUX 
482.sphinx3:  -DSPEC_CPU_LP64 

Base Optimization Flags

C benchmarks:

 -xCORE-AVX2   -ipo   -O3   -no-prec-div   -opt-prefetch   -auto-p32   -ansi-alias   -opt-mem-layout-trans=3 

C++ benchmarks:

 -xCORE-AVX2   -ipo   -O3   -no-prec-div   -opt-prefetch   -auto-p32   -ansi-alias   -opt-mem-layout-trans=3 

Fortran benchmarks:

 -xCORE-AVX2   -ipo   -O3   -no-prec-div   -opt-prefetch 

Benchmarks using both Fortran and C:

 -xCORE-AVX2   -ipo   -O3   -no-prec-div   -opt-prefetch   -auto-p32   -ansi-alias   -opt-mem-layout-trans=3 

Peak Compiler Invocation

C benchmarks:

 icc -m64 

C++ benchmarks (except as noted below):

 icpc -m64 
450.soplex:  icpc -m32 -L/opt/intel/compilers_and_libraries_2016/linux/compiler/lib/ia32_lin 

Fortran benchmarks:

 ifort -m64 

Benchmarks using both Fortran and C:

 icc -m64   ifort -m64 

Peak Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64   -nofor_main 
436.cactusADM:  -DSPEC_CPU_LP64   -nofor_main 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
447.dealII:  -DSPEC_CPU_LP64 
450.soplex:  -D_FILE_OFFSET_BITS=64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64   -nofor_main 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_CASE_FLAG   -DSPEC_CPU_LINUX 
482.sphinx3:  -DSPEC_CPU_LP64 

Peak Optimization Flags

C benchmarks:

433.milc:  basepeak = yes 
470.lbm:  basepeak = yes 
482.sphinx3:  basepeak = yes 

C++ benchmarks:

444.namd:  -xCORE-AVX2(pass 2)   -prof-gen:threadsafe(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -par-num-threads=1(pass 1)   -opt-mem-layout-trans=3(pass 2)   -prof-use(pass 2)   -fno-alias   -auto-ilp32 
447.dealII:  basepeak = yes 
450.soplex:  -xCORE-AVX2(pass 2)   -prof-gen:threadsafe(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -par-num-threads=1(pass 1)   -opt-mem-layout-trans=3(pass 2)   -prof-use(pass 2)   -opt-malloc-options=3 
453.povray:  -xCORE-AVX2(pass 2)   -prof-gen:threadsafe(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -par-num-threads=1(pass 1)   -opt-mem-layout-trans=3(pass 2)   -prof-use(pass 2)   -unroll4   -ansi-alias 

Fortran benchmarks:

410.bwaves:  basepeak = yes 
416.gamess:  -xCORE-AVX2(pass 2)   -prof-gen:threadsafe(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -par-num-threads=1(pass 1)   -prof-use(pass 2)   -unroll2   -inline-level=0   -scalar-rep- 
434.zeusmp:  basepeak = yes 
437.leslie3d:  basepeak = yes 
459.GemsFDTD:  basepeak = yes 
465.tonto:  -xCORE-AVX2(pass 2)   -prof-gen:threadsafe(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -par-num-threads=1(pass 1)   -prof-use(pass 2)   -unroll4   -auto   -inline-calloc   -opt-malloc-options=3 

Benchmarks using both Fortran and C:

435.gromacs:  -xCORE-AVX2(pass 2)   -prof-gen:threadsafe(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -par-num-threads=1(pass 1)   -opt-mem-layout-trans=3(pass 2)   -prof-use(pass 2)   -opt-prefetch   -auto-ilp32 
436.cactusADM:  basepeak = yes 
454.calculix:  basepeak = yes 
481.wrf:  basepeak = yes 

The flags files that were used to format this result can be browsed at
http://www.spec.org/cpu2006/flags/Intel-ic16.0-official-linux64.html,
http://www.spec.org/cpu2006/flags/Huawei-Platform-Settings-V1.2-HSW-RevG.html.

You can also download the XML flags sources by saving the following links:
http://www.spec.org/cpu2006/flags/Intel-ic16.0-official-linux64.xml,
http://www.spec.org/cpu2006/flags/Huawei-Platform-Settings-V1.2-HSW-RevG.xml.