SPEC® CFP2006 Result

Copyright 2006-2017 Standard Performance Evaluation Corporation

Lenovo Global Technology

ThinkSystem SR950
(3.00 GHz, Intel Xeon Platinum 8158)

CPU2006 license: 9017 Test date: Aug-2017
Test sponsor: Lenovo Global Technology Hardware Availability: Sep-2017
Tested by: Lenovo Global Technology Software Availability: Apr-2017
Benchmark results graph
Hardware
CPU Name: Intel Xeon Platinum 8158
CPU Characteristics: Intel Turbo Boost Technology up to 3.70 GHz
CPU MHz: 3000
FPU: Integrated
CPU(s) enabled: 48 cores, 4 chips, 12 cores/chip, 2 threads/core
CPU(s) orderable: 2,4 chips
Primary Cache: 32 KB I + 32 KB D on chip per core
Secondary Cache: 1 MB I+D on chip per core
L3 Cache: 24.75 MB I+D on chip per chip
Other Cache: None
Memory: 1536 GB (48 x 32 GB 2Rx4 PC4-2666V-R)
Disk Subsystem: 1 x 800 GB SAS SSD
Other Hardware: None
Software
Operating System: SUSE Linux Enterprise Server 12 SP2 (x86_64)
Kernel 4.4.21-69-default
Compiler: C/C++: Version 17.0.3.191 of Intel C/C++
Compiler for Linux;
Fortran: Version 17.0.3.191 of Intel Fortran
Compiler for Linux
Auto Parallel: Yes
File System: tmpfs
System State: Run level 3 (multi-user)
Base Pointers: 32/64-bit
Peak Pointers: 32/64-bit
Other Software: None

Results Table

Benchmark Base Peak
Copies Seconds Ratio Seconds Ratio Seconds Ratio Copies Seconds Ratio Seconds Ratio Seconds Ratio
Results appear in the order in which they were run. Bold underlined text indicates a median measurement.
410.bwaves 96 599 2180 600 2180 600 2180 48 298 2190 299 2180 299 2190
416.gamess 96 742 2530 752 2500 752 2500 96 729 2580 728 2580 728 2580
433.milc 96 411 2140 411 2140 411 2140 96 411 2140 411 2140 411 2140
434.zeusmp 96 306 2850 304 2880 304 2880 96 306 2850 304 2880 304 2880
435.gromacs 96 223 3070 223 3070 223 3070 96 221 3100 221 3110 221 3110
436.cactusADM 96 353 3250 353 3250 353 3250 96 353 3250 353 3250 353 3250
437.leslie3d 96 644 1400 632 1430 633 1430 48 279 1620 280 1610 280 1610
444.namd 96 376 2050 376 2050 378 2040 96 373 2060 375 2050 377 2040
447.dealII 96 281 3910 283 3880 282 3900 96 281 3910 283 3880 282 3900
450.soplex 96 525 1530 524 1530 522 1530 48 258 1550 257 1560 257 1560
453.povray 96 146 3490 146 3490 147 3480 96 125 4070 125 4090 125 4070
454.calculix 96 202 3920 203 3900 205 3870 96 202 3920 203 3900 205 3870
459.GemsFDTD 96 792 1290 792 1290 792 1290 48 418 1220 418 1220 418 1220
465.tonto 96 360 2630 352 2690 358 2640 96 334 2830 332 2840 332 2840
470.lbm 96 519 2540 519 2540 519 2540 96 519 2540 519 2540 519 2540
481.wrf 96 423 2540 420 2550 423 2540 96 423 2540 420 2550 423 2540
482.sphinx3 96 804 2330 802 2330 805 2320 96 804 2330 802 2330 805 2320

Submit Notes

 The numactl mechanism was used to bind copies to processors. The config file option 'submit'
 was used to generate numactl commands to bind each copy to a specific processor.
 For details, please see the config file.

Operating System Notes

 Stack size set to unlimited using "ulimit -s unlimited"
 Tmpfs filesystem can be set with:
  mount -t tmpfs -o size=800g tmpfs /home
 Process tuning setting:
  echo 50000     > /proc/sys/kernel/sched_cfs_bandwidth_slice_us
  echo 240000000 > /proc/sys/kernel/sched_latency_ns
  echo 5000000   > /proc/sys/kernel/sched_migration_cost_ns
  echo 100000000 > /proc/sys/kernel/sched_min_granularity_ns
  echo 150000000 > /proc/sys/kernel/sched_wakeup_granularity_ns

Platform Notes

BIOS configuration:
Choose Operating Mode set to Maximum Performance
SNC set to Enable
DCU Streamer Prefetcher set to Disable
Stale AtoS set to Enable
LLC dead line alloc set to Disable
 Sysinfo program /home/cpu2006-1.2-ic17.0u3/config/sysinfo.rev6993
 Revision 6993 of 2015-11-06 (b5e8d4b4eb51ed28d7f98696cbe290c1)
 running on Proton4S-SUSE12SP2 Thu Aug 17 14:01:07 2017

 This section contains SUT (System Under Test) info as seen by
 some common utilities.  To remove or add to this section, see:
   http://www.spec.org/cpu2006/Docs/config.html#sysinfo

 From /proc/cpuinfo
    model name : Intel(R) Xeon(R) Platinum 8158 CPU @ 3.00GHz
       4 "physical id"s (chips)
       96 "processors"
    cores, siblings (Caution: counting these is hw and system dependent.  The
    following excerpts from /proc/cpuinfo might not be reliable.  Use with
    caution.)
       cpu cores : 12
       siblings  : 24
       physical 0: cores 0 1 2 3 4 8 9 11 17 18 19 20
       physical 1: cores 0 1 2 3 4 9 10 16 18 19 25 26
       physical 2: cores 0 1 2 3 4 8 9 11 17 18 19 20
       physical 3: cores 0 1 2 3 4 9 10 16 18 19 25 26
    cache size : 25344 KB

 From /proc/meminfo
    MemTotal:       1584766204 kB
    HugePages_Total:       0
    Hugepagesize:       2048 kB

 From /etc/*release* /etc/*version*
    SuSE-release:
       SUSE Linux Enterprise Server 12 (x86_64)
       VERSION = 12
       PATCHLEVEL = 2
       # This file is deprecated and will be removed in a future service pack or
       release.
       # Please check /etc/os-release for details about this release.
    os-release:
       NAME="SLES"
       VERSION="12-SP2"
       VERSION_ID="12.2"
       PRETTY_NAME="SUSE Linux Enterprise Server 12 SP2"
       ID="sles"
       ANSI_COLOR="0;32"
       CPE_NAME="cpe:/o:suse:sles:12:sp2"

 uname -a:
    Linux Proton4S-SUSE12SP2 4.4.21-69-default #1 SMP Tue Oct 25 10:58:20 UTC
    2016 (9464f67) x86_64 x86_64 x86_64 GNU/Linux

 run-level 3 Aug 17 13:59

 SPEC is set to: /home/cpu2006-1.2-ic17.0u3
    Filesystem     Type   Size  Used Avail Use% Mounted on
    tmpfs          tmpfs  800G  3.7G  797G   1% /home
 Additional information from dmidecode:

    Warning: Use caution when you interpret this section. The 'dmidecode' program
    reads system data which is "intended to allow hardware to be accurately
    determined", but the intent may not be met, as there are frequent changes to
    hardware, firmware, and the "DMTF SMBIOS" standard.

   BIOS Lenovo -[PSE105I-1.00]- 06/12/2017
   Memory:
    48x NO DIMM NO DIMM
    48x Samsung M393A4K40BB2-CTD 32 GB 2 rank 2666 MHz

 (End of data from sysinfo program)

General Notes

Environment variables set by runspec before the start of the run:
LD_LIBRARY_PATH = "/home/cpu2006-1.2-ic17.0u3/lib/ia32:/home/cpu2006-1.2-ic17.0u3/lib/intel64"
LD_LIBRARY_PATH = "$LD_LIBRARY_PATH:/home/cpu2006-1.2-ic17.0u3/sh10.2"

 Binaries compiled on a system with 1x Intel Core i7-4790 CPU + 32GB RAM
 memory using Redhat Enterprise Linux 7.2
 Transparent Huge Pages enabled by default
 Filesystem page cache cleared with:
 shell invocation of 'sync; echo 3 > /proc/sys/vm/drop_caches' prior to run
 runspec command invoked through numactl i.e.:
 numactl --interleave=all runspec <etc>

Base Compiler Invocation

C benchmarks:

 icc -m64 

C++ benchmarks:

 icpc -m64 

Fortran benchmarks:

 ifort -m64 

Benchmarks using both Fortran and C:

 icc -m64   ifort -m64 

Base Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64   -nofor_main 
436.cactusADM:  -DSPEC_CPU_LP64   -nofor_main 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
447.dealII:  -DSPEC_CPU_LP64 
450.soplex:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64   -nofor_main 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_CASE_FLAG   -DSPEC_CPU_LINUX 
482.sphinx3:  -DSPEC_CPU_LP64 

Base Optimization Flags

C benchmarks:

 -xCORE-AVX2   -ipo   -O3   -no-prec-div   -qopt-prefetch   -auto-p32   -qopt-mem-layout-trans=3 

C++ benchmarks:

 -xCORE-AVX2   -ipo   -O3   -no-prec-div   -qopt-prefetch   -auto-p32   -qopt-mem-layout-trans=3 

Fortran benchmarks:

 -xCORE-AVX2   -ipo   -O3   -no-prec-div   -qopt-prefetch 

Benchmarks using both Fortran and C:

 -xCORE-AVX2   -ipo   -O3   -no-prec-div   -qopt-prefetch   -auto-p32   -qopt-mem-layout-trans=3 

Peak Compiler Invocation

C benchmarks:

 icc -m64 

C++ benchmarks (except as noted below):

 icpc -m64 
450.soplex:  icpc -m32 -L/opt/intel/compilers_and_libraries_2017/linux/lib/ia32 

Fortran benchmarks:

 ifort -m64 

Benchmarks using both Fortran and C:

 icc -m64   ifort -m64 

Peak Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64   -nofor_main 
436.cactusADM:  -DSPEC_CPU_LP64   -nofor_main 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
447.dealII:  -DSPEC_CPU_LP64 
450.soplex:  -D_FILE_OFFSET_BITS=64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64   -nofor_main 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_CASE_FLAG   -DSPEC_CPU_LINUX 
482.sphinx3:  -DSPEC_CPU_LP64 

Peak Optimization Flags

C benchmarks:

433.milc:  basepeak = yes 
470.lbm:  basepeak = yes 
482.sphinx3:  basepeak = yes 

C++ benchmarks:

444.namd:  -prof-gen(pass 1)   -prof-use(pass 2)   -xCORE-AVX2(pass 2)   -par-num-threads=1(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -fno-alias   -auto-ilp32   -qopt-mem-layout-trans=3 
447.dealII:  basepeak = yes 
450.soplex:  -prof-gen(pass 1)   -prof-use(pass 2)   -xCORE-AVX2(pass 2)   -par-num-threads=1(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -qopt-malloc-options=3   -qopt-mem-layout-trans=3 
453.povray:  -prof-gen(pass 1)   -prof-use(pass 2)   -xCORE-AVX2(pass 2)   -par-num-threads=1(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -unroll4   -qopt-mem-layout-trans=3 

Fortran benchmarks:

410.bwaves:  -xCORE-AVX2   -ipo   -O3   -no-prec-div   -qopt-prefetch 
416.gamess:  -prof-gen(pass 1)   -prof-use(pass 2)   -xCORE-AVX2(pass 2)   -par-num-threads=1(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -unroll2   -inline-level=0   -scalar-rep- 
434.zeusmp:  basepeak = yes 
437.leslie3d:  Same as 410.bwaves 
459.GemsFDTD:  Same as 410.bwaves 
465.tonto:  -prof-gen(pass 1)   -prof-use(pass 2)   -xCORE-AVX2(pass 2)   -par-num-threads=1(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -unroll4   -auto   -inline-calloc   -qopt-malloc-options=3 

Benchmarks using both Fortran and C:

435.gromacs:  -prof-gen(pass 1)   -prof-use(pass 2)   -xCORE-AVX2(pass 2)   -par-num-threads=1(pass 1)   -qopt-prefetch   -auto-ilp32   -qopt-mem-layout-trans=3 
436.cactusADM:  basepeak = yes 
454.calculix:  basepeak = yes 
481.wrf:  basepeak = yes 

The flags files that were used to format this result can be browsed at
http://www.spec.org/cpu2006/flags/Intel-ic17.0-official-linux64-revF.html,
http://www.spec.org/cpu2006/flags/Lenovo-Platform-Flags-V1.2-SKL-C.html.

You can also download the XML flags sources by saving the following links:
http://www.spec.org/cpu2006/flags/Intel-ic17.0-official-linux64-revF.xml,
http://www.spec.org/cpu2006/flags/Lenovo-Platform-Flags-V1.2-SKL-C.xml.