SPEC® CFP2006 Result

Copyright 2006-2014 Standard Performance Evaluation Corporation

NEC Corporation

Express5800/A1080a-D (Intel Xeon E7-8870)

CPU2006 license: 9006
Test sponsor: NEC Corporation
Tested by: NEC Corporation
Test date: Nov-2011
Hardware Availability: Nov-2011
Software Availability: May-2011
Benchmark results graph
Hardware
CPU Name: Intel Xeon E7-8870
CPU Characteristics: Intel Turbo Boost Technology up to 2.80 GHz
CPU MHz: 2400
FPU: Integrated
CPU(s) enabled: 20 cores, 2 chips, 10 cores/chip, 2 threads/core
CPU(s) orderable: 1,2,3,4 chips
Primary Cache: 32 KB I + 32 KB D on chip per core
Secondary Cache: 256 KB I+D on chip per core
L3 Cache: 30 MB I+D on chip per chip
Other Cache: None
Memory: 128 GB (32 x 4 GB 2Rx4 PC3-8500R-7, ECC)
Disk Subsystem: 2x300 GB SAS, 10000 RPM, RAID 0
Other Hardware: None
Software
Operating System: Red Hat Enterprise Linux Server release 6.1, Kernel 2.6.32-131.0.15.el6.x86_64 on an x86_64
Compiler: C/C++/Fortran: Version 12.0.4.191 of Intel Compiler XE Build 20110427
Auto Parallel: No
File System: ext4
System State: Run level 3 (multi-user)
Base Pointers: 64-bit
Peak Pointers: 32/64-bit
Other Software: None

Results Table

Results appear in the order in which they were run. Bold underlined text indicates a median measurement.

              Base                                               Peak
Benchmark     Copies Seconds Ratio Seconds Ratio Seconds Ratio   Copies Seconds Ratio Seconds Ratio Seconds Ratio
410.bwaves        40    2177   250    2507   217    2444   222       40    1942   280    2067   263    2419   225
416.gamess        40    1918   408    1865   420    1912   410       40    1818   431    1820   430    1855   422
433.milc          40    1303   282    1334   275    1301   282       40    1308   281    1307   281    1308   281
434.zeusmp        40     896   406     895   407     895   407       40     896   406     895   407     895   407
435.gromacs       40     764   374     762   375     763   374       40     746   383     746   383     746   383
436.cactusADM     40    1162   411    1195   400    1185   403       40    1162   411    1195   400    1185   403
437.leslie3d      40    1849   203    1850   203    1850   203       40    1860   202    1851   203    1848   203
444.namd          40     911   352     921   348     919   349       40     902   356     901   356     895   358
447.dealII        40     700   654     705   649     702   652       40     761   601     762   601     762   601
450.soplex        40    1559   214    1608   207    1538   217       40    1381   242    1396   239    1380   242
453.povray        40     399   534     399   533     397   536       40     327   651     326   654     327   650
454.calculix      40     720   458     721   458     722   457       40     720   458     721   458     722   457
459.GemsFDTD      40    3442   123    2460   173    2927   145       40    3442   123    2460   173    2927   145
465.tonto         40     852   462     852   462     852   462       40     841   468     840   469     843   467
470.lbm           40    2961   186    2952   186    2956   186       40    2937   187    2937   187    2936   187
481.wrf           40    1258   355    1253   356    1252   357       40    1258   355    1253   356    1252   357
482.sphinx3       40    2245   347    2247   347    2245   347       40    2093   373    2092   373    2090   373
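
The median markers (bold underline) did not survive in this plain-text copy, but the medians can be recomputed from the three ratios per benchmark. The sketch below assumes the usual SPEC CPU2006 rate convention: the reported ratio for a benchmark is the median of its three runs, and the overall metric is the geometric mean of those medians. The names `RATIOS`, `median_ratios`, and `geometric_mean` are illustrative. Because the table shows rounded ratios, the result may differ slightly from the officially published score.

```python
import math
from statistics import median

# Ratios copied from the results table: (base runs 1-3), (peak runs 1-3).
RATIOS = {
    "410.bwaves":    ((250, 217, 222), (280, 263, 225)),
    "416.gamess":    ((408, 420, 410), (431, 430, 422)),
    "433.milc":      ((282, 275, 282), (281, 281, 281)),
    "434.zeusmp":    ((406, 407, 407), (406, 407, 407)),
    "435.gromacs":   ((374, 375, 374), (383, 383, 383)),
    "436.cactusADM": ((411, 400, 403), (411, 400, 403)),
    "437.leslie3d":  ((203, 203, 203), (202, 203, 203)),
    "444.namd":      ((352, 348, 349), (356, 356, 358)),
    "447.dealII":    ((654, 649, 652), (601, 601, 601)),
    "450.soplex":    ((214, 207, 217), (242, 239, 242)),
    "453.povray":    ((534, 533, 536), (651, 654, 650)),
    "454.calculix":  ((458, 458, 457), (458, 458, 457)),
    "459.GemsFDTD":  ((123, 173, 145), (123, 173, 145)),
    "465.tonto":     ((462, 462, 462), (468, 469, 467)),
    "470.lbm":       ((186, 186, 186), (187, 187, 187)),
    "481.wrf":       ((355, 356, 357), (355, 356, 357)),
    "482.sphinx3":   ((347, 347, 347), (373, 373, 373)),
}


def median_ratios(which):
    """Return {benchmark: median ratio} for 'base' or 'peak'."""
    idx = {"base": 0, "peak": 1}[which]
    return {name: median(runs[idx]) for name, runs in RATIOS.items()}


def geometric_mean(values):
    """Geometric mean computed via logarithms."""
    values = list(values)
    return math.exp(sum(math.log(v) for v in values) / len(values))


base = median_ratios("base")
peak = median_ratios("peak")
print(f"base geometric mean of medians: {geometric_mean(base.values()):.1f}")
print(f"peak geometric mean of medians: {geometric_mean(peak.values()):.1f}")
```

The geometric mean is used rather than the arithmetic mean so that no single benchmark with a large ratio dominates the overall figure.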

Submit Notes

The config file option 'submit' was used.
 numactl was used to bind copies to the cores
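
The exact submit line is not reproduced in this report. As a rough illustration only, the sketch below builds one `numactl` command per copy, assuming that copy N is pinned to logical CPU N with `--physcpubind` and that memory is allocated on the local node with `--localalloc`. The helper `submit_command` and the benchmark command `./benchmark_exe input.dat` are hypothetical.

```python
import shlex


def submit_command(copy_num, command):
    """Return a shell line that pins one benchmark copy to one logical CPU.

    Assumption: copy N runs on logical CPU N. The real mapping used for
    this result is not given in the report.
    """
    return (
        f"numactl --localalloc --physcpubind={copy_num} "
        f"{shlex.join(command)}"
    )


# 20 cores x 2 threads/core = 40 logical CPUs, matching the 40 copies run.
COPIES = 20 * 2
lines = [
    submit_command(n, ["./benchmark_exe", "input.dat"])  # hypothetical command
    for n in range(COPIES)
]
print(lines[0])
print(lines[-1])
```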

Operating System Notes

 'ulimit -s unlimited' was used to set the stack size to unlimited prior to run
 echo 1 > /proc/sys/vm/zone_reclaim_mode
 'mount -t hugetlbfs nodev /mnt/hugepages' was used to enable large pages
 echo 18000 > /proc/sys/vm/nr_hugepages
 export HUGETLB_MORECORE=yes
 export LD_PRELOAD=/usr/lib64/libhugetlbfs.so
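
For scale, the `nr_hugepages` setting above can be converted into a memory reservation. The sketch assumes a huge page size of 2 MiB, the common x86_64 default, which the report does not state; on a live system the `Hugepagesize` line in /proc/meminfo gives the actual value. The function name `hugepage_pool_gib` is illustrative.

```python
# Assumed huge page size: 2 MiB (not stated in the report).
HUGEPAGE_BYTES = 2 * 1024 * 1024


def hugepage_pool_gib(nr_hugepages, page_bytes=HUGEPAGE_BYTES):
    """Size of the reserved huge page pool in GiB."""
    return nr_hugepages * page_bytes / 1024**3


pool = hugepage_pool_gib(18000)
print(f"{pool:.2f} GiB reserved for huge pages")  # prints 35.16 GiB under the 2 MiB assumption
```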

Platform Notes

 Patrol Scrubbing set to disabled in Maintenance Console

General Notes

 The Express5800/A1080a-S and the Express5800/A1080a-D models are electronically equivalent.
 The results have been measured on the Express5800/A1080a-S model.
 Binaries were compiled on RHEL 5.6

Base Compiler Invocation

C benchmarks:

 icc -m64 

C++ benchmarks:

 icpc -m64 

Fortran benchmarks:

 ifort -m64 

Benchmarks using both Fortran and C:

 icc -m64   ifort -m64 

Base Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64   -nofor_main 
436.cactusADM:  -DSPEC_CPU_LP64   -nofor_main 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
447.dealII:  -DSPEC_CPU_LP64 
450.soplex:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64   -nofor_main 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_CASE_FLAG   -DSPEC_CPU_LINUX 
482.sphinx3:  -DSPEC_CPU_LP64 

Base Optimization Flags

C benchmarks:

 -xSSE4.2   -ipo   -O3   -no-prec-div   -static   -ansi-alias 

C++ benchmarks:

 -xSSE4.2   -ipo   -O3   -no-prec-div   -static   -ansi-alias 

Fortran benchmarks:

 -xSSE4.2   -ipo   -O3   -no-prec-div   -static 

Benchmarks using both Fortran and C:

 -xSSE4.2   -ipo   -O3   -no-prec-div   -static   -ansi-alias 

Peak Compiler Invocation

C benchmarks (except as noted below):

 icc -m64 
482.sphinx3:  icc -m32 

C++ benchmarks (except as noted below):

 icpc -m64 
450.soplex:  icpc -m32 

Fortran benchmarks:

 ifort -m64 

Benchmarks using both Fortran and C:

 icc -m64   ifort -m64 

Peak Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64   -nofor_main 
436.cactusADM:  -DSPEC_CPU_LP64   -nofor_main 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
447.dealII:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64   -nofor_main 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_CASE_FLAG   -DSPEC_CPU_LINUX 

Peak Optimization Flags

C benchmarks:

433.milc:  -xSSE4.2(pass 2)   -prof-gen(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -prof-use(pass 2)   -static   -auto-ilp32 
470.lbm:  -xSSE4.2(pass 2)   -prof-gen(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -prof-use(pass 2)   -opt-malloc-options=3   -ansi-alias   -opt-prefetch   -static   -auto-ilp32 
482.sphinx3:  -xSSE4.2   -ipo   -O3   -no-prec-div   -unroll2 

C++ benchmarks:

444.namd:  -xSSE4.2(pass 2)   -prof-gen(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -prof-use(pass 2)   -fno-alias   -auto-ilp32 
447.dealII:  -xSSE4.2(pass 2)   -prof-gen(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -prof-use(pass 2)   -static   -auto-ilp32 
450.soplex:  -xSSE4.2(pass 2)   -prof-gen(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -prof-use(pass 2)   -opt-malloc-options=3   -B /usr/share/libhugetlbfs/ -Wl,-hugetlbfs-link=BDT 
453.povray:  -xSSE4.2(pass 2)   -prof-gen(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -prof-use(pass 2)   -unroll4   -ansi-alias   -B /usr/share/libhugetlbfs/ -Wl,-melf_x86_64 -Wl,-hugetlbfs-link=BDT 

Fortran benchmarks:

410.bwaves:  -xSSE4.2(pass 2)   -prof-gen(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -prof-use(pass 2)   -static 
416.gamess:  -xSSE4.2(pass 2)   -prof-gen(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -prof-use(pass 2)   -unroll2   -inline-level=0   -scalar-rep-   -static 
434.zeusmp:  basepeak = yes 
437.leslie3d:  -xSSE4.2   -ipo   -O3   -no-prec-div   -B /usr/share/libhugetlbfs/ -Wl,-melf_x86_64 -Wl,-hugetlbfs-link=BDT 
459.GemsFDTD:  basepeak = yes 
465.tonto:  -xSSE4.2(pass 2)   -prof-gen(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -prof-use(pass 2)   -unroll4   -auto   -inline-calloc   -opt-malloc-options=3   -B /usr/share/libhugetlbfs/ -Wl,-melf_x86_64 -Wl,-hugetlbfs-link=BDT 

Benchmarks using both Fortran and C:

435.gromacs:  -xSSE4.2(pass 2)   -prof-gen(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -prof-use(pass 2)   -opt-prefetch   -static   -auto-ilp32 
436.cactusADM:  basepeak = yes 
454.calculix:  basepeak = yes 
481.wrf:  basepeak = yes 
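
The `(pass N)` annotations describe a two-pass profile-guided build: pass 1 compiles with `-prof-gen`, the instrumented binary is run to collect a profile, and pass 2 recompiles with `-prof-use`. The sketch below splits one annotated flag string into per-pass flag lists, assuming that flags without an annotation apply to both passes. `split_passes` is an illustrative helper, not part of the SPEC tools.

```python
import re

# A token is a run of non-space characters, optionally followed by "(pass N)".
TOKEN_RE = re.compile(r"(\S+?)(?:\(pass (\d)\))?(?=\s|$)")


def split_passes(annotated):
    """Map pass number -> list of flags used in that pass.

    Assumption: a flag with no "(pass N)" suffix is used in both passes.
    """
    passes = {1: [], 2: []}
    for flag, pass_no in TOKEN_RE.findall(annotated):
        targets = [int(pass_no)] if pass_no else [1, 2]
        for n in targets:
            passes.setdefault(n, []).append(flag)
    return passes


# Peak flags for 433.milc, copied from the listing above.
MILC_PEAK = (
    "-xSSE4.2(pass 2)   -prof-gen(pass 1)   -ipo(pass 2)   -O3(pass 2)   "
    "-no-prec-div(pass 2)   -prof-use(pass 2)   -static   -auto-ilp32"
)

passes = split_passes(MILC_PEAK)
print("pass 1:", " ".join(passes[1]))
print("pass 2:", " ".join(passes[2]))
```

Benchmarks marked `basepeak = yes` skip this process: their base results are reported as peak, which is why their base and peak columns in the results table are identical.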

The flags files that were used to format this result can be browsed at
http://www.spec.org/cpu2006/flags/Intel-ic12.0-linux64-revB.20110705.html,
http://www.spec.org/cpu2006/flags/NEC-platform-linux64-revC.html.

You can also download the XML flags sources by saving the following links:
http://www.spec.org/cpu2006/flags/Intel-ic12.0-linux64-revB.20110705.xml,
http://www.spec.org/cpu2006/flags/NEC-platform-linux64-revC.xml.