SPEC® CFP2006 Result

Copyright 2006-2014 Standard Performance Evaluation Corporation

NEC Corporation

Express5800/A1080a-S (Intel Xeon E7-8870)

CPU2006 license: 9006
Test sponsor: NEC Corporation
Tested by: NEC Corporation
Test date: Dec-2011
Hardware Availability: Nov-2011
Software Availability: May-2011
Benchmark results graph
Hardware
CPU Name: Intel Xeon E7-8870
CPU Characteristics: Intel Turbo Boost Technology up to 2.80 GHz
CPU MHz: 2400
FPU: Integrated
CPU(s) enabled: 40 cores, 4 chips, 10 cores/chip, 2 threads/core
CPU(s) orderable: 1,2,3,4 chips
Primary Cache: 32 KB I + 32 KB D on chip per core
Secondary Cache: 256 KB I+D on chip per core
L3 Cache: 30 MB I+D on chip per chip
Other Cache: None
Memory: 512 GB (64 x 8 GB 2Rx4 PC3-8500R-7, ECC)
Disk Subsystem: 2x300 GB SAS, 10000 RPM, RAID 0
Other Hardware: None
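
The hardware figures above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes one benchmark copy per hardware thread (which matches the 80 copies listed in the results table), and the function names are hypothetical, not part of any SPEC tooling.

```python
# Values taken from the Hardware section of this report.
CHIPS = 4
CORES_PER_CHIP = 10
THREADS_PER_CORE = 2
DIMM_COUNT = 64
DIMM_GB = 8


def hardware_threads() -> int:
    """Logical CPUs available with all chips, cores, and threads enabled."""
    return CHIPS * CORES_PER_CHIP * THREADS_PER_CORE


def total_memory_gb() -> int:
    """Installed memory implied by the DIMM configuration."""
    return DIMM_COUNT * DIMM_GB


def memory_per_copy_gb() -> float:
    """Memory share per copy, assuming one copy per hardware thread."""
    return total_memory_gb() / hardware_threads()


if __name__ == "__main__":
    print(hardware_threads(), total_memory_gb(), memory_per_copy_gb())
    # prints: 80 512 6.4
```

Under that assumption each copy has roughly 6.4 GB of memory available on average.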
Software
Operating System: Red Hat Enterprise Linux Server release 6.1, Kernel 2.6.32-131.0.15.el6.x86_64 on an x86_64
Compiler: C/C++/Fortran: Version 12.0.4.191 of Intel Compiler XE Build 20110427
Auto Parallel: No
File System: ext4
System State: Run level 3 (multi-user)
Base Pointers: 64-bit
Peak Pointers: 32/64-bit
Other Software: None

Results Table

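Results appear in the order in which they were run. Bold underlined text indicates a median measurement.
Each row lists the benchmark, then the Base results (copies, followed by seconds and ratio for each of three runs), then the Peak results in the same layout.
Benchmark | Base: Copies Seconds Ratio Seconds Ratio Seconds Ratio | Peak: Copies Seconds Ratio Seconds Ratio Seconds Ratio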
410.bwaves 80 1908 570 1915 568 1898 573 80 1915 568 1916 567 1898 573
416.gamess 80 1864 840 1867 839 1908 821 80 1852 846 1821 860 1822 860
433.milc 80 1285 572 1286 571 1285 572 80 1293 568 1291 569 1293 568
434.zeusmp 80 904 806 904 806 905 804 80 904 806 904 806 905 804
435.gromacs 80 763 748 762 750 763 749 80 748 764 746 766 746 766
436.cactusADM 80 1187 805 1182 808 1177 812 80 1187 805 1182 808 1177 812
437.leslie3d 80 1846 407 1846 407 1848 407 80 1843 408 1846 407 1854 406
444.namd 80 908 707 912 703 906 708 80 888 722 891 720 888 723
447.dealII 80 707 1300 707 1290 710 1290 80 766 1200 760 1200 760 1200
450.soplex 80 1433 465 1431 466 1430 466 80 1330 502 1330 502 1329 502
453.povray 80 397 1070 402 1060 401 1060 80 328 1300 325 1310 325 1310
454.calculix 80 744 887 743 889 742 889 80 744 887 743 889 742 889
459.GemsFDTD 80 2197 386 2199 386 2194 387 80 2197 386 2199 386 2194 387
465.tonto 80 855 921 865 910 863 912 80 849 927 845 931 843 934
470.lbm 80 2979 369 2980 369 2979 369 80 2941 374 2945 373 2936 374
481.wrf 80 1248 716 1249 716 1252 714 80 1248 716 1249 716 1252 714
482.sphinx3 80 2230 699 2230 699 2233 698 80 2084 748 2085 748 2083 749
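
SPEC CPU2006 summarizes a run as the geometric mean of the per-benchmark median ratios. The sketch below recomputes that summary from the ratios transcribed from the table above. It is an approximation under stated assumptions: the table ratios are already rounded, the median is taken over the printed ratios rather than the underlying run times, and the names `RATIOS`, `median_ratios`, and `overall` are hypothetical. The official SPECfp_rate values are not reproduced in this text, so no expected result is quoted.

```python
from statistics import geometric_mean, median

# Ratios transcribed from the results table: (base runs, peak runs).
RATIOS = {
    "410.bwaves": ((570, 568, 573), (568, 567, 573)),
    "416.gamess": ((840, 839, 821), (846, 860, 860)),
    "433.milc": ((572, 571, 572), (568, 569, 568)),
    "434.zeusmp": ((806, 806, 804), (806, 806, 804)),
    "435.gromacs": ((748, 750, 749), (764, 766, 766)),
    "436.cactusADM": ((805, 808, 812), (805, 808, 812)),
    "437.leslie3d": ((407, 407, 407), (408, 407, 406)),
    "444.namd": ((707, 703, 708), (722, 720, 723)),
    "447.dealII": ((1300, 1290, 1290), (1200, 1200, 1200)),
    "450.soplex": ((465, 466, 466), (502, 502, 502)),
    "453.povray": ((1070, 1060, 1060), (1300, 1310, 1310)),
    "454.calculix": ((887, 889, 889), (887, 889, 889)),
    "459.GemsFDTD": ((386, 386, 387), (386, 386, 387)),
    "465.tonto": ((921, 910, 912), (927, 931, 934)),
    "470.lbm": ((369, 369, 369), (374, 373, 374)),
    "481.wrf": ((716, 716, 714), (716, 716, 714)),
    "482.sphinx3": ((699, 699, 698), (748, 748, 749)),
}

BASE, PEAK = 0, 1


def median_ratios(column: int) -> dict:
    """Median of the three run ratios for each benchmark (0 = base, 1 = peak)."""
    return {name: median(runs[column]) for name, runs in RATIOS.items()}


def overall(column: int) -> float:
    """Geometric mean of the per-benchmark median ratios."""
    return geometric_mean(list(median_ratios(column).values()))


if __name__ == "__main__":
    print(f"approx. base geometric mean: {overall(BASE):.1f}")
    print(f"approx. peak geometric mean: {overall(PEAK):.1f}")
```

Because the inputs are rounded, the output may differ slightly from the officially published metrics.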

Submit Notes

The config file option 'submit' was used.
 numactl was used to bind copies to the cores.
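
The exact submit line is not reproduced in this report. As a rough illustration of what binding each copy to a core with numactl can look like, the sketch below builds one command line per copy. The copy-to-CPU mapping (copy N on logical CPU N), the use of `--localalloc`, and the placeholder benchmark command are all assumptions, not the tester's actual configuration.

```python
def submit_command(copy_number: int, command: str) -> str:
    """Wrap a benchmark command so it runs pinned to one logical CPU.

    Assumption: copy N is pinned to logical CPU N and allocates memory
    on its local NUMA node. The real config file may differ.
    """
    return f"numactl --localalloc --physcpubind={copy_number} {command}"


def all_submit_commands(copies: int, command: str) -> list:
    """One pinned command line per copy, numbered from 0."""
    return [submit_command(n, command) for n in range(copies)]


if __name__ == "__main__":
    # "./benchmark_binary" is a placeholder, not a real SPEC executable name.
    for line in all_submit_commands(80, "./benchmark_binary")[:3]:
        print(line)
```

`--physcpubind` and `--localalloc` are standard numactl options; which logical CPUs share a core or chip depends on the system's CPU numbering, so a real mapping would need to be checked against the machine.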

Operating System Notes

 'ulimit -s unlimited' was used to set the stack size to unlimited prior to the run
 echo 1 > /proc/sys/vm/zone_reclaim_mode
 'mount -t hugetlbfs nodev /mnt/hugepages' was used to enable large pages
 echo 36000 > /proc/sys/vm/nr_hugepages
 export HUGETLB_MORECORE=yes
 export LD_PRELOAD=/usr/lib64/libhugetlbfs.so
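
To put the `nr_hugepages` setting in perspective, the sketch below converts the page count into a pool size. The 2 MiB page size is an assumption (it is the usual default huge page size on x86_64 Linux, but the report does not state it), and the function name is hypothetical.

```python
NR_HUGEPAGES = 36000          # value written to /proc/sys/vm/nr_hugepages
ASSUMED_HUGEPAGE_KIB = 2048   # assumption: 2 MiB huge pages


def hugepage_pool_gib(pages: int, page_kib: int = ASSUMED_HUGEPAGE_KIB) -> float:
    """Size of the reserved huge page pool in GiB."""
    return pages * page_kib / 1024 / 1024


if __name__ == "__main__":
    print(hugepage_pool_gib(NR_HUGEPAGES))
    # prints: 70.3125
```

Under that assumption the reserved pool is about 70 GiB, which programs linked against or preloaded with libhugetlbfs can draw on.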

Platform Notes

 Patrol Scrubbing was set to disabled in the Maintenance Console

General Notes

 The Express5800/A1080a-S and the Express5800/A1080a-D models are electronically equivalent.
 The results have been measured on the Express5800/A1080a-S model.
 Binaries were compiled on RHEL 5.6

Base Compiler Invocation

C benchmarks:

 icc -m64 

C++ benchmarks:

 icpc -m64 

Fortran benchmarks:

 ifort -m64 

Benchmarks using both Fortran and C:

 icc -m64   ifort -m64 

Base Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64   -nofor_main 
436.cactusADM:  -DSPEC_CPU_LP64   -nofor_main 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
447.dealII:  -DSPEC_CPU_LP64 
450.soplex:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64   -nofor_main 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_CASE_FLAG   -DSPEC_CPU_LINUX 
482.sphinx3:  -DSPEC_CPU_LP64 

Base Optimization Flags

C benchmarks:

 -xSSE4.2   -ipo   -O3   -no-prec-div   -static   -ansi-alias 

C++ benchmarks:

 -xSSE4.2   -ipo   -O3   -no-prec-div   -static   -ansi-alias 

Fortran benchmarks:

 -xSSE4.2   -ipo   -O3   -no-prec-div   -static 

Benchmarks using both Fortran and C:

 -xSSE4.2   -ipo   -O3   -no-prec-div   -static   -ansi-alias 

Peak Compiler Invocation

C benchmarks (except as noted below):

 icc -m64 
482.sphinx3:  icc -m32 

C++ benchmarks (except as noted below):

 icpc -m64 
450.soplex:  icpc -m32 

Fortran benchmarks:

 ifort -m64 

Benchmarks using both Fortran and C:

 icc -m64   ifort -m64 

Peak Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64   -nofor_main 
436.cactusADM:  -DSPEC_CPU_LP64   -nofor_main 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
447.dealII:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64   -nofor_main 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_CASE_FLAG   -DSPEC_CPU_LINUX 

Peak Optimization Flags

C benchmarks:

433.milc:  -xSSE4.2(pass 2)   -prof-gen(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -prof-use(pass 2)   -static   -auto-ilp32 
470.lbm:  -xSSE4.2(pass 2)   -prof-gen(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -prof-use(pass 2)   -opt-malloc-options=3   -ansi-alias   -opt-prefetch   -static   -auto-ilp32 
482.sphinx3:  -xSSE4.2   -ipo   -O3   -no-prec-div   -unroll2 

C++ benchmarks:

444.namd:  -xSSE4.2(pass 2)   -prof-gen(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -prof-use(pass 2)   -fno-alias   -auto-ilp32 
447.dealII:  -xSSE4.2(pass 2)   -prof-gen(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -prof-use(pass 2)   -static   -auto-ilp32 
450.soplex:  -xSSE4.2(pass 2)   -prof-gen(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -prof-use(pass 2)   -opt-malloc-options=3   -B /usr/share/libhugetlbfs/ -Wl,-hugetlbfs-link=BDT 
453.povray:  -xSSE4.2(pass 2)   -prof-gen(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -prof-use(pass 2)   -unroll4   -ansi-alias   -B /usr/share/libhugetlbfs/ -Wl,-melf_x86_64 -Wl,-hugetlbfs-link=BDT 

Fortran benchmarks:

410.bwaves:  -xSSE4.2(pass 2)   -prof-gen(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -prof-use(pass 2)   -static 
416.gamess:  -xSSE4.2(pass 2)   -prof-gen(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -prof-use(pass 2)   -unroll2   -inline-level=0   -scalar-rep-   -static 
434.zeusmp:  basepeak = yes 
437.leslie3d:  -xSSE4.2   -ipo   -O3   -no-prec-div   -B /usr/share/libhugetlbfs/ -Wl,-melf_x86_64 -Wl,-hugetlbfs-link=BDT 
459.GemsFDTD:  basepeak = yes 
465.tonto:  -xSSE4.2(pass 2)   -prof-gen(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -prof-use(pass 2)   -unroll4   -auto   -inline-calloc   -opt-malloc-options=3   -B /usr/share/libhugetlbfs/ -Wl,-melf_x86_64 -Wl,-hugetlbfs-link=BDT 

Benchmarks using both Fortran and C:

435.gromacs:  -xSSE4.2(pass 2)   -prof-gen(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -prof-use(pass 2)   -opt-prefetch   -static   -auto-ilp32 
436.cactusADM:  basepeak = yes 
454.calculix:  basepeak = yes 
481.wrf:  basepeak = yes 
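
Many of the peak flag lists above describe a two-pass, profile-guided build: flags marked "(pass 1)" apply to the instrumented build, flags marked "(pass 2)" apply to the final build that consumes the profile, and unmarked flags are read here as applying to both passes. The sketch below separates one such list into per-pass flag lists. The helper name `split_passes` is hypothetical, and the handling of unmarked flags is an assumption about how the annotations should be interpreted.

```python
import re


def split_passes(flag_line: str):
    """Split an annotated flag list into (pass 1 flags, pass 2 flags).

    Flags without a "(pass N)" annotation are assumed to apply to both passes.
    """
    # Turn "-ipo(pass 2)" into "-ipo@2" so the line can be split on whitespace.
    normalized = re.sub(r"\(pass (\d)\)", r"@\1", flag_line)
    pass1, pass2 = [], []
    for token in normalized.split():
        flag, _, pass_number = token.partition("@")
        if pass_number == "1":
            pass1.append(flag)
        elif pass_number == "2":
            pass2.append(flag)
        else:
            pass1.append(flag)
            pass2.append(flag)
    return pass1, pass2


if __name__ == "__main__":
    # Flag list for 433.milc, copied from the peak optimization flags.
    milc = ("-xSSE4.2(pass 2)   -prof-gen(pass 1)   -ipo(pass 2)   -O3(pass 2)   "
            "-no-prec-div(pass 2)   -prof-use(pass 2)   -static   -auto-ilp32")
    first, second = split_passes(milc)
    print("pass 1:", " ".join(first))
    print("pass 2:", " ".join(second))
```

For 433.milc this yields `-prof-gen -static -auto-ilp32` for the first pass and `-xSSE4.2 -ipo -O3 -no-prec-div -prof-use -static -auto-ilp32` for the second. Between the two passes the instrumented binary is run on a training workload to produce the profile.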

The flags files that were used to format this result can be browsed at
http://www.spec.org/cpu2006/flags/Intel-ic12.0-linux64-revB.20110705.html,
http://www.spec.org/cpu2006/flags/NEC-platform-linux64-revC.20111206.html.

You can also download the XML flags sources by saving the following links:
http://www.spec.org/cpu2006/flags/Intel-ic12.0-linux64-revB.20110705.xml,
http://www.spec.org/cpu2006/flags/NEC-platform-linux64-revC.20111206.xml.