SPEC® CFP2006 Result

Copyright 2006-2014 Standard Performance Evaluation Corporation

NEC Corporation

Express5800/A1080a-D (Intel Xeon E7-8850)

CPU2006 license: 9006 Test date: Dec-2011
Test sponsor: NEC Corporation Hardware Availability: Nov-2011
Tested by: NEC Corporation Software Availability: May-2011
Benchmark results graph
Hardware
CPU Name: Intel Xeon E7-8850
CPU Characteristics: Intel Turbo Boost Technology up to 2.40 GHz
CPU MHz: 2000
FPU: Integrated
CPU(s) enabled: 40 cores, 4 chips, 10 cores/chip, 2 threads/core
CPU(s) orderable: 1,2,3,4 chips
Primary Cache: 32 KB I + 32 KB D on chip per core
Secondary Cache: 256 KB I+D on chip per core
L3 Cache: 24 MB I+D on chip per chip
Other Cache: None
Memory: 512 GB (64 x 8 GB 2Rx4 PC3-8500R-7, ECC)
Disk Subsystem: 2x300 GB SAS, 10000 RPM, RAID 0
Other Hardware: None
Software
Operating System: Red Hat Enterprise Linux Server release 6.1,
Kernel 2.6.32-131.0.15.el6.x86_64 on an x86_64
Compiler: C/C++/Fortran: Version 12.0.4.191 of Intel
Compiler XE Build 20110427
Auto Parallel: No
File System: ext4
System State: Run level 3 (multi-user)
Base Pointers: 64-bit
Peak Pointers: 32/64-bit
Other Software: None

Results Table

Benchmark Base Peak
Copies Seconds Ratio Seconds Ratio Seconds Ratio Copies Seconds Ratio Seconds Ratio Seconds Ratio
Results appear in the order in which they were run. Bold underlined text indicates a median measurement.
410.bwaves 80 2277 477 2284 476 2271 479 80 2289 475 2281 477 2276 478
416.gamess 80 2177 719 2178 719 2180 719 80 2127 737 2129 736 2135 734
433.milc 80 1495 491 1494 492 1494 492 80 1503 489 1503 489 1501 489
434.zeusmp 80 1055 690 1054 691 1057 689 80 1055 690 1054 691 1057 689
435.gromacs 80 882 648 881 648 882 647 80 864 661 866 659 865 660
436.cactusADM 80 1338 715 1334 717 1330 719 80 1338 715 1334 717 1330 719
437.leslie3d 80 2180 345 2174 346 2182 345 80 2162 348 2165 347 2161 348
444.namd 80 1047 613 1053 609 1053 609 80 1027 625 1028 624 1026 625
447.dealII 80 823 1110 811 1130 814 1120 80 878 1040 887 1030 882 1040
450.soplex 80 1744 383 1745 382 1741 383 80 1617 413 1618 412 1618 412
453.povray 80 471 903 467 912 469 908 80 381 1120 381 1120 378 1120
454.calculix 80 858 769 857 770 855 772 80 858 769 857 770 855 772
459.GemsFDTD 80 2548 333 2548 333 2556 332 80 2548 333 2548 333 2556 332
465.tonto 80 1047 752 1048 751 1061 742 80 1029 765 1041 756 1032 763
470.lbm 80 3511 313 3512 313 3507 313 80 3487 315 3489 315 3492 315
481.wrf 80 1468 609 1463 611 1465 610 80 1468 609 1463 611 1465 610
482.sphinx3 80 2705 576 2703 577 2714 575 80 2536 615 2529 616 2528 617

Submit Notes

The config file option 'submit' was used.
 numactl was used to bind copies to the cores

Operating System Notes

 'ulimit -s unlimited' was used to set the stack size to unlimited prior to run
 echo 1 > /proc/sys/vm/zone_reclaim_mode
 'mount -t hugetlbfs nodev /mnt/hugepages' was used to enable large pages
 echo 36000 > /proc/sys/vm/nr_hugepages
 export HUGETLB_MORECORE=yes
 export LD_PRELOAD=/usr/lib64/libhugetlbfs.so

Platform Notes

 Patrol Scrubbing set to disabled in Maintenance Console

General Notes

 The Express5800/A1080a-S and the Express5800/A1080a-D models are electronically equivalent.
 The results have been measured on the Express5800/A1080a-S model.
 Binaries were compiled on RHEL 5.6

Base Compiler Invocation

C benchmarks:

 icc -m64 

C++ benchmarks:

 icpc -m64 

Fortran benchmarks:

 ifort -m64 

Benchmarks using both Fortran and C:

 icc -m64   ifort -m64 

Base Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64   -nofor_main 
436.cactusADM:  -DSPEC_CPU_LP64   -nofor_main 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
447.dealII:  -DSPEC_CPU_LP64 
450.soplex:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64   -nofor_main 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_CASE_FLAG   -DSPEC_CPU_LINUX 
482.sphinx3:  -DSPEC_CPU_LP64 

Base Optimization Flags

C benchmarks:

 -xSSE4.2   -ipo   -O3   -no-prec-div   -static   -ansi-alias 

C++ benchmarks:

 -xSSE4.2   -ipo   -O3   -no-prec-div   -static   -ansi-alias 

Fortran benchmarks:

 -xSSE4.2   -ipo   -O3   -no-prec-div   -static 

Benchmarks using both Fortran and C:

 -xSSE4.2   -ipo   -O3   -no-prec-div   -static   -ansi-alias 

Peak Compiler Invocation

C benchmarks (except as noted below):

 icc -m64 
482.sphinx3:  icc -m32 

C++ benchmarks (except as noted below):

 icpc -m64 
450.soplex:  icpc -m32 

Fortran benchmarks:

 ifort -m64 

Benchmarks using both Fortran and C:

 icc -m64   ifort -m64 

Peak Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64   -nofor_main 
436.cactusADM:  -DSPEC_CPU_LP64   -nofor_main 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
447.dealII:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64   -nofor_main 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_CASE_FLAG   -DSPEC_CPU_LINUX 

Peak Optimization Flags

C benchmarks:

433.milc:  -xSSE4.2(pass 2)   -prof-gen(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -prof-use(pass 2)   -static   -auto-ilp32 
470.lbm:  -xSSE4.2(pass 2)   -prof-gen(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -prof-use(pass 2)   -opt-malloc-options=3   -ansi-alias   -opt-prefetch   -static   -auto-ilp32 
482.sphinx3:  -xSSE4.2   -ipo   -O3   -no-prec-div   -unroll2 

C++ benchmarks:

444.namd:  -xSSE4.2(pass 2)   -prof-gen(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -prof-use(pass 2)   -fno-alias   -auto-ilp32 
447.dealII:  -xSSE4.2(pass 2)   -prof-gen(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -prof-use(pass 2)   -static   -auto-ilp32 
450.soplex:  -xSSE4.2(pass 2)   -prof-gen(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -prof-use(pass 2)   -opt-malloc-options=3   -B /usr/share/libhugetlbfs/ -Wl,-hugetlbfs-link=BDT 
453.povray:  -xSSE4.2(pass 2)   -prof-gen(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -prof-use(pass 2)   -unroll4   -ansi-alias   -B /usr/share/libhugetlbfs/ -Wl,-melf_x86_64 -Wl,-hugetlbfs-link=BDT 

Fortran benchmarks:

410.bwaves:  -xSSE4.2(pass 2)   -prof-gen(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -prof-use(pass 2)   -static 
416.gamess:  -xSSE4.2(pass 2)   -prof-gen(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -prof-use(pass 2)   -unroll2   -inline-level=0   -scalar-rep-   -static 
434.zeusmp:  basepeak = yes 
437.leslie3d:  -xSSE4.2   -ipo   -O3   -no-prec-div   -B /usr/share/libhugetlbfs/ -Wl,-melf_x86_64 -Wl,-hugetlbfs-link=BDT 
459.GemsFDTD:  basepeak = yes 
465.tonto:  -xSSE4.2(pass 2)   -prof-gen(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -prof-use(pass 2)   -unroll4   -auto   -inline-calloc   -opt-malloc-options=3   -B /usr/share/libhugetlbfs/ -Wl,-melf_x86_64 -Wl,-hugetlbfs-link=BDT 

Benchmarks using both Fortran and C:

435.gromacs:  -xSSE4.2(pass 2)   -prof-gen(pass 1)   -ipo(pass 2)   -O3(pass 2)   -no-prec-div(pass 2)   -prof-use(pass 2)   -opt-prefetch   -static   -auto-ilp32 
436.cactusADM:  basepeak = yes 
454.calculix:  basepeak = yes 
481.wrf:  basepeak = yes 

The flags files that were used to format this result can be browsed at
http://www.spec.org/cpu2006/flags/Intel-ic12.0-linux64-revB.20110705.html,
http://www.spec.org/cpu2006/flags/NEC-platform-linux64-revC.20111206.html.

You can also download the XML flags sources by saving the following links:
http://www.spec.org/cpu2006/flags/Intel-ic12.0-linux64-revB.20110705.xml,
http://www.spec.org/cpu2006/flags/NEC-platform-linux64-revC.20111206.xml.