SPEC® CFP2006 Result

Copyright 2006-2014 Standard Performance Evaluation Corporation

Dell Inc.

PowerEdge R415 (AMD Opteron 4164 EE, 1.80 GHz)

CPU2006 license: 55 Test date: Jul-2010
Test sponsor: Dell Inc. Hardware Availability: Jul-2010
Tested by: Dell Inc. Software Availability: Jul-2010
Benchmark results graph
Hardware
CPU Name: AMD Opteron 4164 EE
CPU Characteristics:
CPU MHz: 1800
FPU: Integrated
CPU(s) enabled: 12 cores, 2 chips, 6 cores/chip
CPU(s) orderable: 1,2 chips
Primary Cache: 64 KB I + 64 KB D on chip per core
Secondary Cache: 512 KB I+D on chip per core
L3 Cache: 6 MB I+D on chip per chip
Other Cache: None
Memory: 32 GB (8 x 4 GB 2Rx4 PC3-10600R-9, ECC)
Disk Subsystem: 1 x 146 GB 10000 RPM SAS
Other Hardware: None
Software
Operating System: SUSE Linux Enterprise Server 11 (x86_64)
Kernel 2.6.27.19-5-default
Compiler: x86 Open64 4.2.4 Compiler Suite (from AMD)
Auto Parallel: Yes
File System: ext3
System State: Run level 3 (Full multiuser with network)
Base Pointers: 64-bit
Peak Pointers: 32/64-bit
Other Software: None

Results Table

Benchmark Base Peak
Copies Seconds Ratio Seconds Ratio Seconds Ratio Copies Seconds Ratio Seconds Ratio Seconds Ratio
Results appear in the order in which they were run. Bold underlined text indicates a median measurement.
410.bwaves 12 1347 121   1357 120   1351 121   12 1335 122   1334 122   1333 122  
416.gamess 12 1653 142   1659 142   1654 142   12 1528 154   1541 153   1529 154  
433.milc 12 1252 88.0 1240 88.9 1240 88.8 12 1122 98.2 1123 98.1 1123 98.1
434.zeusmp 12 857 127   859 127   860 127   12 843 129   842 130   841 130  
435.gromacs 12 749 114   748 114   754 114   12 582 147   589 145   585 147  
436.cactusADM 12 988 145   976 147   984 146   2 123 195   118 202   126 189  
437.leslie3d 12 1270 88.8 1268 88.9 1272 88.7 12 1268 89.0 1269 88.9 1269 88.9
444.namd 12 851 113   851 113   850 113   12 794 121   794 121   793 121  
447.dealII 12 706 194   700 196   710 193   12 641 214   635 216   637 216  
450.soplex 12 1233 81.2 1176 85.1 1176 85.1 12 1094 91.4 1037 96.5 1034 96.8
453.povray 12 401 159   401 159   399 160   12 355 180   355 180   357 179  
454.calculix 12 590 168   593 167   592 167   12 557 178   559 177   559 177  
459.GemsFDTD 12 1563 81.4 1558 81.7 1567 81.2 12 1476 86.2 1475 86.3 1471 86.5
465.tonto 12 814 145   814 145   818 144   12 742 159   741 159   739 160  
470.lbm 12 981 168   1013 163   981 168   12 981 168   1013 163   981 168  
481.wrf 12 965 139   961 140   969 138   12 927 145   928 144   926 145  
482.sphinx3 12 1442 162   1445 162   1439 162   12 1423 164   1418 165   1421 165  

Submit Notes

The config file option 'submit' was used.
'numactl' was used to bind copies to the cores.
See the configuration file for details.

Operating System Notes

'ulimit -s unlimited' was used to set environment stack size
'ulimit -l 2097152'  was used to set environment locked pages in memory limit

Set vm/nr_hugepages=5400 in /etc/sysctl.conf
mount -t hugetlbfs nodev /mnt/hugepages

General Notes

Environment variables set by runspec before the start of the run:
HUGETLB_LIMIT = "450"
LD_LIBRARY_PATH = "/cpu2006/amd1002-rate-libs-revC/64:/cpu2006/amd1002-rate-libs-revC/32"
OMP_NUM_THREADS = "6"

The x86 Open64 Compiler Suite is only available from (and supported by) AMD at
http://developer.amd.com/cpu/open64

Binaries were compiled on SLES10 SP2 with binutils 2.18

Base Compiler Invocation

C benchmarks:

 opencc 

C++ benchmarks:

 openCC 

Fortran benchmarks:

 openf95 

Benchmarks using both Fortran and C:

 opencc   openf95 

Base Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64 
436.cactusADM:  -DSPEC_CPU_LP64   -fno-second-underscore 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
447.dealII:  -DSPEC_CPU_LP64 
450.soplex:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_LINUX   -DSPEC_CPU_CASE_FLAG   -fno-second-underscore 
482.sphinx3:  -DSPEC_CPU_LP64 

Base Optimization Flags

C benchmarks:

 -march=barcelona   -mso   -Ofast   -OPT:malloc_alg=1   -HP:bdt=2m 

C++ benchmarks:

 -march=barcelona   -mso   -Ofast   -static   -INLINE:aggressive=on   -OPT:malloc_alg=1   -HP:bdt=2m 

Fortran benchmarks:

 -march=barcelona   -mso   -Ofast   -HP 

Benchmarks using both Fortran and C:

 -march=barcelona   -mso   -Ofast   -OPT:malloc_alg=1   -HP:bdt=2m   -HP 

Peak Compiler Invocation

C benchmarks:

 opencc 

C++ benchmarks:

 openCC 

Fortran benchmarks:

 openf95 

Benchmarks using both Fortran and C:

 opencc   openf95 

Peak Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64 
436.cactusADM:  -DSPEC_CPU_LP64   -fno-second-underscore 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_LINUX   -DSPEC_CPU_CASE_FLAG   -fno-second-underscore 
482.sphinx3:  -DSPEC_CPU_LP64 

Peak Optimization Flags

C benchmarks:

433.milc:  -march=barcelona   -mso   -Ofast   -CG:movnti=1   -CG:local_sched_alg=1   -CG:locs_shallow_depth=1   -HP:bdt=2m:heap=2m   -LNO:prefetch=3 
470.lbm:  basepeak = yes 
482.sphinx3:  -march=barcelona   -mso   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -OPT:malloc_alg=2   -CG:sse_cse_regs=0   -CG:locs_shallow_depth=1   -CG:cmp_peep=on   -CG:local_sched_alg=1   -INLINE:aggressive=on 

C++ benchmarks:

444.namd:  -march=barcelona   -mso   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -LNO:ignore_feedback=off   -CG:local_sched_alg=2   -CG:load_exe=0   -CG:compute_to=on   -OPT:unroll_size=256   -fno-exceptions   -HP:bdt=2m:heap=2m 
447.dealII:  -march=barcelona   -mso   -Ofast   -static   -INLINE:aggressive=on   -LNO:opt=0   -fno-emit-exceptions   -m32   -OPT:unroll_times_max=8   -OPT:unroll_size=256   -OPT:unroll_level=2   -HP:bdt=2m:heap=2m   -GRA:unspill=on   -CG:cmp_peep=on   -TENV:frame_pointer=off 
450.soplex:  -march=barcelona   -mso   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -O3   -INLINE:aggressive=on   -OPT:IEEE_arith=3   -OPT:IEEE_NaN_Inf=off   -OPT:fold_unsigned_relops=on   -OPT:malloc_alg=1   -CG:load_exe=0   -fno-exceptions   -m32   -HP:bdt=2m 
453.povray:  -march=barcelona   -mso   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -INLINE:aggressive=on 

Fortran benchmarks:

410.bwaves:  -march=barcelona   -mso   -O3   -OPT:Ofast   -OPT:treeheight=on   -LNO:blocking=off   -LNO:prefetch_ahead=5   -LNO:ignore_feedback=off   -WOPT:aggstr=0   -HP:bdt=2m:heap=2m   -CG:cmp_peep=on 
416.gamess:  -march=barcelona   -mso   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -O3   -LNO:fu=6   -LNO:blocking=0   -LNO:prefetch=0   -OPT:Ofast   -OPT:ro=3   -OPT:unroll_size=256   -HP:bdt=2m:heap=2m 
434.zeusmp:  -march=barcelona   -mso   -Ofast   -LNO:blocking=off   -LNO:interchange=off   -OPT:treeheight=on   -OPT:unroll_size=256   -CG:cmp_peep=on   -GRA:prioritize_by_density=on   -HP 
437.leslie3d:  -march=barcelona   -mso   -Ofast   -HP:bdt=2m:heap=2m 
459.GemsFDTD:  -march=barcelona   -mso   -Ofast   -LNO:fission=2   -LNO:prefetch_ahead=1   -CG:load_exe=0   -CG:local_sched_alg=1   -HP 
465.tonto:  -march=barcelona   -mso   -Ofast   -OPT:alias=no_f90_pointer_alias   -LNO:blocking=off   -CG:load_exe=1   -IPA:plimit=525   -HP 

Benchmarks using both Fortran and C:

435.gromacs:  -march=barcelona   -mso   -Ofast   -OPT:rsqrt=2   -HP:bdt=2m:heap=2m 
436.cactusADM:  -march=barcelona   -mso   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -apo   -LNO:prefetch_ahead=1   -HP:bdt=2m:heap=2m   -LANG:heap_allocation_threshold=100 
454.calculix:  -march=barcelona   -mso   -Ofast   -CG:load_exe=0   -CG:ptr_load_use=0   -CG:local_sched_alg=2   -CG:compute_to=on   -LNO:prefetch_ahead=30   -WOPT:unroll=2   -GRA:optimize_boundary=on   -HP:bdt=2m:heap=2m 
481.wrf:  -march=barcelona   -mso   -Ofast   -LNO:blocking=off   -LNO:prefetch_ahead=10   -LANG:copyinout=off   -IPA:callee_limit=5000   -GRA:prioritize_by_density=on   -m3dnow   -HP 

The flags files that were used to format this result can be browsed at
http://www.spec.org/cpu2006/flags/x86-open64-424-flags-rate-revC.20100901.html,
http://www.spec.org/cpu2006/flags/amd-platform-rate-revC.html.

You can also download the XML flags sources by saving the following links:
http://www.spec.org/cpu2006/flags/x86-open64-424-flags-rate-revC.20100901.xml,
http://www.spec.org/cpu2006/flags/amd-platform-rate-revC.xml.