SPEC® CFP2006 Result

Copyright 2006-2014 Standard Performance Evaluation Corporation

Dell Inc.

PowerEdge R815 (AMD Opteron 6176 SE, 2.30 GHz)

CPU2006 license: 55
Test date: Apr-2010
Test sponsor: Dell Inc.
Hardware Availability: Mar-2010
Tested by: Dell Inc.
Software Availability: Feb-2010
[Figure: benchmark results graph]
Hardware
CPU Name: AMD Opteron 6176 SE
CPU Characteristics:
CPU MHz: 2300
FPU: Integrated
CPU(s) enabled: 24 cores, 2 chips, 12 cores/chip
CPU(s) orderable: 2,4 chips
Primary Cache: 64 KB I + 64 KB D on chip per core
Secondary Cache: 512 KB I+D on chip per core
L3 Cache: 12 MB I+D on chip per chip, 6 MB shared / 6 cores
Other Cache: None
Memory: 64 GB (16 x 4 GB DDR3-1333 DR RDIMM, CL9, ECC)
Disk Subsystem: 1 x 148 GB 10000 RPM SAS
Other Hardware: None
Software
Operating System: SUSE Linux Enterprise Server 11 (x86_64), Kernel 2.6.27.19-5-smp
Compiler: x86 Open64 4.2.3 Compiler Suite (from AMD)
Auto Parallel: Yes
File System: ext3
System State: Run level 3 (Full multiuser with network)
Base Pointers: 64-bit
Peak Pointers: 32/64-bit
Other Software: None

Results Table

Results appear in the order in which they were run. Bold underlined text indicates a median measurement.

                Base                                               Peak
Benchmark       Copies Seconds Ratio Seconds Ratio Seconds Ratio   Copies Seconds Ratio Seconds Ratio Seconds Ratio
410.bwaves          24    1239   263    1234   264    1235   264       24    1221   267    1222   267    1221   267
416.gamess          24    1309   359    1311   359    1308   359       24    1217   386    1215   387    1209   389
433.milc            24    1140   193    1139   193    1139   193       24    1035   213    1036   213    1035   213
434.zeusmp          24     760   287     760   287     758   288       24     720   303     723   302     721   303
435.gromacs         24     596   287     596   287     596   287       24     463   370     473   362     469   365
436.cactusADM       24     896   320     898   320     897   320        4     103   466     101   473     101   473
437.leslie3d        24    1187   190    1188   190    1188   190       24    1190   190    1185   190    1185   190
444.namd            24     681   283     677   284     677   284       24     626   308     618   311     617   312
447.dealII          24     599   458     612   448     607   452       24     495   555     490   560     493   557
450.soplex          24    1214   165    1078   186    1078   186       24    1085   185     961   208     940   213
453.povray          24     314   407     314   406     313   408       24     303   421     303   421     303   421
454.calculix        24     481   412     483   410     483   410       24     453   437     452   438     452   438
459.GemsFDTD        24    1457   175    1465   174    1471   173       24    1402   182    1400   182    1398   182
465.tonto           24     671   352     670   352     673   351       24     606   390     606   390     607   389
470.lbm             24     863   382     866   381     864   382       24     863   382     866   381     864   382
481.wrf             24     864   310     865   310     870   308       24     839   320     837   320     838   320
482.sphinx3         24    1309   357    1314   356    1304   359       24    1288   363    1280   365    1281   365
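
SPEC reports the median of the three runs for each benchmark, and the overall metric is the geometric mean of those medians. The Python sketch below applies that calculation to the base and peak ratios listed in the table. The dictionary layout and function names are illustrative and are not part of the SPEC tools. The printed values are unrounded, so they may differ slightly from figures published under SPEC's own rounding rules.

```python
from math import prod
from statistics import median

# Ratios copied from the results table: benchmark -> (base runs, peak runs).
RATIOS = {
    "410.bwaves":    ((263, 264, 264), (267, 267, 267)),
    "416.gamess":    ((359, 359, 359), (386, 387, 389)),
    "433.milc":      ((193, 193, 193), (213, 213, 213)),
    "434.zeusmp":    ((287, 287, 288), (303, 302, 303)),
    "435.gromacs":   ((287, 287, 287), (370, 362, 365)),
    "436.cactusADM": ((320, 320, 320), (466, 473, 473)),
    "437.leslie3d":  ((190, 190, 190), (190, 190, 190)),
    "444.namd":      ((283, 284, 284), (308, 311, 312)),
    "447.dealII":    ((458, 448, 452), (555, 560, 557)),
    "450.soplex":    ((165, 186, 186), (185, 208, 213)),
    "453.povray":    ((407, 406, 408), (421, 421, 421)),
    "454.calculix":  ((412, 410, 410), (437, 438, 438)),
    "459.GemsFDTD":  ((175, 174, 173), (182, 182, 182)),
    "465.tonto":     ((352, 352, 351), (390, 390, 389)),
    "470.lbm":       ((382, 381, 382), (382, 381, 382)),
    "481.wrf":       ((310, 310, 308), (320, 320, 320)),
    "482.sphinx3":   ((357, 356, 359), (363, 365, 365)),
}


def medians(column):
    """Median ratio per benchmark; column 0 is base, column 1 is peak."""
    return {name: median(runs[column]) for name, runs in RATIOS.items()}


def geometric_mean(values):
    """Geometric mean of a collection of positive numbers."""
    values = list(values)
    return prod(values) ** (1.0 / len(values))


base = medians(0)
peak = medians(1)
print(f"geometric mean of base medians: {geometric_mean(base.values()):.1f}")
print(f"geometric mean of peak medians: {geometric_mean(peak.values()):.1f}")
```

Using the median rather than the mean keeps a single slow run, such as the first base run of 450.soplex, from pulling the reported ratio down.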

Submit Notes

The config file option 'submit' was used.
'numactl' was used to bind copies to the cores.
See the configuration file for details.
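
The configuration file is not reproduced in this report, so the exact submit line is unknown. As a rough sketch of what binding one copy per core with numactl can look like, the Python below generates one command per copy. The layout of 4 NUMA nodes with 6 cores each, the contiguous core numbering, the `./benchmark_binary` name, and the choice of `--physcpubind` together with `--membind` are all assumptions for illustration, not the tester's actual settings.

```python
def numactl_commands(copies, cores_per_node, command):
    """Build one numactl invocation per copy, pinning copy i to core i.

    Assumes cores are numbered contiguously within each NUMA node.
    That layout is hypothetical and should be checked with
    `numactl --hardware` on a real system.
    """
    lines = []
    for core in range(copies):
        node = core // cores_per_node  # NUMA node assumed to own this core
        lines.append(
            f"numactl --physcpubind={core} --membind={node} {command}"
        )
    return lines


for line in numactl_commands(copies=24, cores_per_node=6,
                             command="./benchmark_binary"):
    print(line)
```

Pinning both the CPU and the memory node keeps each copy's allocations local to the core it runs on, which is the usual motivation for binding in rate runs.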

Operating System Notes

'ulimit -s unlimited' was used to set environment stack size
'ulimit -l 2097152' was used to set environment locked pages in memory limit

Set vm/nr_hugepages=10800 in /etc/sysctl.conf
mount -t hugetlbfs nodev /mnt/hugepages

General Notes

Environment variables set by runspec before the start of the run:
HUGETLB_LIMIT = "450"
LD_LIBRARY_PATH = "/root/cpu2006-1.1/amd1002mc-rate-revA-libs/64:/root/cpu2006-1.1/amd1002mc-rate-revA-libs/32"
OMP_NUM_THREADS = "6"
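
Several of the numbers in these notes line up when multiplied out, and the short Python sketch below checks them. It assumes 2 MB huge pages (consistent with the -HP:bdt=2m compiler flag, but not stated for the kernel), assumes HUGETLB_LIMIT is a per-process cap counted in huge pages, and assumes the 'ulimit -l' value is in KB as in bash. None of these units are spelled out in the report.

```python
COPIES = 24              # base copies from the results table
CORES = 24               # cores enabled
NR_HUGEPAGES = 10800     # vm/nr_hugepages from /etc/sysctl.conf
HUGETLB_LIMIT = 450      # assumed: huge pages allowed per process
HUGEPAGE_MB = 2          # assumed huge page size in MB
OMP_NUM_THREADS = 6
CACTUS_PEAK_COPIES = 4   # 436.cactusADM peak copies (built with -apo)
MEMLOCK_KB = 2097152     # 'ulimit -l' value, assumed to be in KB

# Huge pages needed if every copy uses its full assumed allowance.
pages_if_all_copies_hit_limit = COPIES * HUGETLB_LIMIT

# Size of the reserved huge page pool under the 2 MB assumption.
hugepage_pool_gib = NR_HUGEPAGES * HUGEPAGE_MB / 1024

# Threads in flight when the auto-parallel cactusADM peak run is active.
threads_in_cactus_peak = CACTUS_PEAK_COPIES * OMP_NUM_THREADS

# Locked-memory limit converted from KB to GiB.
memlock_gib = MEMLOCK_KB / (1024 * 1024)

print(pages_if_all_copies_hit_limit)  # 10800, equal to NR_HUGEPAGES
print(f"{hugepage_pool_gib:.1f} GiB reserved as huge pages")
print(threads_in_cactus_peak)         # 24, equal to CORES
print(memlock_gib)                    # 2.0
```

Under these assumptions the huge page pool is sized exactly for 24 copies at 450 pages each, and the 4-copy peak run of 436.cactusADM with 6 threads per copy occupies all 24 cores.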

The x86 Open64 Compiler Suite is only available from (and supported by) AMD at
http://developer.amd.com/cpu/open64
Binaries were compiled on SLES10 SP2 with binutils 2.18

Base Compiler Invocation

C benchmarks:

 opencc 

C++ benchmarks:

 openCC 

Fortran benchmarks:

 openf95 

Benchmarks using both Fortran and C:

 opencc   openf95 

Base Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64 
436.cactusADM:  -DSPEC_CPU_LP64   -fno-second-underscore 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
447.dealII:  -DSPEC_CPU_LP64 
450.soplex:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_LINUX   -DSPEC_CPU_CASE_FLAG   -fno-second-underscore 
482.sphinx3:  -DSPEC_CPU_LP64 

Base Optimization Flags

C benchmarks:

 -march=barcelona   -mso   -Ofast   -OPT:malloc_alg=1   -HP:bdt=2m 

C++ benchmarks:

 -march=barcelona   -mso   -Ofast   -static   -INLINE:aggressive=on   -OPT:malloc_alg=1   -HP:bdt=2m 

Fortran benchmarks:

 -march=barcelona   -mso   -Ofast   -HP 

Benchmarks using both Fortran and C:

 -march=barcelona   -mso   -Ofast   -OPT:malloc_alg=1   -HP:bdt=2m   -HP 

Peak Compiler Invocation

C benchmarks:

 opencc 

C++ benchmarks:

 openCC 

Fortran benchmarks:

 openf95 

Benchmarks using both Fortran and C:

 opencc   openf95 

Peak Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64 
436.cactusADM:  -DSPEC_CPU_LP64   -fno-second-underscore 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_LINUX   -DSPEC_CPU_CASE_FLAG   -fno-second-underscore 
482.sphinx3:  -DSPEC_CPU_LP64 

Peak Optimization Flags

C benchmarks:

433.milc:  -march=barcelona   -mso   -Ofast   -CG:movnti=1   -CG:local_sched_alg=1   -CG:locs_shallow_depth=1   -HP:bdt=2m:heap=2m   -LNO:prefetch=3 
470.lbm:  basepeak = yes 
482.sphinx3:  -march=barcelona   -mso   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -OPT:malloc_alg=2   -CG:sse_cse_regs=0   -CG:locs_shallow_depth=1   -CG:cmp_peep=on   -CG:local_sched_alg=1   -INLINE:aggressive=on 

C++ benchmarks:

444.namd:  -march=barcelona   -mso   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -LNO:ignore_feedback=off   -CG:local_sched_alg=2   -CG:load_exe=0   -CG:compute_to=on   -OPT:unroll_size=256   -fno-exceptions   -HP:bdt=2m:heap=2m 
447.dealII:  -march=barcelona   -mso   -Ofast   -static   -INLINE:aggressive=on   -LNO:opt=0   -fno-emit-exceptions   -m32   -OPT:unroll_times_max=8   -OPT:unroll_size=256   -OPT:unroll_level=2   -HP:bdt=2m:heap=2m   -GRA:unspill=on   -CG:cmp_peep=on   -TENV:frame_pointer=off 
450.soplex:  -march=barcelona   -mso   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -O3   -INLINE:aggressive=on   -OPT:IEEE_arith=3   -OPT:IEEE_NaN_Inf=off   -OPT:fold_unsigned_relops=on   -OPT:malloc_alg=1   -CG:load_exe=0   -fno-exceptions   -m32   -HP:bdt=2m 
453.povray:  -march=barcelona   -mso   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -INLINE:aggressive=on 

Fortran benchmarks:

410.bwaves:  -march=barcelona   -mso   -O3   -OPT:Ofast   -OPT:treeheight=on   -LNO:blocking=off   -LNO:prefetch_ahead=5   -LNO:ignore_feedback=off   -WOPT:aggstr=0   -HP:bdt=2m:heap=2m   -CG:cmp_peep=on 
416.gamess:  -march=barcelona   -mso   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -O3   -LNO:fu=6   -LNO:blocking=0   -LNO:prefetch=0   -OPT:Ofast   -OPT:ro=3   -OPT:unroll_size=256   -HP:bdt=2m:heap=2m 
434.zeusmp:  -march=barcelona   -mso   -Ofast   -LNO:blocking=off   -LNO:interchange=off   -OPT:treeheight=on   -OPT:unroll_size=256   -CG:cmp_peep=on   -GRA:prioritize_by_density=on   -HP 
437.leslie3d:  -march=barcelona   -mso   -Ofast   -HP:bdt=2m:heap=2m 
459.GemsFDTD:  -march=barcelona   -mso   -Ofast   -LNO:fission=2   -LNO:prefetch_ahead=1   -CG:load_exe=0   -CG:local_sched_alg=1   -HP 
465.tonto:  -march=barcelona   -mso   -Ofast   -OPT:alias=no_f90_pointer_alias   -LNO:blocking=off   -CG:load_exe=1   -IPA:plimit=525   -HP 

Benchmarks using both Fortran and C:

435.gromacs:  -march=barcelona   -mso   -Ofast   -OPT:rsqrt=2   -HP:bdt=2m:heap=2m 
436.cactusADM:  -march=barcelona   -mso   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -apo   -LNO:prefetch_ahead=1   -HP:bdt=2m:heap=2m   -LANG:heap_allocation_threshold=100 
454.calculix:  -march=barcelona   -mso   -Ofast   -CG:load_exe=0   -CG:ptr_load_use=0   -CG:local_sched_alg=2   -CG:compute_to=on   -LNO:prefetch_ahead=30   -WOPT:unroll=2   -GRA:optimize_boundary=on   -HP:bdt=2m:heap=2m 
481.wrf:  -march=barcelona   -mso   -Ofast   -LNO:blocking=off   -LNO:prefetch_ahead=10   -LANG:copyinout=off   -IPA:callee_limit=5000   -GRA:prioritize_by_density=on   -m3dnow   -HP 

The flags files that were used to format this result can be browsed at
http://www.spec.org/cpu2006/flags/x86-open64-4.2.3-flags-revA.html,
http://www.spec.org/cpu2006/flags/amd-platform-revA.html.

You can also download the XML flags sources by saving the following links:
http://www.spec.org/cpu2006/flags/x86-open64-4.2.3-flags-revA.xml,
http://www.spec.org/cpu2006/flags/amd-platform-revA.xml.