SPEC® CFP2006 Result

Copyright 2006-2014 Standard Performance Evaluation Corporation

Dell Inc.

PowerEdge R905 (AMD Opteron 8384, 2.70 GHz)

CPU2006 license: 55
Test date: Oct-2008
Test sponsor: Dell Inc.
Hardware Availability: Nov-2008
Tested by: Dell Inc.
Software Availability: Oct-2008
Benchmark results graph
Hardware
CPU Name: AMD Opteron 8384
CPU Characteristics:
CPU MHz: 2700
FPU: Integrated
CPU(s) enabled: 16 cores, 4 chips, 4 cores/chip
CPU(s) orderable: 2,4 chips
Primary Cache: 64 KB I + 64 KB D on chip per core
Secondary Cache: 512 KB I+D on chip per core
L3 Cache: 6 MB I+D on chip per chip
Other Cache: None
Memory: 64 GB (16 x 4 GB DDR2-800)
Disk Subsystem: 1 x 73 GB SAS, 10000 RPM
Other Hardware: None
Software
Operating System: SUSE Linux Enterprise Server 10 (x86_64) SP2, Kernel 2.6.16-60.0.21-smp
Compiler: PGI Server Complete Version 7.2; PathScale Compiler Suite Version 3.2
Auto Parallel: Yes
File System: ReiserFS
System State: Run level 3 (multi-user)
Base Pointers: 64-bit
Peak Pointers: 32/64-bit
Other Software: binutils 2.18; 32-bit and 64-bit libhugetlbfs libraries

Results Table

Results appear in the order in which they were run. Bold underlined text indicates a median measurement.
Benchmark  Base: Copies Seconds Ratio Seconds Ratio Seconds Ratio  Peak: Copies Seconds Ratio Seconds Ratio Seconds Ratio
410.bwaves 16 1469 148 1448 150 1441 151 16 1441 151 1468 148 1435 152
416.gamess 16 1172 267 1173 267 1174 267 16 1053 298 1052 298 1051 298
433.milc 16 1183 124 1189 124 1178 125 16 1183 124 1189 124 1178 125
434.zeusmp 16 730 199 731 199 728 200 16 676 215 675 216 677 215
435.gromacs 16 478 239 478 239 479 239 16 391 292 391 292 391 292
436.cactusADM 16 916 209 916 209 914 209 4 170 281 171 280 170 281
437.leslie3d 16 1378 109 1378 109 1382 109 16 1244 121 1244 121 1244 121
444.namd 16 624 206 622 206 620 207 16 539 238 536 239 536 239
447.dealII 16 643 284 643 284 644 284 16 489 375 494 370 489 374
450.soplex 16 1103 121 1090 122 1090 122 16 1006 133 1000 133 994 134
453.povray 16 302 282 302 282 303 281 16 261 326 261 326 261 326
454.calculix 16 477 276 479 275 477 277 16 404 327 404 327 404 327
459.GemsFDTD 16 1427 119 1446 117 1419 120 16 1377 123 1380 123 1368 124
465.tonto 16 645 244 643 245 646 244 16 530 297 531 296 532 296
470.lbm 16 1786 123 1786 123 1786 123 16 1786 123 1786 123 1785 123
481.wrf 16 902 198 905 197 904 198 16 870 205 877 204 875 204
482.sphinx3 16 1426 219 1435 217 1421 219 16 1310 238 1310 238 1310 238
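
The Ratio columns can be related to the Copies and Seconds columns with a short calculation. The sketch below assumes the usual SPECrate relation (copies times reference time, divided by measured time) and a reference time of 13590 seconds for 410.bwaves; that reference time is not stated in this report and is an assumption here. It also computes a geometric mean of the base median ratios read from the table, which is how an overall rate metric is normally derived; because it works from the rounded table values, the result may differ slightly from an officially reported score.

```python
from statistics import geometric_mean, median

# Assumption: SPEC CPU2006 reference time for 410.bwaves, in seconds.
# This value does not appear in the report itself.
BWAVES_REF_SECONDS = 13590


def rate_ratio(copies, ref_seconds, run_seconds):
    """Rate-style ratio: copies * reference time / measured run time."""
    return copies * ref_seconds / run_seconds


# Base run times for 410.bwaves from the results table (16 copies).
bwaves_base_seconds = [1469, 1448, 1441]
bwaves_base_ratios = [
    round(rate_ratio(16, BWAVES_REF_SECONDS, s)) for s in bwaves_base_seconds
]
bwaves_median = median(bwaves_base_ratios)

# Median base ratio for each benchmark, read from the three base runs above.
base_median_ratios = {
    "410.bwaves": 150, "416.gamess": 267, "433.milc": 124,
    "434.zeusmp": 199, "435.gromacs": 239, "436.cactusADM": 209,
    "437.leslie3d": 109, "444.namd": 206, "447.dealII": 284,
    "450.soplex": 122, "453.povray": 282, "454.calculix": 276,
    "459.GemsFDTD": 119, "465.tonto": 244, "470.lbm": 123,
    "481.wrf": 198, "482.sphinx3": 219,
}

# Geometric mean of the rounded medians (approximate overall base figure).
overall_base = geometric_mean(list(base_median_ratios.values()))

print("410.bwaves base ratios:", bwaves_base_ratios, "median:", bwaves_median)
print("geometric mean of base medians: %.1f" % overall_base)
```

Under the assumed reference time, the three computed 410.bwaves ratios agree with the values printed in the table.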

Submit Notes

 The config file option 'submit' was used.
 'numactl' was used to bind copies to the cores
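
The report does not show the actual submit line, so the following is only a sketch of how one copy per core might be bound with numactl. It assumes cores are numbered contiguously within each chip (cores 0-3 on NUMA node 0, 4-7 on node 1, and so on) and that each copy's memory is bound to its local node; the helper name `bind_command` and the `./benchmark` placeholder are hypothetical.

```python
CORES_PER_CHIP = 4  # from the Hardware section: 4 cores/chip


def bind_command(copy_num, command="./benchmark"):
    """Build a numactl command line pinning one copy to one core.

    Assumes contiguous core numbering per chip; the real topology
    and the tester's real submit line may differ.
    """
    node = copy_num // CORES_PER_CHIP
    return f"numactl --physcpubind={copy_num} --membind={node} {command}"


# One command line per copy for a 16-copy run.
for copy_num in range(16):
    print(bind_command(copy_num))
```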

Operating System Notes

 The libhugetlbfs libraries were installed using the
 installation rpms that came with the distribution.

 'ulimit -s unlimited' was used to set the environment stack size.
 'ulimit -l 2097152' was used to set the limit on pages locked in memory.

 Set vm/nr_hugepages=14336 in /etc/sysctl.conf
 mount -t hugetlbfs nodev /mnt/hugepages
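
To put the settings above in perspective, the arithmetic below estimates how much memory they cover. It is a sketch under two assumptions that the report does not state: a huge page size of 2 MiB (a common x86_64 default) and 'ulimit -l' values expressed in units of 1024 bytes (as in bash). The 64 GB of installed memory is treated as 64 GiB.

```python
NR_HUGEPAGES = 14336          # from vm/nr_hugepages above
HUGEPAGE_BYTES = 2 * 1024**2  # assumed 2 MiB huge page size
LOCKED_LIMIT_KIB = 2097152    # from 'ulimit -l' above, assumed KiB units
TOTAL_RAM_GIB = 64            # from the Hardware section

GIB = 1024**3

hugepage_pool_gib = NR_HUGEPAGES * HUGEPAGE_BYTES / GIB
locked_limit_gib = LOCKED_LIMIT_KIB * 1024 / GIB
pool_fraction = hugepage_pool_gib / TOTAL_RAM_GIB

print(f"huge page pool: {hugepage_pool_gib:.1f} GiB "
      f"({pool_fraction:.1%} of installed memory)")
print(f"locked-memory limit: {locked_limit_gib:.1f} GiB")
```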

General Notes

Environment variables set by runspec before the start of the run:
HUGETLB_MORECORE = "yes"
LD_LIBRARY_PATH = "/root/cpu2006-1.1/amd909gh-libs/64:/root/cpu2006-1.1/amd909gh-libs/32"
NCPUS = "4"
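
One plausible reading of NCPUS = "4" is that it sets the thread count for code auto-parallelized by the PGI compiler with -Mconcur, which this report uses only for the peak build of 436.cactusADM. Under that assumption, the 4 peak copies of 436.cactusADM with 4 threads each would occupy the same 16 cores as the 16 single-threaded copies of the other benchmarks. The sketch below just checks that arithmetic; the function name is hypothetical.

```python
TOTAL_CORES = 16  # from the Hardware section: 16 cores enabled
NCPUS = 4         # from the environment variables above


def busy_threads(copies, threads_per_copy):
    """Total threads running when every copy uses threads_per_copy threads."""
    return copies * threads_per_copy


# 436.cactusADM peak: 4 copies, assumed NCPUS threads each (-Mconcur build).
cactus_peak_threads = busy_threads(copies=4, threads_per_copy=NCPUS)

# Typical run in this report: 16 copies, 1 thread each.
default_threads = busy_threads(copies=16, threads_per_copy=1)

print(cactus_peak_threads, default_threads, TOTAL_CORES)
```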

Base Compiler Invocation

C benchmarks:

 pgcc 

C++ benchmarks:

 pgcpp 

Fortran benchmarks:

 pgf95 

Benchmarks using both Fortran and C:

 pgcc   pgf95 

Base Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64   -Mnomain 
436.cactusADM:  -DSPEC_CPU_LP64   -Mnomain 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
447.dealII:  -DSPEC_CPU_LP64 
450.soplex:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64   -Mnomain 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_CASE_FLAG   -DSPEC_CPU_LINUX 
482.sphinx3:  -DSPEC_CPU_LP64 

Base Optimization Flags

C benchmarks:

 -Mvect=cachesize:6291456   -fastsse   -Msmartalloc=huge   -Mfprelaxed   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 

C++ benchmarks:

 -Mvect=cachesize:6291456   -fastsse   -Msmartalloc=huge   -Mfprelaxed   --zc_eh   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 

Fortran benchmarks:

 -Mvect=cachesize:6291456   -fastsse   -Mfprelaxed   -Msmartalloc=huge   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 

Benchmarks using both Fortran and C:

 -Mvect=cachesize:6291456   -fastsse   -Msmartalloc=huge   -Mfprelaxed   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 
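
The value given to -Mvect=cachesize appears to correspond to the 6 MB per-chip L3 cache listed under Hardware. A quick check, assuming "MB" there means 2^20 bytes:

```python
L3_CACHE_MB = 6             # from the Hardware section: L3 Cache 6 MB per chip
CACHESIZE_FLAG = 6291456    # from -Mvect=cachesize:6291456

l3_cache_bytes = L3_CACHE_MB * 1024 * 1024  # assumes 1 MB = 2**20 bytes
print(l3_cache_bytes == CACHESIZE_FLAG)
```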

Base Other Flags

C benchmarks:

 -Mipa=jobs:4 

C++ benchmarks:

 -Mipa=jobs:4 

Fortran benchmarks:

 -Mipa=jobs:4 

Benchmarks using both Fortran and C:

 -Mipa=jobs:4 

Peak Compiler Invocation

C benchmarks:

 pgcc 

C++ benchmarks (except as noted below):

 pathCC 
444.namd:  pgcpp 

Fortran benchmarks (except as noted below):

 pathf95 
410.bwaves:  pgf95 
434.zeusmp:  pgf95 
437.leslie3d:  pgf95 

Benchmarks using both Fortran and C (except as noted below):

 pgcc   pgf95 
435.gromacs:  pathcc   pathf95 
481.wrf:  pathcc   pathf95 

Peak Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64 
436.cactusADM:  -DSPEC_CPU_LP64   -Mnomain 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64   -Mnomain 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_LINUX   -fno-second-underscore 
482.sphinx3:  -DSPEC_CPU_LP64 

Peak Optimization Flags

C benchmarks:

433.milc:  basepeak = yes 
470.lbm:  -Mvect=cachesize:6291456   -fastsse   -Msmartalloc=huge   -Mprefetch=t0   -Mloop32   -Mfprelaxed   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 
482.sphinx3:  -Mpfi=indirect(pass 1)   -Mpfo=indirect(pass 2)   -Mipa=fast(pass 2)   -Mipa=inline(pass 2)   -Mvect=cachesize:6291456   -fastsse   -Mfprelaxed   -Msmartalloc   -tp barcelona-64   -Bstatic_pgi 

C++ benchmarks:

444.namd:  -Mpfi(pass 1)   -Mpfo(pass 2)   -Mipa=fast(pass 2)   -Mipa=inline(pass 2)   -Mvect=cachesize:6291456   -fastsse   -Munroll=n:4   -Munroll=m:8   -Msmartalloc=huge   -Mnodepchk   -Mfprelaxed   --zc_eh   -tp barcelona-64   -Bstatic_pgi 
447.dealII:  -march=barcelona   -Ofast   -static   -INLINE:aggressive=on   -fno-exceptions   -m32 
450.soplex:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -L/usr/lib -lhugetlbfs(pass 2)   -O3   -INLINE:aggressive=on   -OPT:IEEE_arith=3   -OPT:IEEE_NaN_Inf=off   -OPT:fold_unsigned_relops=on   -OPT:malloc_alg=1   -CG:load_exe=0   -fno-exceptions   -m32 
453.povray:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -INLINE:aggressive=on 
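
The "(pass 1)" and "(pass 2)" annotations mark flags used in a two-pass, feedback-directed build. The sketch below assumes the usual reading: a flag tagged with a pass is applied only in that pass, and an untagged flag is applied in both. It splits the 453.povray flag list above into the two sets; the helper name `flags_for_pass` is hypothetical and this is not how runspec itself is implemented.

```python
import re

# Matches a trailing "(pass N)" annotation on a flag.
PASS_TAG = re.compile(r"\(pass (\d)\)$")


def flags_for_pass(annotated_flags, pass_num):
    """Return the flags that apply to the given build pass."""
    selected = []
    for flag in annotated_flags:
        match = PASS_TAG.search(flag)
        if match is None:
            selected.append(flag)                  # untagged: used in every pass
        elif int(match.group(1)) == pass_num:
            selected.append(flag[:match.start()])  # tagged: strip the annotation
    return selected


# Peak optimization flags for 453.povray, as listed above.
povray_flags = [
    "-march=barcelona",
    "-fb_create fbdata(pass 1)",
    "-fb_opt fbdata(pass 2)",
    "-Ofast",
    "-INLINE:aggressive=on",
]

print("pass 1:", " ".join(flags_for_pass(povray_flags, 1)))
print("pass 2:", " ".join(flags_for_pass(povray_flags, 2)))
```

With this reading, pass 1 builds an instrumented binary that writes profile data, and pass 2 rebuilds using that data.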

Fortran benchmarks:

410.bwaves:  -Mvect=cachesize:6291456   -fastsse   -Msmartalloc   -Mprefetch=nta   -Mfprelaxed   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 
416.gamess:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Wl,-T/usr/share/libhugetlbfs/ldscripts/elf_x86_64.xBDT(pass 2)   -L/usr/lib64 -lhugetlbfs(pass 2)   -O2   -OPT:Ofast   -OPT:ro=3   -OPT:unroll_size=256 
434.zeusmp:  -Mvect=cachesize:6291456   -fastsse   -Mfprelaxed   -Mprefetch=distance:8   -Mprefetch=t0   -Msmartalloc=huge   -Msmartalloc=hugebss   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 
437.leslie3d:  -Mpfi=indirect(pass 1)   -Mpfo=indirect(pass 2)   -Mipa=fast(pass 2)   -Mipa=inline(pass 2)   -Mvect=cachesize:6291456   -fastsse   -Mvect=fuse   -Msmartalloc=huge   -Mprefetch=distance:8   -Mprefetch=t0   -Mfprelaxed   -tp barcelona-64   -Bstatic_pgi 
459.GemsFDTD:  -march=barcelona   -Ofast   -LNO:fission=2   -LNO:simd=2   -LNO:prefetch_ahead=1   -CG:load_exe=0   -CG:prefer_lru_reg=off   -OPT:malloc_alg=1   -Wl,-T/usr/share/libhugetlbfs/ldscripts/elf_x86_64.xBDT   -L/usr/lib64 -lhugetlbfs 
465.tonto:  -march=barcelona   -Ofast   -OPT:alias=no_f90_pointer_alias   -LNO:blocking=off   -CG:load_exe=1   -IPA:plimit=525   -OPT:malloc_alg=1   -Wl,-T/usr/share/libhugetlbfs/ldscripts/elf_x86_64.xBDT   -L/usr/lib64 -lhugetlbfs 

Benchmarks using both Fortran and C:

435.gromacs:  -march=barcelona   -Ofast   -OPT:rsqrt=2   -OPT:malloc_alg=1   -Wl,-T/usr/share/libhugetlbfs/ldscripts/elf_x86_64.xBDT   -L/usr/lib64 -lhugetlbfs 
436.cactusADM:  -Mvect=cachesize:6291456   -fastsse   -Mconcur   -Msmartalloc=huge   -Mfprelaxed   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 
454.calculix:  -Mpfi=indirect(pass 1)   -Mpfo=indirect(pass 2)   -Mipa=fast(pass 2)   -Mipa=inline(pass 2)   -Mvect=cachesize:6291456   -fastsse   -Msmartalloc=huge   -Mprefetch=t0   -Mpre   -Mfprelaxed   -tp barcelona-64   -Bstatic_pgi 
481.wrf:  -march=barcelona   -Ofast   -LNO:blocking=off   -LNO:prefetch_ahead=10   -LANG:copyinout=off   -IPA:callee_limit=5000   -GRA:prioritize_by_density=on   -OPT:malloc_alg=1   -m3dnow   -Wl,-T/usr/share/libhugetlbfs/ldscripts/elf_x86_64.xBDT   -L/usr/lib64 -lhugetlbfs 

Peak Other Flags

C benchmarks:

 -Mipa=jobs:4(pass 2) 

C++ benchmarks:

444.namd:  -Mipa=jobs:4(pass 2) 

Fortran benchmarks (except as noted below):

 -Mipa=jobs:4(pass 2) 
416.gamess:  No flags used 
459.GemsFDTD:  No flags used 
465.tonto:  No flags used 

Benchmarks using both Fortran and C (except as noted below):

 -Mipa=jobs:4(pass 2) 
435.gromacs:  No flags used 
481.wrf:  No flags used 

The flags files that were used to format this result can be browsed at
http://www.spec.org/cpu2006/flags/pgi72_linux_flags.20090713.html,
http://www.spec.org/cpu2006/flags/CPU2006_flags.20090710.html,
http://www.spec.org/cpu2006/flags/amd-platform-amd909gh.html.

You can also download the XML flags sources by saving the following links:
http://www.spec.org/cpu2006/flags/pgi72_linux_flags.20090713.xml,
http://www.spec.org/cpu2006/flags/CPU2006_flags.20090710.xml,
http://www.spec.org/cpu2006/flags/amd-platform-amd909gh.xml.