SPEC® CFP2006 Result

Copyright 2006-2014 Standard Performance Evaluation Corporation

Dell Inc.

PowerEdge M605 (AMD Opteron 2382, 2.60 GHz)

CPU2006 license: 55
Test date: Nov-2008
Test sponsor: Dell Inc.
Hardware Availability: Nov-2008
Tested by: Dell Inc.
Software Availability: Oct-2008
Hardware
CPU Name: AMD Opteron 2382
CPU Characteristics:
CPU MHz: 2600
FPU: Integrated
CPU(s) enabled: 8 cores, 2 chips, 4 cores/chip
CPU(s) orderable: 1,2 chips
Primary Cache: 64 KB I + 64 KB D on chip per core
Secondary Cache: 512 KB I+D on chip per core
L3 Cache: 6 MB I+D on chip per chip
Other Cache: None
Memory: 32 GB (8 x 4 GB DDR2-800)
Disk Subsystem: 1 x 80 GB 5400 RPM SATA
Other Hardware: None
Software
Operating System: SUSE Linux Enterprise Server 10 (x86_64) SP2, Kernel 2.6.16.60-0.21-smp
Compiler: PGI Server Complete Version 7.2; PathScale Compiler Suite Version 3.2
Auto Parallel: Yes
File System: ReiserFS
System State: Run level 3 (multi-user)
Base Pointers: 64-bit
Peak Pointers: 32/64-bit
Other Software: binutils 2.18; 32-bit and 64-bit libhugetlbfs libraries

Results Table

Columns: Benchmark; Base Copies, then Seconds and Ratio for each of three runs; Peak Copies, then Seconds and Ratio for each of three runs.
Results appear in the order in which they were run. Bold underlined text indicates a median measurement.
410.bwaves 8 1044 104   1045 104   1045 104   8 1004 108   1003 108   1005 108  
416.gamess 8 1215 129   1212 129   1212 129   8 1091 144   1088 144   1090 144  
433.milc 8 939 78.2 937 78.4 938 78.3 8 939 78.2 937 78.4 938 78.3
434.zeusmp 8 660 110   663 110   666 109   8 612 119   613 119   614 118  
435.gromacs 8 492 116   491 116   490 117   8 401 143   400 143   401 143  
436.cactusADM 8 810 118   815 117   809 118   2 163 147   161 148   161 148  
437.leslie3d 8 1149 65.5 1150 65.4 1147 65.6 8 1069 70.3 1066 70.5 1068 70.4
444.namd 8 641 100   643 99.7 641 100   8 555 116   554 116   554 116  
447.dealII 8 607 151   607 151   605 151   8 473 194   464 197   467 196  
450.soplex 8 929 71.8 927 72.0 928 71.9 8 865 77.1 881 75.8 844 79.0
453.povray 8 311 137   313 136   313 136   8 270 158   271 157   270 158  
454.calculix 8 480 138   480 137   481 137   8 404 163   405 163   405 163  
459.GemsFDTD 8 1330 63.8 1331 63.8 1356 62.6 8 1277 66.5 1276 66.5 1277 66.4
465.tonto 8 647 122   650 121   646 122   8 531 148   530 149   530 149  
470.lbm 8 1765 62.3 1764 62.3 1764 62.3 8 1763 62.3 1764 62.3 1764 62.3
481.wrf 8 825 108   823 109   820 109   8 770 116   768 116   770 116  
482.sphinx3 8 1138 137   1119 139   1124 139   8 1026 152   1034 151   1029 151  
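
The table reports three runs per benchmark and marks the median. The short Python sketch below shows one way to read such rows, using three benchmarks copied from the table. It rests on two assumptions that this report does not state: that each ratio is a rate-style value (copies x reference time / seconds), as the Copies column suggests, and that per-benchmark medians are combined with a geometric mean. The function names are illustrative only, and because only three benchmarks are included, the mean printed here is not the overall score.

```python
import math
from statistics import median

# Rows copied from the results table: benchmark -> (copies, [(seconds, ratio), ...]).
# Only three benchmarks are included, so this is a subset, not the full suite.
BASE = {
    "410.bwaves": (8, [(1044, 104), (1045, 104), (1045, 104)]),
    "436.cactusADM": (8, [(810, 118), (815, 117), (809, 118)]),
    "450.soplex": (8, [(929, 71.8), (927, 72.0), (928, 71.9)]),
}
PEAK = {
    "410.bwaves": (8, [(1004, 108), (1003, 108), (1005, 108)]),
    "436.cactusADM": (2, [(163, 147), (161, 148), (161, 148)]),
    "450.soplex": (8, [(865, 77.1), (881, 75.8), (844, 79.0)]),
}


def median_ratio(runs):
    """Return the median of the reported ratios for one benchmark."""
    return median(ratio for _, ratio in runs)


def geometric_mean(values):
    """Geometric mean, computed in log space."""
    values = list(values)
    return math.exp(sum(math.log(v) for v in values) / len(values))


def implied_reference_seconds(copies, seconds, ratio):
    """Back out the reference time, ASSUMING ratio = copies * reference / seconds."""
    return seconds * ratio / copies


for name in BASE:
    base_median = median_ratio(BASE[name][1])
    peak_median = median_ratio(PEAK[name][1])
    print(f"{name}: base median {base_median}, peak median {peak_median}")

subset_mean = geometric_mean(median_ratio(runs) for _, runs in BASE.values())
print(f"geometric mean of the three base medians: {subset_mean:.1f}")

# 436.cactusADM runs 8 copies in base but 2 copies in peak; under the assumed
# formula both rows should imply roughly the same reference time.
print(implied_reference_seconds(8, 810, 118))
print(implied_reference_seconds(2, 163, 147))
```

The last two lines are a consistency check on the assumed formula: the 8-copy base run and the 2-copy peak run of 436.cactusADM imply reference times that agree to within rounding of the published ratios.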

Submit Notes

The config file option 'submit' was used.
 'numactl' was used to bind copies to the cores.

Operating System Notes

 The libhugetlbfs libraries were installed using the
 installation rpms that came with the distribution.

 'ulimit -s unlimited' was used to set the environment stack size.
 'ulimit -l 2097152' was used to set the environment limit on pages locked in memory.

 Set vm/nr_hugepages=7168 in /etc/sysctl.conf
 mount -t hugetlbfs nodev /mnt/hugepages
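
To put these settings in perspective, the Python sketch below converts them to sizes. It assumes 2 MiB huge pages (the usual default on x86_64 Linux, but not stated in this report), that 'ulimit -l' counts 1024-byte units, and that the 32 GB of installed memory can be treated as 32 GiB. The variable names are illustrative.

```python
# Values taken from the notes above.
NR_HUGEPAGES = 7168          # vm/nr_hugepages
MEMLOCK_UNITS = 2097152      # argument to 'ulimit -l'
INSTALLED_GIB = 32           # "32 GB" treated as GiB (assumption)

# Assumptions, not stated in the report.
HUGE_PAGE_BYTES = 2 * 1024**2   # 2 MiB huge pages
MEMLOCK_UNIT_BYTES = 1024       # 'ulimit -l' counted in 1024-byte units

GIB = 1024**3

pool_gib = NR_HUGEPAGES * HUGE_PAGE_BYTES / GIB
memlock_gib = MEMLOCK_UNITS * MEMLOCK_UNIT_BYTES / GIB
pool_fraction = pool_gib / INSTALLED_GIB

print(f"huge page pool: {pool_gib} GiB")
print(f"locked-memory limit: {memlock_gib} GiB")
print(f"share of installed memory reserved for huge pages: {pool_fraction:.1%}")
```

Under those assumptions the reserved pool is a bit under half of installed memory, which is why the page size matters when reusing the setting on another system.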

General Notes

Environment variables set by runspec before the start of the run:
HUGETLB_MORECORE = "yes"
LD_LIBRARY_PATH = "/root/cpu2006_1.1/amd909gh-libs/64:/root/cpu2006_1.1/amd909gh-libs/32"
NCPUS = "4"

Base Compiler Invocation

C benchmarks:

 pgcc 

C++ benchmarks:

 pgcpp 

Fortran benchmarks:

 pgf95 

Benchmarks using both Fortran and C:

 pgcc   pgf95 

Base Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64   -Mnomain 
436.cactusADM:  -DSPEC_CPU_LP64   -Mnomain 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
447.dealII:  -DSPEC_CPU_LP64 
450.soplex:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64   -Mnomain 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_CASE_FLAG   -DSPEC_CPU_LINUX 
482.sphinx3:  -DSPEC_CPU_LP64 

Base Optimization Flags

C benchmarks:

 -Mvect=cachesize:6291456   -fastsse   -Msmartalloc=huge   -Mfprelaxed   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 

C++ benchmarks:

 -Mvect=cachesize:6291456   -fastsse   -Msmartalloc=huge   -Mfprelaxed   --zc_eh   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 

Fortran benchmarks:

 -Mvect=cachesize:6291456   -fastsse   -Mfprelaxed   -Msmartalloc=huge   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 

Benchmarks using both Fortran and C:

 -Mvect=cachesize:6291456   -fastsse   -Msmartalloc=huge   -Mfprelaxed   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 

Base Other Flags

C benchmarks:

 -Mipa=jobs:4 

C++ benchmarks:

 -Mipa=jobs:4 

Fortran benchmarks:

 -Mipa=jobs:4 

Benchmarks using both Fortran and C:

 -Mipa=jobs:4 

Peak Compiler Invocation

C benchmarks:

 pgcc 

C++ benchmarks (except as noted below):

 pathCC 
444.namd:  pgcpp 

Fortran benchmarks (except as noted below):

 pathf95 
410.bwaves:  pgf95 
434.zeusmp:  pgf95 
437.leslie3d:  pgf95 

Benchmarks using both Fortran and C (except as noted below):

 pgcc   pgf95 
435.gromacs:  pathcc   pathf95 
481.wrf:  pathcc   pathf95 

Peak Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64 
436.cactusADM:  -DSPEC_CPU_LP64   -Mnomain 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64   -Mnomain 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_LINUX   -fno-second-underscore 
482.sphinx3:  -DSPEC_CPU_LP64 

Peak Optimization Flags

C benchmarks:

433.milc:  basepeak = yes 
470.lbm:  -Mvect=cachesize:6291456   -fastsse   -Msmartalloc=huge   -Mprefetch=t0   -Mloop32   -Mfprelaxed   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 
482.sphinx3:  -Mpfi=indirect(pass 1)   -Mpfo=indirect(pass 2)   -Mipa=fast(pass 2)   -Mipa=inline(pass 2)   -Mvect=cachesize:6291456   -fastsse   -Mfprelaxed   -Msmartalloc   -tp barcelona-64   -Bstatic_pgi 

C++ benchmarks:

444.namd:  -Mpfi(pass 1)   -Mpfo(pass 2)   -Mipa=fast(pass 2)   -Mipa=inline(pass 2)   -Mvect=cachesize:6291456   -fastsse   -Munroll=n:4   -Munroll=m:8   -Msmartalloc=huge   -Mnodepchk   -Mfprelaxed   --zc_eh   -tp barcelona-64   -Bstatic_pgi 
447.dealII:  -march=barcelona   -Ofast   -static   -INLINE:aggressive=on   -fno-exceptions   -m32 
450.soplex:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -L/usr/lib -lhugetlbfs(pass 2)   -O3   -INLINE:aggressive=on   -OPT:IEEE_arith=3   -OPT:IEEE_NaN_Inf=off   -OPT:fold_unsigned_relops=on   -OPT:malloc_alg=1   -CG:load_exe=0   -fno-exceptions   -m32 
453.povray:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -INLINE:aggressive=on 

Fortran benchmarks:

410.bwaves:  -Mvect=cachesize:6291456   -fastsse   -Msmartalloc   -Mprefetch=nta   -Mfprelaxed   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 
416.gamess:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Wl,-T/usr/share/libhugetlbfs/ldscripts/elf_x86_64.xBDT(pass 2)   -L/usr/lib64 -lhugetlbfs(pass 2)   -O2   -OPT:Ofast   -OPT:ro=3   -OPT:unroll_size=256 
434.zeusmp:  -Mvect=cachesize:6291456   -fastsse   -Mfprelaxed   -Mprefetch=distance:8   -Mprefetch=t0   -Msmartalloc=huge   -Msmartalloc=hugebss   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 
437.leslie3d:  -Mpfi=indirect(pass 1)   -Mpfo=indirect(pass 2)   -Mipa=fast(pass 2)   -Mipa=inline(pass 2)   -Mvect=cachesize:6291456   -fastsse   -Mvect=fuse   -Msmartalloc=huge   -Mprefetch=distance:8   -Mprefetch=t0   -Mfprelaxed   -tp barcelona-64   -Bstatic_pgi 
459.GemsFDTD:  -march=barcelona   -Ofast   -LNO:fission=2   -LNO:simd=2   -LNO:prefetch_ahead=1   -CG:load_exe=0   -CG:prefer_lru_reg=off   -OPT:malloc_alg=1   -Wl,-T/usr/share/libhugetlbfs/ldscripts/elf_x86_64.xBDT   -L/usr/lib64 -lhugetlbfs 
465.tonto:  -march=barcelona   -Ofast   -OPT:alias=no_f90_pointer_alias   -LNO:blocking=off   -CG:load_exe=1   -IPA:plimit=525   -OPT:malloc_alg=1   -Wl,-T/usr/share/libhugetlbfs/ldscripts/elf_x86_64.xBDT   -L/usr/lib64 -lhugetlbfs 

Benchmarks using both Fortran and C:

435.gromacs:  -march=barcelona   -Ofast   -OPT:rsqrt=2   -OPT:malloc_alg=1   -Wl,-T/usr/share/libhugetlbfs/ldscripts/elf_x86_64.xBDT   -L/usr/lib64 -lhugetlbfs 
436.cactusADM:  -Mvect=cachesize:6291456   -fastsse   -Mconcur   -Msmartalloc=huge   -Mfprelaxed   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 
454.calculix:  -Mpfi=indirect(pass 1)   -Mpfo=indirect(pass 2)   -Mipa=fast(pass 2)   -Mipa=inline(pass 2)   -Mvect=cachesize:6291456   -fastsse   -Msmartalloc=huge   -Mprefetch=t0   -Mpre   -Mfprelaxed   -tp barcelona-64   -Bstatic_pgi 
481.wrf:  -march=barcelona   -Ofast   -LNO:blocking=off   -LNO:prefetch_ahead=10   -LANG:copyinout=off   -IPA:callee_limit=5000   -GRA:prioritize_by_density=on   -OPT:malloc_alg=1   -m3dnow   -Wl,-T/usr/share/libhugetlbfs/ldscripts/elf_x86_64.xBDT   -L/usr/lib64 -lhugetlbfs 

Peak Other Flags

C benchmarks:

 -Mipa=jobs:4(pass 2) 

C++ benchmarks:

444.namd:  -Mipa=jobs:4(pass 2) 

Fortran benchmarks (except as noted below):

 -Mipa=jobs:4(pass 2) 
416.gamess:  No flags used 
459.GemsFDTD:  No flags used 
465.tonto:  No flags used 

Benchmarks using both Fortran and C (except as noted below):

 -Mipa=jobs:4(pass 2) 
435.gromacs:  No flags used 
481.wrf:  No flags used 

The flags files that were used to format this result can be browsed at:
http://www.spec.org/cpu2006/flags/pgi72_linux_flags.20090713.html
http://www.spec.org/cpu2006/flags/CPU2006_flags.20090710.html
http://www.spec.org/cpu2006/flags/amd-platform-amd909gh.html

You can also download the XML flags sources by saving the following links:
http://www.spec.org/cpu2006/flags/pgi72_linux_flags.20090713.xml
http://www.spec.org/cpu2006/flags/CPU2006_flags.20090710.xml
http://www.spec.org/cpu2006/flags/amd-platform-amd909gh.xml