SPEC® CFP2006 Result

Copyright 2006-2014 Standard Performance Evaluation Corporation

Sun Microsystems

Sun Fire X4140 (AMD Opteron 2389 2.9GHz)

CPU2006 license: 6 Test date: Sep-2009
Test sponsor: Sun Microsystems Hardware Availability: May-2009
Tested by: Sun Microsystems Software Availability: Jul-2009
Benchmark results graph
Hardware
CPU Name: AMD Opteron 2389
CPU Characteristics:
CPU MHz: 2900
FPU: Integrated
CPU(s) enabled: 8 cores, 2 chips, 4 cores/chip
CPU(s) orderable: 1 or 2 chips
Primary Cache: 64 KB I + 64 KB D on chip per core
Secondary Cache: 512 KB I+D on chip per core
L3 Cache: 6 MB I+D on chip per chip
Other Cache: None
Memory: 32 GB (8x4GB, DDR2-667, CL5, Reg, Dual Rank)
Disk Subsystem: SAS, 72 GB,10 K RPM
Other Hardware: See additional details below
Software
Operating System: Red Hat Enterprise Linux Server release 5.3,
Kernel 2.6.18-128.el5 for x86_64
Compiler: PGI Server Complete Version 8.0
x86 Open64 4.2.2 Compiler Suite (from AMD)
Auto Parallel: Yes
File System: ReiserFS
System State: Run level 3 (Full multiuser with network)
Base Pointers: 64-bit
Peak Pointers: 32/64-bit
Other Software: binutils 2.18

Results Table

Benchmark Base Peak
Copies Seconds Ratio Seconds Ratio Seconds Ratio Copies Seconds Ratio Seconds Ratio Seconds Ratio
Results appear in the order in which they were run. Bold underlined text indicates a median measurement.
410.bwaves 8 1206 90.2 1206 90.2 1206 90.1 8 1172 92.8 1170 92.9 1173 92.7
416.gamess 8 1074 146   1073 146   1074 146   8 993 158   996 157   993 158  
433.milc 8 1096 67.0 1096 67.0 1096 67.0 8 1096 67.0 1096 67.0 1096 67.0
434.zeusmp 8 665 109   664 110   663 110   8 624 117   620 117   625 116  
435.gromacs 8 447 128   447 128   447 128   8 364 157   365 157   365 157  
436.cactusADM 8 841 114   837 114   840 114   2 163 147   163 147   164 146  
437.leslie3d 8 1328 56.6 1330 56.5 1331 56.5 8 1252 60.1 1249 60.2 1248 60.2
444.namd 8 558 115   557 115   558 115   8 505 127   505 127   505 127  
447.dealII 8 581 158   579 158   583 157   8 435 210   430 213   431 212  
450.soplex 8 1009 66.2 1010 66.1 1009 66.1 8 943 70.8 930 71.7 931 71.7
453.povray 8 288 148   288 148   288 148   8 239 178   239 178   239 178  
454.calculix 8 420 157   420 157   420 157   8 372 178   369 179   372 177  
459.GemsFDTD 8 1542 55.1 1566 54.2 1541 55.1 8 1514 56.0 1513 56.1 1517 56.0
465.tonto 8 615 128   614 128   616 128   8 499 158   503 157   503 156  
470.lbm 8 2069 53.1 2069 53.1 2068 53.1 8 2067 53.2 2073 53.0 2067 53.2
481.wrf 8 913 97.9 914 97.7 913 97.8 8 878 102   875 102   872 102  
482.sphinx3 8 1177 132   1175 133   1174 133   8 1079 145   1081 144   1081 144  

Submit Notes

The config file option 'submit' was used.
'numactl' was used to bind copies to the cores.
See the configuration file for details.

Operating System Notes

'ulimit -s unlimited' was used to set environment stack size
'ulimit -l 2097152'  was used to set environment locked pages in memory limit
Set vm/nr_hugepages=7168 in /etc/sysctl.conf
mount -t hugetlbfs nodev /mnt/hugepages

Platform Notes

   Default BIOS settings used except:
   DCT Unganged Mode set to "Always" to enable unganged mode.

General Notes

Environment variables set by runspec before the start of the run:
HUGETLB_LIMIT = "896"
LD_LIBRARY_PATH = "/data1/SPECcpu2006v1.1-pegasus215/amd0905is-libs/64:/data1/SPECcpu2006v1.1-pegasus215/amd0905is-libs/32"
NCPUS = "4"
PGI_HUGE_PAGES = "896"
The x86 Open64 Compiler Suite is only available from (and supported by) AMD at
http://developer.amd.com/cpu/open64.

Base Compiler Invocation

C benchmarks:

 pgcc 

C++ benchmarks:

 pgcpp 

Fortran benchmarks:

 pgf95 

Benchmarks using both Fortran and C:

 pgcc   pgf95 

Base Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64   -Mnomain 
436.cactusADM:  -DSPEC_CPU_LP64   -Mnomain 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
447.dealII:  -DSPEC_CPU_LP64 
450.soplex:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64   -Mnomain 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_CASE_FLAG   -DSPEC_CPU_LINUX 
482.sphinx3:  -DSPEC_CPU_LP64 

Base Optimization Flags

C benchmarks:

 -fastsse   -Msmartalloc=huge   -Mfprelaxed   -Mipa=fast   -Mipa=inline   -tp shanghai-64   -Bstatic_pgi 

C++ benchmarks:

 -fastsse   -Msmartalloc=huge   -Mfprelaxed   --zc_eh   -Mipa=fast   -Mipa=inline   -tp shanghai-64   -Bstatic_pgi 

Fortran benchmarks:

 -fastsse   -Msmartalloc=huge   -Mfprelaxed   -Mvect=short   -Mipa=fast   -Mipa=inline   -tp shanghai-64   -Bstatic_pgi 

Benchmarks using both Fortran and C:

 -fastsse   -Msmartalloc=huge   -Mfprelaxed   -Mipa=fast   -Mipa=inline   -tp shanghai-64   -Mvect=short   -Bstatic_pgi 

Base Other Flags

C benchmarks:

 -Mipa=jobs:4 

C++ benchmarks:

 -Mipa=jobs:4 

Fortran benchmarks:

 -Mipa=jobs:4 

Benchmarks using both Fortran and C:

 -Mipa=jobs:4 

Peak Compiler Invocation

C benchmarks:

 pgcc 

C++ benchmarks (except as noted below):

 openCC 
444.namd:  pgcpp 

Fortran benchmarks (except as noted below):

 openf95 
410.bwaves:  pgf95 
434.zeusmp:  pgf95 
437.leslie3d:  pgf95 

Benchmarks using both Fortran and C (except as noted below):

 pgcc   pgf95 
435.gromacs:  opencc   openf95 

Peak Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64 
436.cactusADM:  -DSPEC_CPU_LP64   -Mnomain 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64   -Mnomain 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_CASE_FLAG   -DSPEC_CPU_LINUX 
482.sphinx3:  -DSPEC_CPU_LP64 

Peak Optimization Flags

C benchmarks:

433.milc:  basepeak = yes 
470.lbm:  -fastsse   -Msmartalloc=huge   -Mprefetch=t0   -Mloop32   -Mfprelaxed   -Mipa=fast   -Mipa=inline   -tp shanghai-64   -Bstatic_pgi 
482.sphinx3:  -Mpfi=indirect(pass 1)   -Mpfo=indirect(pass 2)   -Mipa=fast(pass 2)   -Mipa=inline(pass 2)   -fastsse   -Mfprelaxed   -Msmartalloc   -tp shanghai-64   -Bstatic_pgi 

C++ benchmarks:

444.namd:  -Mpfi(pass 1)   -Mpfo(pass 2)   -Mipa=fast(pass 2)   -Mipa=inline(pass 2)   -fastsse   -Munroll=n:4   -Munroll=m:8   -Msmartalloc=huge   -Mnodepchk   -Mfprelaxed   --zc_eh   -tp shanghai-64   -Bstatic_pgi 
447.dealII:  -march=barcelona   -Ofast   -static   -INLINE:aggressive=on   -LNO:opt=0   -Wf,-fno-exceptions   -m32   -OPT:unroll_times_max=8   -OPT:unroll_size=256   -OPT:unroll_level=2   -HP:bdt=2m:heap=2m   -GRA:unspill=on   -CG:cmp_peep=on   -TENV:frame_pointer=off 
450.soplex:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -O3   -INLINE:aggressive=on   -OPT:IEEE_arith=3   -OPT:IEEE_NaN_Inf=off   -OPT:fold_unsigned_relops=on   -OPT:malloc_alg=1   -CG:load_exe=0   -fno-exceptions   -m32   -HP:bdt=2m 
453.povray:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -INLINE:aggressive=on   -HP:bdt=2m:heap=2m 

Fortran benchmarks:

410.bwaves:  -fastsse   -Msmartalloc   -Mprefetch=nta   -Mfprelaxed   -Mipa=fast   -Mipa=inline   -tp shanghai-64   -Bstatic_pgi 
416.gamess:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -O2   -OPT:Ofast   -OPT:ro=3   -OPT:unroll_size=256   -HP:bdt=2m:heap=2m 
434.zeusmp:  -fastsse   -Mfprelaxed   -Mprefetch=distance:8   -Mprefetch=t0   -Msmartalloc=huge   -Msmartalloc=hugebss   -Mipa=fast   -Mipa=inline   -tp shanghai-64   -Bstatic_pgi 
437.leslie3d:  -Mpfi=indirect(pass 1)   -Mpfo=indirect(pass 2)   -Mipa=fast(pass 2)   -Mipa=inline(pass 2)   -fastsse   -Mvect=fuse   -Msmartalloc=huge   -Mprefetch=distance:8   -Mprefetch=t0   -Mfprelaxed   -tp shanghai-64   -Bstatic_pgi 
459.GemsFDTD:  -march=barcelona   -Ofast   -LNO:fission=2   -LNO:simd=2   -LNO:prefetch_ahead=1   -CG:load_exe=0   -HP 
465.tonto:  -march=barcelona   -Ofast   -OPT:alias=no_f90_pointer_alias   -LNO:blocking=off   -CG:load_exe=1   -IPA:plimit=525   -HP 

Benchmarks using both Fortran and C:

435.gromacs:  -march=barcelona   -Ofast   -OPT:rsqrt=2   -HP:bdt=2m:heap=2m 
436.cactusADM:  -fastsse   -Mconcur   -Msmartalloc=huge   -Mfprelaxed   -Mipa=fast   -Mipa=inline   -tp shanghai-64   -Bstatic_pgi 
454.calculix:  -Mpfi=indirect(pass 1)   -Mpfo=indirect(pass 2)   -Mipa=fast(pass 2)   -Mipa=inline(pass 2)   -fastsse   -Mvect=short   -Msmartalloc=huge   -Mprefetch=t0   -Mpre   -Mfprelaxed   -tp shanghai-64   -Bstatic_pgi 
481.wrf:  -fastsse   -Mvect=noaltcode   -Msmartalloc=huge   -Mprefetch=distance:8   -Mfprelaxed   -tp shanghai-64   -Bstatic_pgi 

Peak Other Flags

C benchmarks:

 -Mipa=jobs:4(pass 2) 

C++ benchmarks:

444.namd:  -Mipa=jobs:4(pass 2) 

Fortran benchmarks:

410.bwaves:  -Mipa=jobs:4 
434.zeusmp:  -Mipa=jobs:4 
437.leslie3d:  -Mipa=jobs:4(pass 2) 

Benchmarks using both Fortran and C:

436.cactusADM:  -Mipa=jobs:4 
454.calculix:  -Mipa=jobs:4(pass 2) 

The flags files that were used to format this result can be browsed at
http://www.spec.org/cpu2006/flags/amd-platform.20090929.html,
http://www.spec.org/cpu2006/flags/pgi80_linux_flags.20090915.html,
http://www.spec.org/cpu2006/flags/x86-open64-4.2.2-flags-revE.20090915.html.

You can also download the XML flags sources by saving the following links:
http://www.spec.org/cpu2006/flags/amd-platform.20090929.xml,
http://www.spec.org/cpu2006/flags/pgi80_linux_flags.20090915.xml,
http://www.spec.org/cpu2006/flags/x86-open64-4.2.2-flags-revE.20090915.xml.