SPEC® CFP2006 Result

Copyright 2006-2014 Standard Performance Evaluation Corporation

IBM Corporation

IBM System x3755 (AMD Opteron 8360 SE)

CPU2006 license: 11 Test date: Apr-2008
Test sponsor: IBM Corporation Hardware Availability: Jul-2008
Tested by: Advanced Micro Devices Software Availability: May-2008
Benchmark results graph
Hardware
CPU Name: AMD Opteron 8360 SE
CPU Characteristics:
CPU MHz: 2500
FPU: Integrated
CPU(s) enabled: 16 cores, 4 chips, 4 cores/chip
CPU(s) orderable: 1,2,3,4 chips
Primary Cache: 64 KB I + 64 KB D on chip per core
Secondary Cache: 512 KB I+D on chip per core
L3 Cache: 2 MB I+D on chip per chip
Other Cache: None
Memory: 32 GB (16 x 2 GB, DDR2-667, CL5, Reg, Dual Rank)
Disk Subsystem: 1 x 73.4 GB SAS, 15000 RPM
Other Hardware: None
Software
Operating System: SuSE Linux Enterprise Server 10 (x86_64) SP1,
Kernel 2.6.16.46-0.12-smp
Compiler: PGI Server Complete Version 7.2
PathScale Compiler Suite Version 3.1
Auto Parallel: No
File System: ReiserFS
System State: Runlevel 3 (Full multiuser with network)
Base Pointers: 64-bit
Peak Pointers: 32/64-bit
Other Software: None

Results Table

Benchmark Base Peak
Copies Seconds Ratio Seconds Ratio Seconds Ratio Copies Seconds Ratio Seconds Ratio Seconds Ratio
Results appear in the order in which they were run. Bold underlined text indicates a median measurement.
410.bwaves 16 1857 117   1856 117   1859 117   16 1753 124   1763 123   1759 124  
416.gamess 16 1274 246   1278 245   1275 246   16 1175 267   1179 266   1173 267  
433.milc 16 1392 106   1394 105   1390 106   16 1364 108   1363 108   1362 108  
434.zeusmp 16 861 169   870 167   863 169   16 871 167   872 167   864 169  
435.gromacs 16 618 185   618 185   618 185   16 503 227   502 227   503 227  
436.cactusADM 16 1148 167   1167 164   1157 165   16 1067 179   1067 179   1071 178  
437.leslie3d 16 1688 89.1 1686 89.2 1686 89.2 16 1552 96.9 1555 96.7 1557 96.6
444.namd 16 700 183   696 184   696 184   16 615 209   614 209   613 209  
447.dealII 16 867 211   855 214   827 221   16 594 308   601 305   608 301  
450.soplex 16 1431 93.2 1359 98.2 1357 98.4 16 1393 95.8 1349 98.9 1350 98.9
453.povray 16 344 248   342 249   342 249   16 286 297   286 298   286 298  
454.calculix 16 557 237   555 238   554 238   16 556 237   557 237   555 238  
459.GemsFDTD 16 1905 89.1 1912 88.8 1913 88.7 16 1814 93.6 1812 93.7 1809 93.8
465.tonto 16 808 195   810 194   808 195   16 716 220   713 221   710 222  
470.lbm 16 2376 92.5 2393 91.9 2380 92.4 16 2290 96.0 2289 96.1 2290 96.0
481.wrf 16 1078 166   1079 166   1083 165   16 1079 166   1081 165   1079 166  
482.sphinx3 16 2355 132   2346 133   2345 133   16 2132 146   2123 147   2121 147  

Operating System Notes

 'numactl' was used to bind copies to the cores
 Environment variable PGI_HUGE_PAGES set to 150
 'ulimit -s unlimited' was used to set environment stack size
 'ulimit -l 4915200' was used to set environment locked pages in memory quantity
 Set vm/nr_hugepages=2400 in /etc/sysctl.conf
 mount -t hugetlbfs nodev /mnt/hugepages

Base Compiler Invocation

C benchmarks:

 pgcc 

C++ benchmarks:

 pgcpp 

Fortran benchmarks:

 pgf95 

Benchmarks using both Fortran and C:

 pgcc   pgf95 

Base Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64   -Mnomain 
436.cactusADM:  -DSPEC_CPU_LP64   -Mnomain 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
447.dealII:  -DSPEC_CPU_LP64 
450.soplex:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64   -Mnomain 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_CASE_FLAG   -DSPEC_CPU_LINUX 
482.sphinx3:  -DSPEC_CPU_LP64 

Base Optimization Flags

C benchmarks:

 -fast   -Mipa=jobs:4   -Mipa=fast   -Mipa=inline   -Mfprelaxed   -Msmartalloc=huge:150   -tp barcelona-64   -Bstatic_pgi 

C++ benchmarks:

 -fast   -Mipa=jobs:4   -Mipa=fast   -Mipa=inline   -Mfprelaxed   -Msmartalloc=huge:150   --zc_eh   -tp barcelona-64   -Bstatic_pgi 

Fortran benchmarks:

 -fast   -Mipa=jobs:4   -Mipa=fast   -Mipa=inline   -Mfprelaxed   -Msmartalloc=huge:150   -tp barcelona-64   -Bstatic_pgi 

Benchmarks using both Fortran and C:

 -fast   -Mipa=jobs:4   -Mipa=fast   -Mipa=inline   -Mfprelaxed   -Msmartalloc=huge:150   -tp barcelona-64   -Bstatic_pgi 

Base Other Flags

C benchmarks:

 -w 

C++ benchmarks:

 -w 

Fortran benchmarks:

 -w 

Benchmarks using both Fortran and C:

 -w 

Peak Compiler Invocation

C benchmarks (except as noted below):

 pathcc 
433.milc:  pgcc 

C++ benchmarks (except as noted below):

 pathCC 
444.namd:  pgcpp 

Fortran benchmarks (except as noted below):

 pathf95 
410.bwaves:  pgf95 
434.zeusmp:  pgf95 

Benchmarks using both Fortran and C (except as noted below):

 pgcc   pgf95 
436.cactusADM:  pathcc   pathf95 
481.wrf:  pathcc   pathf95 

Peak Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64   -Mnomain 
436.cactusADM:  -DSPEC_CPU_LP64   -fno-second-underscore 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64   -Mnomain 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_LINUX   -fno-second-underscore 
482.sphinx3:  -DSPEC_CPU_LP64 

Peak Optimization Flags

C benchmarks:

433.milc:  -fastsse   -Msmartalloc=huge:150   -Msafeptr   -Mfprelaxed   -Mipa=jobs:4   -Mipa=inline   -Mipa=arg   -Mipa=const   -Mipa=ptr   -Mipa=shape   -tp barcelona-64   -Bstatic_pgi 
470.lbm:  -march=barcelona   -Ofast   -m3dnow 
482.sphinx3:  -march=barcelona   -Ofast 

C++ benchmarks:

444.namd:  -Mpfi(pass 1)   -Mipa=jobs:4(pass 2)   -Mipa=fast(pass 2)   -Mipa=inline(pass 2)   -Mpfo(pass 2)   -fast   -Mfprelaxed   -Msmartalloc=huge:150   --zc_eh   -Mnodepchk   -Munroll=n:4   -Munroll=m:8   -tp barcelona-64   -Bstatic_pgi 
447.dealII:  -march=barcelona   -Ofast   -static   -INLINE:aggressive=on   -OPT:malloc_alg=1   -m32   -fno-exceptions 
450.soplex:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -m32   -O3   -TENV:frame_pointer=off   -LNO:prefetch=1 
453.povray:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -CG:load_exe=0 

Fortran benchmarks:

410.bwaves:  -Mpfi(pass 1)   -Mipa=jobs:4(pass 2)   -Mipa=fast(pass 2)   -Mipa=inline(pass 2)   -Mpfo(pass 2)   -fastsse   -Mfprelaxed   -Msmartalloc   -Mprefetch=distance:12   -Mprefetch=nta   -tp barcelona-64   -Bstatic_pgi 
416.gamess:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -O2   -OPT:Ofast   -OPT:ro=3   -OPT:unroll_size=256 
434.zeusmp:  -fastsse   -Mfprelaxed   -Msmartalloc=huge:150   -Mipa=jobs:4   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 
437.leslie3d:  -march=barcelona   -Ofast   -m3dnow   -OPT:unroll_size=256   -CG:load_exe=0   -OPT:malloc_alg=1 
459.GemsFDTD:  -march=barcelona   -Ofast   -LNO:fission=2   -LNO:simd=2   -OPT:malloc_alg=1 
465.tonto:  -march=barcelona   -Ofast   -OPT:malloc_alg=1   -OPT:alias=no_f90_pointer_alias   -LNO:blocking=off   -CG:load_exe=1   -IPA:plimit=525 

Benchmarks using both Fortran and C:

435.gromacs:  -fast   -Mfpapprox=rsqrt   -Mipa=jobs:4   -Mipa=fast   -Mipa=inline   -Mfprelaxed   -Msmartalloc=huge:150   -tp barcelona-64   -Bstatic_pgi 
436.cactusADM:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -WOPT:aggstr=0 
454.calculix:  -fastsse   -Mfprelaxed   -Msmartalloc=huge:150   -Mipa=jobs:4   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 
481.wrf:  -march=barcelona   -Ofast   -LNO:blocking=off   -LNO:prefetch_ahead=10   -OPT:malloc_alg=1   -m3dnow   -LANG:copyinout=off   -IPA:callee_limit=5000 

Peak Other Flags

C benchmarks:

433.milc:  -w 

C++ benchmarks:

444.namd:  -w 

Fortran benchmarks:

410.bwaves:  -w 
434.zeusmp:  -w 

Benchmarks using both Fortran and C:

435.gromacs:  -w 
454.calculix:  -w 

The flags file that was used to format this result can be browsed at
http://www.spec.org/cpu2006/flags/amd123GH-flags.20090714.03.html.

You can also download the XML flags source by saving the following link:
http://www.spec.org/cpu2006/flags/amd123GH-flags.20090714.03.xml.