SPEC® CFP2006 Result

Copyright 2006-2016 Standard Performance Evaluation Corporation

IBM Corporation

IBM BladeCenter LS42 (AMD Opteron 8347 HE)

CPU2006 license: 11
Test date: Jun-2008
Test sponsor: IBM Corporation
Hardware Availability: Sep-2008
Tested by: IBM Corporation
Software Availability: May-2008
Benchmark results graph
Hardware
CPU Name: AMD Opteron 8347 HE
CPU Characteristics:
CPU MHz: 1900
FPU: Integrated
CPU(s) enabled: 16 cores, 4 chips, 4 cores/chip
CPU(s) orderable: 1,2,3,4 chips
Primary Cache: 64 KB I + 64 KB D on chip per core
Secondary Cache: 512 KB I+D on chip per core
L3 Cache: 2 MB I+D on chip per chip
Other Cache: None
Memory: 64 GB (16 x 4 GB DDR2-6400 ECC)
Disk Subsystem: 1 x 73 GB SAS, 10000 RPM
Other Hardware: None
Software
Operating System: SuSE Linux Enterprise Server 10 (x86_64) SP1, Kernel 2.6.16.46-0.12-smp
Compiler: PGI Server Complete Version 7.2; PathScale Compiler Suite Version 3.1
Auto Parallel: No
File System: ext2
System State: Run level 3 (Full multiuser with network)
Base Pointers: 64-bit
Peak Pointers: 32/64-bit
Other Software: None

Results Table

Results appear in the order in which they were run. Bold underlined text indicates a median measurement.

                                    Base                           |                        Peak
Benchmark     Copies  Seconds Ratio  Seconds Ratio  Seconds Ratio | Copies  Seconds Ratio  Seconds Ratio  Seconds Ratio
410.bwaves        16     2366  91.9     2373  91.6     2388  91.1 |     16     2260  96.2     2260  96.2     2265  96.0
416.gamess        16     1678   187     1680   186     1674   187 |     16     1551   202     1588   197     1550   202
433.milc          16     1809  81.2     1823  80.6     1818  80.8 |     16     1814  81.0     1816  80.9     1829  80.3
434.zeusmp        16     1025   142     1020   143     1028   142 |     16     1024   142     1025   142     1024   142
435.gromacs       16      805   142      805   142      804   142 |     16      652   175      653   175      652   175
436.cactusADM     16     1272   150     1272   150     1271   150 |     16     1306   146     1304   147     1293   148
437.leslie3d      16     2170  69.3     2174  69.2     2173  69.2 |     16     1994  75.4     1981  75.9     1980  76.0
444.namd          16      914   140      913   141      914   140 |     16      806   159      805   159      805   159
447.dealII        16     1077   170     1060   173     1038   176 |     16      732   250      728   251      734   249
450.soplex        16     1610  82.9     1609  82.9     1606  83.1 |     16     1581  84.4     1572  84.9     1574  84.8
453.povray        16      449   190      450   189      449   190 |     16      376   226      376   226      376   227
454.calculix      16      715   185      715   185      714   185 |     16      714   185      715   185      714   185
459.GemsFDTD      16     2400  70.7     2402  70.7     2413  70.4 |     16     2225  76.3     2237  75.9     2231  76.1
465.tonto         16     1014   155     1005   157     1003   157 |     16      878   179      875   180      876   180
470.lbm           16     2653  82.9     2661  82.6     2652  82.9 |     16     2698  81.5     2744  80.1     2695  81.6
481.wrf           16     1355   132     1349   132     1348   133 |     16     1373   130     1380   130     1379   130
482.sphinx3       16     2930   106     2936   106     2946   106 |     16     2707   115     2705   115     2708   115
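
The table can be reduced to one number per benchmark by taking the median of the three ratios, and to one number per column by taking the geometric mean of those medians. The Python sketch below does this with the ratios copied from the table. It makes two assumptions that this report does not state: the 410.bwaves reference time of 13590 seconds, which is consistent with the table values, and the formula `copies * reference_seconds / run_seconds` for a rate ratio. The function and variable names are illustrative only, and because the inputs are rounded ratios, the means may differ slightly from officially published figures.

```python
from statistics import geometric_mean, median

# Ratios copied from the results table: benchmark -> (base ratios, peak ratios).
RATIOS = {
    "410.bwaves":    ((91.9, 91.6, 91.1), (96.2, 96.2, 96.0)),
    "416.gamess":    ((187, 186, 187),    (202, 197, 202)),
    "433.milc":      ((81.2, 80.6, 80.8), (81.0, 80.9, 80.3)),
    "434.zeusmp":    ((142, 143, 142),    (142, 142, 142)),
    "435.gromacs":   ((142, 142, 142),    (175, 175, 175)),
    "436.cactusADM": ((150, 150, 150),    (146, 147, 148)),
    "437.leslie3d":  ((69.3, 69.2, 69.2), (75.4, 75.9, 76.0)),
    "444.namd":      ((140, 141, 140),    (159, 159, 159)),
    "447.dealII":    ((170, 173, 176),    (250, 251, 249)),
    "450.soplex":    ((82.9, 82.9, 83.1), (84.4, 84.9, 84.8)),
    "453.povray":    ((190, 189, 190),    (226, 226, 227)),
    "454.calculix":  ((185, 185, 185),    (185, 185, 185)),
    "459.GemsFDTD":  ((70.7, 70.7, 70.4), (76.3, 75.9, 76.1)),
    "465.tonto":     ((155, 157, 157),    (179, 180, 180)),
    "470.lbm":       ((82.9, 82.6, 82.9), (81.5, 80.1, 81.6)),
    "481.wrf":       ((132, 132, 133),    (130, 130, 130)),
    "482.sphinx3":   ((106, 106, 106),    (115, 115, 115)),
}


def rate_ratio(copies, reference_seconds, run_seconds):
    """Rate ratio for one run (assumed formula, see the lead-in)."""
    return copies * reference_seconds / run_seconds


def median_ratios(column):
    """Median of the three ratios per benchmark; column 0 is base, 1 is peak."""
    return {name: median(runs[column]) for name, runs in RATIOS.items()}


base = median_ratios(0)
peak = median_ratios(1)

# Geometric mean of the per-benchmark medians, computed from rounded ratios.
base_mean = geometric_mean(list(base.values()))
peak_mean = geometric_mean(list(peak.values()))

# Benchmarks where the peak build is slower than the base build.
peak_regressions = sorted(name for name in RATIOS if peak[name] < base[name])

print(f"base geometric mean: {base_mean:.1f}")
print(f"peak geometric mean: {peak_mean:.1f}")
print("peak slower than base:", ", ".join(peak_regressions))
```

With the assumed reference time, `rate_ratio(16, 13590, 2366)` rounds to the 91.9 shown for the first 410.bwaves base run, which is a quick sanity check on the formula.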

Operating System Notes

 'numactl' was used to bind copies to the cores
 'ulimit -s unlimited' was used to remove the stack size limit
 'ulimit -l 4915200' was used to set the locked-pages-in-memory limit
 Environment variable PGI_HUGE_PAGES set to 896
 Set vm/nr_hugepages=14336 in /etc/sysctl.conf
 mount -t hugetlbfs nodev /mnt/hugepages
 Processor Performance States Disabled in BIOS
 Memory ChipKill Disabled in BIOS
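
The huge page settings above are related by simple arithmetic, which the Python sketch below works through. It assumes a 2 MiB huge page size (the usual default on x86_64 Linux, not stated in this report), reads PGI_HUGE_PAGES as a per-process cap, and treats the `ulimit -l` value as KiB. All constant names are illustrative.

```python
COPIES = 16               # copies per benchmark, from the results table
PGI_HUGE_PAGES = 896      # environment variable from the notes above
NR_HUGEPAGES = 14336      # vm/nr_hugepages from /etc/sysctl.conf
MEMLOCK_KIB = 4915200     # argument passed to 'ulimit -l'
HUGE_PAGE_MIB = 2         # assumption: 2 MiB huge pages on this kernel

# If every copy uses its full allowance, the copies together need this many pages.
pages_for_all_copies = COPIES * PGI_HUGE_PAGES

# Size of the reserved pool and of one copy's share, in GiB.
pool_gib = NR_HUGEPAGES * HUGE_PAGE_MIB / 1024
per_copy_gib = PGI_HUGE_PAGES * HUGE_PAGE_MIB / 1024

# Locked-memory limit per process, in GiB.
memlock_gib = MEMLOCK_KIB / (1024 * 1024)

print(f"pages for {COPIES} copies: {pages_for_all_copies} (pool holds {NR_HUGEPAGES})")
print(f"huge page pool: {pool_gib} GiB, per copy: {per_copy_gib} GiB")
print(f"locked-memory limit: {memlock_gib} GiB per process")
```

Under these assumptions the pool is sized for exactly 16 copies at 896 pages each, and it reserves 28 GiB of the 64 GB installed.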

Base Compiler Invocation

C benchmarks:

 pgcc 

C++ benchmarks:

 pgcpp 

Fortran benchmarks:

 pgf95 

Benchmarks using both Fortran and C:

 pgcc   pgf95 

Base Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64   -Mnomain 
436.cactusADM:  -DSPEC_CPU_LP64   -Mnomain 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
447.dealII:  -DSPEC_CPU_LP64 
450.soplex:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64   -Mnomain 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_CASE_FLAG   -DSPEC_CPU_LINUX 
482.sphinx3:  -DSPEC_CPU_LP64 

Base Optimization Flags

C benchmarks:

 -fast   -Mipa=fast   -Mipa=inline   -Mfprelaxed   -Msmartalloc=huge:150   -tp barcelona-64   -Bstatic_pgi 

C++ benchmarks:

 -fast   -Mipa=fast   -Mipa=inline   -Mfprelaxed   -Msmartalloc=huge:150   --zc_eh   -tp barcelona-64   -Bstatic_pgi 

Fortran benchmarks:

 -fast   -Mipa=fast   -Mipa=inline   -Mfprelaxed   -Msmartalloc=huge:150   -tp barcelona-64   -Bstatic_pgi 

Benchmarks using both Fortran and C:

 -fast   -Mipa=fast   -Mipa=inline   -Mfprelaxed   -Msmartalloc=huge:150   -tp barcelona-64   -Bstatic_pgi 

Base Other Flags

C benchmarks:

 -w   -Mipa=jobs:4 

C++ benchmarks:

 -w   -Mipa=jobs:4 

Fortran benchmarks:

 -w   -Mipa=jobs:4 

Benchmarks using both Fortran and C:

 -w   -Mipa=jobs:4 

Peak Compiler Invocation

C benchmarks (except as noted below):

 pathcc 
433.milc:  pgcc 

C++ benchmarks (except as noted below):

 pathCC 
444.namd:  pgcpp 

Fortran benchmarks (except as noted below):

 pathf95 
410.bwaves:  pgf95 
434.zeusmp:  pgf95 

Benchmarks using both Fortran and C (except as noted below):

 pgcc   pgf95 
436.cactusADM:  pathcc   pathf95 
481.wrf:  pathcc   pathf95 

Peak Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64   -Mnomain 
436.cactusADM:  -DSPEC_CPU_LP64   -fno-second-underscore 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64   -Mnomain 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_LINUX   -fno-second-underscore 
482.sphinx3:  -DSPEC_CPU_LP64 

Peak Optimization Flags

C benchmarks:

433.milc:  -fastsse   -Msmartalloc=huge:150   -Msafeptr   -Mfprelaxed   -Mipa=inline   -Mipa=arg   -Mipa=const   -Mipa=ptr   -Mipa=shape   -tp barcelona-64   -Bstatic_pgi 
470.lbm:  -march=barcelona   -Ofast   -m3dnow 
482.sphinx3:  -march=barcelona   -Ofast 

C++ benchmarks:

444.namd:  -Mpfi(pass 1)   -Mipa=fast(pass 2)   -Mipa=inline(pass 2)   -Mpfo(pass 2)   -fast   -Mfprelaxed   -Msmartalloc=huge:150   --zc_eh   -Mnodepchk   -Munroll=n:4   -Munroll=m:8   -tp barcelona-64   -Bstatic_pgi 
447.dealII:  -march=barcelona   -Ofast   -static   -INLINE:aggressive=on   -OPT:malloc_alg=1   -m32   -fno-exceptions 
450.soplex:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -m32   -O3   -TENV:frame_pointer=off   -LNO:prefetch=1 
453.povray:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -CG:load_exe=0 

Fortran benchmarks:

410.bwaves:  -Mpfi(pass 1)   -Mipa=fast(pass 2)   -Mipa=inline(pass 2)   -Mpfo(pass 2)   -fastsse   -Mfprelaxed   -Msmartalloc   -Mprefetch=distance:12   -Mprefetch=nta   -tp barcelona-64   -Bstatic_pgi 
416.gamess:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -O2   -OPT:Ofast   -OPT:ro=3   -OPT:unroll_size=256 
434.zeusmp:  -fastsse   -Mfprelaxed   -Msmartalloc=huge:150   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 
437.leslie3d:  -march=barcelona   -Ofast   -m3dnow   -OPT:unroll_size=256   -CG:load_exe=0   -OPT:malloc_alg=1 
459.GemsFDTD:  -march=barcelona   -Ofast   -LNO:fission=2   -LNO:simd=2   -OPT:malloc_alg=1 
465.tonto:  -march=barcelona   -Ofast   -OPT:malloc_alg=1   -OPT:alias=no_f90_pointer_alias   -LNO:blocking=off   -CG:load_exe=1   -IPA:plimit=525 

Benchmarks using both Fortran and C:

435.gromacs:  -fast   -Mfpapprox=rsqrt   -Mipa=fast   -Mipa=inline   -Mfprelaxed   -Msmartalloc=huge:150   -tp barcelona-64   -Bstatic_pgi 
436.cactusADM:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -WOPT:aggstr=0 
454.calculix:  -fastsse   -Mfprelaxed   -Msmartalloc=huge:150   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 
481.wrf:  -march=barcelona   -Ofast   -LNO:blocking=off   -LNO:prefetch_ahead=10   -OPT:malloc_alg=1   -m3dnow   -LANG:copyinout=off   -IPA:callee_limit=5000 
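
Several of the flag lists above tag individual flags with "(pass 1)" or "(pass 2)", which marks a two-pass, feedback-directed build. The Python sketch below splits such a line into one flag list per pass. It assumes that untagged flags apply to both passes and that flags are separated by two or more spaces, as they are in this listing. The helper `split_by_pass` is hypothetical and is not part of the SPEC tools.

```python
import re

# A flag followed directly by a "(pass N)" tag, e.g. "-fb_create fbdata(pass 1)".
PASS_TAG = re.compile(r"^(?P<flag>.*?)\(pass (?P<n>\d+)\)$")


def split_by_pass(flag_line, passes=(1, 2)):
    """Return {pass number: [flags]} for one line of the flag listing."""
    per_pass = {n: [] for n in passes}
    # Flags are separated by runs of spaces; a single space stays inside a
    # flag, as in "-tp barcelona-64" or "-fb_create fbdata".
    for token in re.split(r"\s{2,}", flag_line.strip()):
        tagged = PASS_TAG.match(token)
        if tagged:
            number = int(tagged.group("n"))
            per_pass.setdefault(number, []).append(tagged.group("flag").strip())
        else:
            # Assumption: an untagged flag is used in every pass.
            for n in passes:
                per_pass[n].append(token)
    return per_pass


# The 444.namd peak optimization flags, copied from the listing above.
NAMD = (
    "-Mpfi(pass 1)   -Mipa=fast(pass 2)   -Mipa=inline(pass 2)   -Mpfo(pass 2)   "
    "-fast   -Mfprelaxed   -Msmartalloc=huge:150   --zc_eh   -Mnodepchk   "
    "-Munroll=n:4   -Munroll=m:8   -tp barcelona-64   -Bstatic_pgi"
)

namd_passes = split_by_pass(NAMD)
for number, flags in sorted(namd_passes.items()):
    print(f"pass {number}: {' '.join(flags)}")
```

Read this way, pass 1 of 444.namd builds with `-Mpfi` to collect a profile, and pass 2 rebuilds with `-Mpfo` and the `-Mipa` options to use it.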

Peak Other Flags

C benchmarks:

433.milc:  -w   -Mipa=jobs:4 

C++ benchmarks:

444.namd:  -w   -Mipa=jobs:4(pass 2) 

Fortran benchmarks:

410.bwaves:  -w   -Mipa=jobs:4(pass 2) 
434.zeusmp:  -w   -Mipa=jobs:4 

Benchmarks using both Fortran and C:

435.gromacs:  -w   -Mipa=jobs:4 
454.calculix:  -w   -Mipa=jobs:4 

The flags file that was used to format this result can be browsed at
http://www.spec.org/cpu2006/flags/amd123GH-flags.20090714.01.html.

You can also download the XML flags source by saving the following link:
http://www.spec.org/cpu2006/flags/amd123GH-flags.20090714.01.xml.