SPEC® CFP2006 Result

Copyright 2006-2014 Standard Performance Evaluation Corporation

IBM Corporation

IBM BladeCenter LS42 (AMD Opteron 8347 HE)

SPECfp®2006 = 15.3

CPU2006 license: 11 Test date: Aug-2008
Test sponsor: IBM Corporation Hardware Availability: Sep-2008
Tested by: IBM Corporation Software Availability: May-2008
Benchmark results graph
Hardware
CPU Name: AMD Opteron 8347 HE
CPU Characteristics:
CPU MHz: 1900
FPU: Integrated
CPU(s) enabled: 16 cores, 4 chips, 4 cores/chip
CPU(s) orderable: 1,2,3,4 chips
Primary Cache: 64 KB I + 64 KB D on chip per core
Secondary Cache: 512 KB I+D on chip per core
L3 Cache: 2 MB I+D on chip per chip
Other Cache: None
Memory: 64 GB (16 x 4 GB DDR2-6400 ECC)
Disk Subsystem: 1 x 73 GB SAS, 10000 RPM
Other Hardware: None
Software
Operating System: SuSE Linux Enterprise Server 10 (x86_64) SP1,
Kernel 2.6.16.46-0.12-smp
Compiler: PGI Server Complete Version 7.2
Auto Parallel: Yes
File System: ReiserFS
System State: Run level 3 (Full multiuser with network)
Base Pointers: 64-bit
Peak Pointers: 32/64-bit
Other Software: binutils 2.18.50

Results Table

Benchmark Base Peak
Seconds Ratio Seconds Ratio Seconds Ratio Seconds Ratio Seconds Ratio Seconds Ratio
Results appear in the order in which they were run. Bold underlined text indicates a median measurement.
410.bwaves 303   44.8  291   46.7  320 42.4  241   56.5  241   56.3  242   56.2 
416.gamess 1719   11.4  1723   11.4  1724 11.4  1747   11.2  1747   11.2  1752   11.2 
433.milc 792   11.6  794   11.6  796 11.5  768   12.0  768   11.9  775   11.8 
434.zeusmp 882   10.3  882   10.3  883 10.3  882   10.3  882   10.3  883   10.3 
435.gromacs 755   9.46 755   9.45 755 9.46 620   11.5  621   11.5  622   11.5 
436.cactusADM 98.1 122    96.7 124    101 118    97.4 123    94.7 126    95.6 125   
437.leslie3d 963   9.76 965   9.74 965 9.74 954   9.86 824   11.4  822   11.4 
444.namd 1032   7.77 1033   7.77 1034 7.76 929   8.63 932   8.61 927   8.65
447.dealII 907   12.6  915   12.5  914 12.5  821   13.9  822   13.9  820   14.0 
450.soplex 1128   7.39 1125   7.41 1130 7.38 1128   7.39 1125   7.41 1130   7.38
453.povray 463   11.5  477   11.2  474 11.2  461   11.5  517   10.3  572   9.30
454.calculix 704   11.7  704   11.7  707 11.7  640   12.9  637   13.0  639   12.9 
459.GemsFDTD 437   24.3  431   24.6  431 24.6  437   24.3  431   24.6  431   24.6 
465.tonto 924   10.6  923   10.7  923 10.7  836   11.8  837   11.8  834   11.8 
470.lbm 997   13.8  997   13.8  997 13.8  991   13.9  993   13.8  994   13.8 
481.wrf 725   15.4  722   15.5  722 15.5  672   16.6  666   16.8  671   16.7 
482.sphinx3 1549   12.6  1570   12.4  1637 11.9  1382   14.1  1378   14.1  1377   14.2 

Submit Notes

The config file option 'submit' was used.

Operating System Notes

 'numactl' was used to bind copies to the cores.
 Environment stack size set to 'unlimited'.
 'ulimit -l 2097152' was used to set environment locked pages in memory quantity.
 NCPUS set to number of cores.
 PGI_HUGE_PAGES set to 896.
 Set vm/nr_hugepages=14336 in /etc/sysctl.conf
 mount -t hugetlbfs none /mnt/hugepages
 Processor Performance States Disabled in BIOS
 Memory ChipKill Disabled in BIOS

Base Compiler Invocation

C benchmarks:

 pgcc 

C++ benchmarks:

 pgcpp 

Fortran benchmarks:

 pgf95 

Benchmarks using both Fortran and C:

 pgcc   pgf95 

Base Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64   -Mnomain 
436.cactusADM:  -DSPEC_CPU_LP64   -Mnomain 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
447.dealII:  -DSPEC_CPU_LP64 
450.soplex:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64   -Mnomain 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_CASE_FLAG   -DSPEC_CPU_LINUX 
482.sphinx3:  -DSPEC_CPU_LP64 

Base Optimization Flags

C benchmarks:

 -fastsse   -Msmartalloc=huge:896   -Mconcur   -Mfprelaxed   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 

C++ benchmarks:

 -fastsse   -Msmartalloc=huge:896   -Mfprelaxed   -Mconcur   --zc_eh   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 

Fortran benchmarks:

 -fastsse   -Mfprelaxed   -Msmartalloc=huge:896   -Mconcur   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 

Benchmarks using both Fortran and C:

 -fastsse   -Msmartalloc=huge:896   -Mconcur   -Mfprelaxed   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 

Base Other Flags

C benchmarks:

 -Mipa=jobs:8 

C++ benchmarks:

 -Mipa=jobs:8 

Fortran benchmarks:

 -Mipa=jobs:8 

Benchmarks using both Fortran and C:

 -Mipa=jobs:8 

Peak Compiler Invocation

C benchmarks:

 pgcc 

C++ benchmarks:

 pgcpp 

Fortran benchmarks:

 pgf95 

Benchmarks using both Fortran and C:

 pgcc   pgf95 

Peak Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64   -Mnomain 
436.cactusADM:  -DSPEC_CPU_LP64   -Mnomain 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
450.soplex:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64   -Mnomain 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_CASE_FLAG   -DSPEC_CPU_LINUX 
482.sphinx3:  -DSPEC_CPU_LP64 

Peak Optimization Flags

C benchmarks:

433.milc:  -fastsse   -Msmartalloc=huge:896   -Msafeptr   -Mconcur   -Mfprelaxed   -Mipa=inline   -Mipa=arg   -Mipa=const   -Mipa=ptr   -Mipa=shape   -tp barcelona-64   -Bstatic_pgi 
470.lbm:  -fastsse   -Msmartalloc=huge:896   -Mprefetch=t0   -Mloop32   -Mfprelaxed   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 
482.sphinx3:  -Mpfi(pass 1)   -Mpfo(pass 2)   -Mipa=fast(pass 2)   -Mipa=inline(pass 2)   -fastsse   -Mfprelaxed   -Msmartalloc   -tp barcelona-64   -Bstatic_pgi 

C++ benchmarks:

444.namd:  -Mpfi(pass 1)   -Mpfo(pass 2)   -Mipa=fast(pass 2)   -Mipa=inline(pass 2)   -fastsse   -Munroll=n:4   -Munroll=m:8   -Msmartalloc=huge:896   -Mnodepchk   -Mfprelaxed   --zc_eh   -tp barcelona-64   -Bstatic_pgi 
447.dealII:  -fastsse   -alias=ansi   -Msmartalloc=huge:896   -Mprefetch=t0   -Mnovect   -Mfprelaxed   --zc_eh   -Mipa=fast   -Mipa=inline   -tp barcelona-32   -Bstatic_pgi 
450.soplex:  basepeak = yes 
453.povray:  -Mpfi=indirect(pass 1)   -Mpfo=indirect(pass 2)   -Mipa=fast(pass 2)   -Mipa=inlinenopfo:3(pass 2)   -Mipa=staticfunc(pass 2)   -fastsse   -Msmartalloc=huge:896   -Mprefetch=t0   -Mfprelaxed   -tp barcelona-64   -Bstatic_pgi 

Fortran benchmarks:

410.bwaves:  -fastsse   -Msmartalloc   -Mprefetch=distance:12   -Mprefetch=nta   -Mconcur   -Mloop32   -Mpre   -Mfprelaxed   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 
416.gamess:  -Mpfi(pass 1)   -Mpfo(pass 2)   -Mipa=fast(pass 2)   -Mipa=inline(pass 2)   -fastsse   -Mvect=noaltcode   -Mprefetch=t0   -Mfprelaxed   -tp barcelona-64   -Bstatic_pgi 
434.zeusmp:  basepeak = yes 
437.leslie3d:  -Mpfi=indirect(pass 1)   -Mpfo=indirect(pass 2)   -Mconcur=noaltcode(pass 2)   -Mipa=fast(pass 2)   -Mipa=inline(pass 2)   -fastsse   -Mvect=fuse   -Msmartalloc=huge:896   -Mprefetch=distance:8   -Mprefetch=t0   -Mfprelaxed   -tp barcelona-64   -Bstatic_pgi 
459.GemsFDTD:  basepeak = yes 
465.tonto:  -fastsse   -O4   -Mvect=noaltcode   -Msmartalloc=huge:896   -Mprefetch=distance:8   -Mprefetch=t0   -Mfprelaxed   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 

Benchmarks using both Fortran and C:

435.gromacs:  -fastsse   -Msmartalloc=huge:896   -Mfprelaxed   -Mconcur   -Mfpapprox=rsqrt   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 
436.cactusADM:  -fastsse   -Msmartalloc=huge:896   -Mfprelaxed   -Mconcur   -Mdse   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 
454.calculix:  -Mpfi=indirect(pass 1)   -Mpfo=indirect(pass 2)   -Mipa=fast(pass 2)   -Mipa=inline(pass 2)   -fastsse   -Msmartalloc=huge:896   -Mloop32   -Mprefetch=t0   -Mpre   -Mfprelaxed   -tp barcelona-64   -Bstatic_pgi 
481.wrf:  -fastsse   -Mvect=noaltcode   -Msmartalloc   -Mprefetch=distance:8   -Mconcur=noaltcode   -Mfprelaxed   -tp barcelona-64   -Bstatic_pgi 

Peak Other Flags

C benchmarks:

 -Mipa=jobs:8(pass 2) 

C++ benchmarks:

 -Mipa=jobs:8(pass 2) 

Fortran benchmarks:

 -Mipa=jobs:8 

Benchmarks using both Fortran and C (except as noted below):

 -Mipa=jobs:8(pass 2) 
481.wrf:  No flags used 

The flags file that was used to format this result can be browsed at
http://www.spec.org/cpu2006/flags/pgi72_flags.html.

You can also download the XML flags source by saving the following link:
http://www.spec.org/cpu2006/flags/pgi72_flags.xml.