SPEC® CFP2006 Result

Copyright 2006-2014 Standard Performance Evaluation Corporation

IBM Corporation

IBM BladeCenter LS42 (AMD Opteron 8384)

SPECfp®2006 = 23.8

CPU2006 license: 11 Test date: Oct-2008
Test sponsor: IBM Corporation Hardware Availability: Nov-2008
Tested by: Advanced Micro Devices Software Availability: May-2008
Benchmark results graph
Hardware
CPU Name: AMD Opteron 8384
CPU Characteristics:
CPU MHz: 2700
FPU: Integrated
CPU(s) enabled: 8 cores, 2 chips, 4 cores/chip
CPU(s) orderable: 1,2,3,4 chips
Primary Cache: 64 KB I + 64 KB D on chip per core
Secondary Cache: 512 KB I+D on chip per core
L3 Cache: 6 MB I+D on chip per chip
Other Cache: None
Memory: 32 GB (8 x 4 GB DDR2-6400 ECC)
Disk Subsystem: 1 x 73 GB SAS, 10000 RPM
Other Hardware: None
Software
Operating System: SuSE Linux Enterprise Server 10 (x86_64) SP2,
Kernel 2.6.16.60-0.21-smp
Compiler: PGI Server Complete Version 7.2
Auto Parallel: Yes
File System: ReiserFS
System State: Run level 3 (Full multiuser with network)
Base Pointers: 32/64-bit
Peak Pointers: 64-bit
Other Software: binutils 2.18.50
32-bit and 64-bit libhugetlbfs libraries

Results Table

Benchmark Base Peak
Seconds Ratio Seconds Ratio Seconds Ratio Seconds Ratio Seconds Ratio Seconds Ratio
Results appear in the order in which they were run. Bold underlined text indicates a median measurement.
410.bwaves 283 48.0 248 54.8 277 49.0 283 48.0 248 54.8 277 49.0
416.gamess 1191 16.4 1189 16.5 1194 16.4 1129 17.3 1129 17.3 1130 17.3
433.milc 465 19.7 464 19.8 464 19.8 454 20.2 454 20.2 454 20.2
434.zeusmp 567 16.0 572 15.9 569 16.0 532 17.1 536 17.0 527 17.3
435.gromacs 464 15.4 463 15.4 464 15.4 385 18.5 384 18.6 385 18.5
436.cactusADM 104 115   110 108   105 114   108 111   107 112   112 107  
437.leslie3d 522 18.0 521 18.1 521 18.1 554 17.0 554 17.0 494 19.0
444.namd 612 13.1 612 13.1 612 13.1 533 15.0 533 15.0 531 15.1
447.dealII 547 20.9 547 20.9 546 21.0 490 23.3 490 23.3 489 23.4
450.soplex 580 14.4 580 14.4 580 14.4 580 14.4 580 14.4 580 14.4
453.povray 315 16.9 315 16.9 312 17.1 287 18.5 287 18.5 288 18.5
454.calculix 471 17.5 471 17.5 471 17.5 382 21.6 383 21.5 381 21.6
459.GemsFDTD 340 31.2 340 31.2 340 31.2 340 31.2 340 31.2 340 31.2
465.tonto 596 16.5 595 16.5 596 16.5 539 18.3 543 18.1 538 18.3
470.lbm 458 30.0 458 30.0 457 30.0 458 30.0 458 30.0 457 30.0
481.wrf 456 24.5 457 24.5 456 24.5 410 27.2 407 27.5 408 27.4
482.sphinx3 978 19.9 979 19.9 975 20.0 778 25.1 778 25.0 779 25.0

Submit Notes

The config file option 'submit' was used.
 'numactl' was used to bind copies to the cores.

General Notes

The libhugetlbfs libraries were installed using the
installation rpms that came with the distribution.

'ulimit -s unlimited' was used to set environment stack size
'ulimit -l 2097152'  was used to set environment locked pages in memory limit

Set vm/nr_hugepages=7168 in /etc/sysctl.conf
mount -t hugetlbfs nodev /mnt/hugepages

Environment variables set by runspec before the start of the run:
LD_LIBRARY_PATH = "/root/work/cpu2006v1.1/pgi72/linux_lib64:/root/work/cpu2006v1.1/pgi72/linux_lib32"
PGI_HUGE_PAGES = "7168"
NCPUS = "8"
The powersaved was disabled, set the CPU frequency to its maximum.

Base Compiler Invocation

C benchmarks:

 pgcc 

C++ benchmarks:

 pgcpp 

Fortran benchmarks:

 pgf95 

Benchmarks using both Fortran and C:

 pgcc   pgf95 

Base Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64   -Mnomain 
436.cactusADM:  -DSPEC_CPU_LP64   -Mnomain 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
447.dealII:  -DSPEC_CPU_LP64 
450.soplex:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64   -Mnomain 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_CASE_FLAG   -DSPEC_CPU_LINUX 
482.sphinx3:  -DSPEC_CPU_LP64 

Base Optimization Flags

C benchmarks:

 -Mvect=cachesize:6291456   -fastsse   -Msmartalloc=huge   -Mconcur   -Mfprelaxed   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 

C++ benchmarks:

 -Mvect=cachesize:6291456   -fastsse   -Msmartalloc=huge   -Mfprelaxed   -Mconcur   --zc_eh   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 

Fortran benchmarks:

 -Mvect=cachesize:6291456   -fastsse   -Mfprelaxed   -Msmartalloc=huge   -Mconcur   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 

Benchmarks using both Fortran and C:

 -Mvect=cachesize:6291456   -fastsse   -Msmartalloc=huge   -Mconcur   -Mfprelaxed   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 

Base Other Flags

C benchmarks:

 -Mipa=jobs:8 

C++ benchmarks:

 -Mipa=jobs:8 

Fortran benchmarks:

 -Mipa=jobs:8 

Benchmarks using both Fortran and C:

 -Mipa=jobs:8 

Peak Compiler Invocation

C benchmarks:

 pgcc 

C++ benchmarks:

 pgcpp 

Fortran benchmarks:

 pgf95 

Benchmarks using both Fortran and C:

 pgcc   pgf95 

Peak Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64   -Mnomain 
436.cactusADM:  -DSPEC_CPU_LP64   -Mnomain 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
450.soplex:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64   -Mnomain 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_CASE_FLAG   -DSPEC_CPU_LINUX 
482.sphinx3:  -DSPEC_CPU_LP64 

Peak Optimization Flags

C benchmarks:

433.milc:  -Mvect=cachesize:6291456   -fastsse   -Msmartalloc=huge   -Msafeptr   -Mconcur   -Mfprelaxed   -Mipa=inline   -Mipa=arg   -Mipa=const   -Mipa=ptr   -Mipa=shape   -tp barcelona-64   -Bstatic_pgi 
470.lbm:  basepeak = yes 
482.sphinx3:  -Mpfi(pass 1)   -Mpfo(pass 2)   -Mipa=fast(pass 2)   -Mipa=inline(pass 2)   -Mvect=cachesize:6291456   -fastsse   -Mfprelaxed   -Msmartalloc   -tp barcelona-64   -Bstatic_pgi 

C++ benchmarks:

444.namd:  -Mpfi(pass 1)   -Mpfo(pass 2)   -Mipa=fast(pass 2)   -Mipa=inline(pass 2)   -Mvect=cachesize:6291456   -fastsse   -Munroll=n:4   -Munroll=m:8   -Msmartalloc=huge   -Mnodepchk   -Mfprelaxed   --zc_eh   -tp barcelona-64   -Bstatic_pgi 
447.dealII:  -Mvect=cachesize:6291456   -fastsse   -alias=ansi   -Msmartalloc=huge   -Mprefetch=t0   -Mnovect   -Mfprelaxed   --zc_eh   -Mipa=fast   -Mipa=inline   -tp barcelona-32   -Bstatic_pgi 
450.soplex:  basepeak = yes 
453.povray:  -Mpfi=indirect(pass 1)   -Mpfo=indirect(pass 2)   -Mipa=fast(pass 2)   -Mipa=inlinenopfo:3(pass 2)   -Mipa=staticfunc(pass 2)   -Mvect=cachesize:6291456   -fastsse   -Msmartalloc=huge   -Mprefetch=t0   -Mfprelaxed   -tp barcelona-64   -Bstatic_pgi 

Fortran benchmarks:

410.bwaves:  basepeak = yes 
416.gamess:  -Mpfi(pass 1)   -Mpfo(pass 2)   -Mipa=fast(pass 2)   -Mipa=inline(pass 2)   -Mvect=cachesize:6291456   -fastsse   -Msmartalloc=huge   -Mvect=noaltcode   -Mprefetch=t0   -Mfprelaxed   -tp barcelona-64   -Bstatic_pgi 
434.zeusmp:  -Mvect=cachesize:6291456   -fastsse   -Mfprelaxed   -Mconcur   -Mprefetch=distance:8   -Mprefetch=t0   -Msmartalloc=huge   -Msmartalloc=hugebss   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 
437.leslie3d:  -Mpfi=indirect(pass 1)   -Mpfo=indirect(pass 2)   -Mconcur=noaltcode(pass 2)   -Mipa=fast(pass 2)   -Mipa=inline(pass 2)   -Mvect=cachesize:6291456   -fastsse   -Mvect=fuse   -Msmartalloc=huge   -Mprefetch=distance:8   -Mprefetch=t0   -Mfprelaxed   -tp barcelona-64   -Bstatic_pgi 
459.GemsFDTD:  basepeak = yes 
465.tonto:  -Mvect=cachesize:6291456   -fastsse   -O4   -Mvect=noaltcode   -Msmartalloc=huge   -Mprefetch=distance:8   -Mprefetch=t0   -Mfprelaxed   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 

Benchmarks using both Fortran and C:

435.gromacs:  -Mvect=cachesize:6291456   -fastsse   -Msmartalloc=huge   -Mfprelaxed   -Mconcur   -Mfpapprox=rsqrt   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 
436.cactusADM:  -Mvect=cachesize:6291456   -fastsse   -Msmartalloc=huge   -Mfprelaxed   -Mconcur   -Mdse   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 
454.calculix:  -Mpfi=indirect(pass 1)   -Mpfo=indirect(pass 2)   -Mipa=fast(pass 2)   -Mipa=inline(pass 2)   -Mvect=cachesize:6291456   -fastsse   -Msmartalloc=huge   -Mloop32   -Mprefetch=t0   -Mpre   -Mfprelaxed   -tp barcelona-64   -Bstatic_pgi 
481.wrf:  -Mvect=cachesize:6291456   -fastsse   -Mvect=noaltcode   -Msmartalloc=huge   -Mprefetch=distance:8   -Mconcur=noaltcode   -Mfprelaxed   -tp barcelona-64   -Bstatic_pgi 

Peak Other Flags

C benchmarks:

 -Mipa=jobs:8(pass 2) 

C++ benchmarks:

 -Mipa=jobs:8(pass 2) 

Fortran benchmarks:

 -Mipa=jobs:8 

Benchmarks using both Fortran and C (except as noted below):

 -Mipa=jobs:8(pass 2) 
481.wrf:  No flags used 

The flags file that was used to format this result can be browsed at
http://www.spec.org/cpu2006/flags/pgi72_linux_flags.20090713.01.html.

You can also download the XML flags source by saving the following link:
http://www.spec.org/cpu2006/flags/pgi72_linux_flags.20090713.01.xml.