SPEC® CFP2006 Result

Copyright 2006-2014 Standard Performance Evaluation Corporation

Dell Inc.

PowerEdge M905 (AMD Opteron 8380, 2.50 GHz)

SPECfp®2006 = 20.9

CPU2006 license: 55
Test date: Nov-2008
Test sponsor: Dell Inc.
Hardware Availability: Nov-2008
Tested by: Dell Inc.
Software Availability: Oct-2008

Hardware

CPU Name: AMD Opteron 8380
CPU Characteristics:
CPU MHz: 2500
FPU: Integrated
CPU(s) enabled: 16 cores, 4 chips, 4 cores/chip
CPU(s) orderable: 4 chips
Primary Cache: 64 KB I + 64 KB D on chip per core
Secondary Cache: 512 KB I+D on chip per core
L3 Cache: 6 MB I+D on chip per chip
Other Cache: None
Memory: 64 GB (16 x 4 GB DDR2-800)
Disk Subsystem: 1 x 36 GB 15000 RPM SAS (OS), 1 x 73 GB 10000 RPM SAS (CPU2006)
Other Hardware: None

Software

Operating System: SUSE Linux Enterprise Server 10 (x86_64) SP2, Kernel 2.6.16.60-0.21-smp
Compiler: PGI Server Complete Version 7.2; PathScale Compiler Suite Version 3.2
Auto Parallel: Yes
File System: ReiserFS
System State: Run level 3 (multi-user)
Base Pointers: 64-bit
Peak Pointers: 32/64-bit
Other Software: binutils 2.18; 32-bit and 64-bit libhugetlbfs libraries

Results Table

Results appear in the order in which they were run. Bold underlined text indicates a median measurement.

                Base                                         Peak
Benchmark       Seconds Ratio  Seconds Ratio  Seconds Ratio  Seconds Ratio  Seconds Ratio  Seconds Ratio
410.bwaves          683  19.9      593  22.9      590  23.0      606  22.4      700  19.4      699  19.4
416.gamess         1263  15.5     1263  15.5     1263  15.5     1131  17.3     1131  17.3     1133  17.3
433.milc            499  18.4      503  18.3      502  18.3      499  18.4      503  18.3      502  18.3
434.zeusmp          605  15.0      606  15.0      605  15.1      548  16.6      636  14.3      547  16.6
435.gromacs         505  14.1      505  14.1      505  14.1      411  17.4      410  17.4      411  17.4
436.cactusADM       615  19.4      616  19.4      618  19.4      116   103      113   106      116   103
437.leslie3d        550  17.1      642  14.6      642  14.6      522  18.0      519  18.1      520  18.1
444.namd            665  12.1      665  12.1      666  12.0      576  13.9      574  14.0      575  14.0
447.dealII          589  19.4      632  18.1      590  19.4      441  25.9      441  26.0      441  25.9
450.soplex          626  13.3      623  13.4      699  11.9      665  12.5      601  13.9      601  13.9
453.povray          326  16.3      325  16.4      324  16.4      279  19.1      282  18.9      280  19.0
454.calculix        492  16.8      492  16.8      493  16.7      412  20.0      412  20.0      421  19.6
459.GemsFDTD        645  16.5      772  13.7      649  16.3      574  18.5      574  18.5      576  18.4
465.tonto           615  16.0      623  15.8      615  16.0      493  20.0      487  20.2      487  20.2
470.lbm             483  28.4      483  28.5      484  28.4      541  25.4      540  25.4      540  25.4
481.wrf             553  20.2      551  20.3      552  20.2      499  22.4      539  20.7      498  22.4
482.sphinx3         992  19.6     1161  16.8     1143  17.0      891  21.9      944  20.6      940  20.7
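
For readers who want to check how the headline number relates to the table, the following Python sketch takes the median ratio of the three peak runs for each benchmark and computes the geometric mean of those medians, which is how SPEC CPU2006 composite metrics are formed. The ratios are copied from the table above; because they are already rounded to three significant digits, the result is only approximate. That the headline figure corresponds to the peak columns is an inference from this arithmetic, not something the page states.

```python
import math
from statistics import median

# Peak ratios for the three runs of each benchmark, copied from the results table.
peak_ratios = {
    "410.bwaves": (22.4, 19.4, 19.4),
    "416.gamess": (17.3, 17.3, 17.3),
    "433.milc": (18.4, 18.3, 18.3),
    "434.zeusmp": (16.6, 14.3, 16.6),
    "435.gromacs": (17.4, 17.4, 17.4),
    "436.cactusADM": (103, 106, 103),
    "437.leslie3d": (18.0, 18.1, 18.1),
    "444.namd": (13.9, 14.0, 14.0),
    "447.dealII": (25.9, 26.0, 25.9),
    "450.soplex": (12.5, 13.9, 13.9),
    "453.povray": (19.1, 18.9, 19.0),
    "454.calculix": (20.0, 20.0, 19.6),
    "459.GemsFDTD": (18.5, 18.5, 18.4),
    "465.tonto": (20.0, 20.2, 20.2),
    "470.lbm": (25.4, 25.4, 25.4),
    "481.wrf": (22.4, 20.7, 22.4),
    "482.sphinx3": (21.9, 20.6, 20.7),
}


def geometric_mean(values):
    """Return the geometric mean of a collection of positive numbers."""
    values = list(values)
    return math.exp(sum(math.log(v) for v in values) / len(values))


# One median per benchmark, then one geometric mean over all benchmarks.
medians = {name: median(runs) for name, runs in peak_ratios.items()}
composite = geometric_mean(medians.values())
print(f"geometric mean of peak medians: {composite:.1f}")
```

With these rounded inputs the result lands close to the reported 20.9. Because a geometric mean is used, the large 436.cactusADM peak ratio raises the composite less than it would raise an arithmetic mean.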

Submit Notes

The config file option 'submit' was used.
 'numactl' was used to bind copies to the cores

Operating System Notes

 The libhugetlbfs libraries were installed using the
 installation rpms that came with the distribution.

 'ulimit -s unlimited' was used to set the environment stack size
 'ulimit -l 2097152' was used to set the environment limit on locked pages in memory

 Set vm/nr_hugepages=14336 in /etc/sysctl.conf
 mount -t hugetlbfs nodev /mnt/hugepages
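
To put the huge-page settings in proportion, this short Python calculation converts them to sizes. It assumes a 2 MiB huge page, the usual default on x86_64 Linux (the report does not state the size; `Hugepagesize` in /proc/meminfo would confirm it), and that `ulimit -l` takes its value in KiB, as bash does.

```python
HUGEPAGE_MIB = 2        # assumption: 2 MiB huge pages (not stated in the report)
NR_HUGEPAGES = 14336    # vm/nr_hugepages from /etc/sysctl.conf
MEMLOCK_KIB = 2097152   # 'ulimit -l' value; bash interprets it in KiB
INSTALLED_GB = 64       # memory listed under Hardware

# Convert both settings to GiB for comparison with installed memory.
reserved_gib = NR_HUGEPAGES * HUGEPAGE_MIB / 1024
memlock_gib = MEMLOCK_KIB / (1024 * 1024)

print(f"huge page pool: {reserved_gib:g} GiB of {INSTALLED_GB} GB installed")
print(f"locked-memory limit: {memlock_gib:g} GiB")
```

Under those assumptions the pool reserves 28 GiB, a little under half of installed memory, and the locked-memory limit is 2 GiB.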

General Notes

Environment variables set by runspec before the start of the run:
HUGETLB_MORECORE = "yes"
LD_LIBRARY_PATH = "/root/cpu2006-1.1/amd909gh-libs/64:/root/cpu2006-1.1/amd909gh-libs/32"
NCPUS = "8"

Base Compiler Invocation

C benchmarks:

 pgcc 

C++ benchmarks:

 pgcpp 

Fortran benchmarks:

 pgf95 

Benchmarks using both Fortran and C:

 pgcc   pgf95 

Base Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64   -Mnomain 
436.cactusADM:  -DSPEC_CPU_LP64   -Mnomain 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
447.dealII:  -DSPEC_CPU_LP64 
450.soplex:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64   -Mnomain 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_CASE_FLAG   -DSPEC_CPU_LINUX 
482.sphinx3:  -DSPEC_CPU_LP64 

Base Optimization Flags

C benchmarks:

 -Mvect=cachesize:6291456   -fastsse   -Msmartalloc=huge   -Mfprelaxed   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 

C++ benchmarks:

 -Mvect=cachesize:6291456   -fastsse   -Msmartalloc=huge   -Mfprelaxed   --zc_eh   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 

Fortran benchmarks:

 -Mvect=cachesize:6291456   -fastsse   -Mfprelaxed   -Msmartalloc=huge   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 

Benchmarks using both Fortran and C:

 -Mvect=cachesize:6291456   -fastsse   -Msmartalloc=huge   -Mfprelaxed   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 
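
The `-Mvect=cachesize:6291456` value used throughout matches the 6 MB L3 cache listed under Hardware when MB is read as 2^20 bytes, as the check below shows. Reading the flag's argument as a byte count is an assumption based on that match; the PGI flags file linked at the end of the report is the authority on its meaning.

```python
L3_CACHE_MB = 6            # "L3 Cache: 6 MB I+D on chip per chip"
cachesize_flag = 6291456   # argument of -Mvect=cachesize:

# Interpret MB as 2**20 bytes and compare with the flag argument.
l3_bytes = L3_CACHE_MB * 1024 * 1024
print(l3_bytes == cachesize_flag)
```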

Base Other Flags

C benchmarks:

 -Mipa=jobs:4 

C++ benchmarks:

 -Mipa=jobs:4 

Fortran benchmarks:

 -Mipa=jobs:4 

Benchmarks using both Fortran and C:

 -Mipa=jobs:4 

Peak Compiler Invocation

C benchmarks:

 pgcc 

C++ benchmarks (except as noted below):

 pathCC 
444.namd:  pgcpp 

Fortran benchmarks (except as noted below):

 pathf95 
410.bwaves:  pgf95 
434.zeusmp:  pgf95 
437.leslie3d:  pgf95 

Benchmarks using both Fortran and C (except as noted below):

 pgcc   pgf95 
435.gromacs:  pathcc   pathf95 
481.wrf:  pathcc   pathf95 

Peak Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64 
436.cactusADM:  -DSPEC_CPU_LP64   -Mnomain 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64   -Mnomain 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_LINUX   -fno-second-underscore 
482.sphinx3:  -DSPEC_CPU_LP64 

Peak Optimization Flags

C benchmarks:

433.milc:  basepeak = yes 
470.lbm:  -Mvect=cachesize:6291456   -fastsse   -Msmartalloc=huge   -Mprefetch=t0   -Mloop32   -Mfprelaxed   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 
482.sphinx3:  -Mpfi=indirect(pass 1)   -Mpfo=indirect(pass 2)   -Mipa=fast(pass 2)   -Mipa=inline(pass 2)   -Mvect=cachesize:6291456   -fastsse   -Mfprelaxed   -Msmartalloc   -tp barcelona-64   -Bstatic_pgi 

C++ benchmarks:

444.namd:  -Mpfi(pass 1)   -Mpfo(pass 2)   -Mipa=fast(pass 2)   -Mipa=inline(pass 2)   -Mvect=cachesize:6291456   -fastsse   -Munroll=n:4   -Munroll=m:8   -Msmartalloc=huge   -Mnodepchk   -Mfprelaxed   --zc_eh   -tp barcelona-64   -Bstatic_pgi 
447.dealII:  -march=barcelona   -Ofast   -static   -INLINE:aggressive=on   -fno-exceptions   -m32 
450.soplex:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -L/usr/lib -lhugetlbfs(pass 2)   -O3   -INLINE:aggressive=on   -OPT:IEEE_arith=3   -OPT:IEEE_NaN_Inf=off   -OPT:fold_unsigned_relops=on   -OPT:malloc_alg=1   -CG:load_exe=0   -fno-exceptions   -m32 
453.povray:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -INLINE:aggressive=on 

Fortran benchmarks:

410.bwaves:  -Mvect=cachesize:6291456   -fastsse   -Msmartalloc   -Mprefetch=nta   -Mfprelaxed   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 
416.gamess:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Wl,-T/usr/share/libhugetlbfs/ldscripts/elf_x86_64.xBDT(pass 2)   -L/usr/lib64 -lhugetlbfs(pass 2)   -O2   -OPT:Ofast   -OPT:ro=3   -OPT:unroll_size=256 
434.zeusmp:  -Mvect=cachesize:6291456   -fastsse   -Mfprelaxed   -Mprefetch=distance:8   -Mprefetch=t0   -Msmartalloc=huge   -Msmartalloc=hugebss   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 
437.leslie3d:  -Mpfi=indirect(pass 1)   -Mpfo=indirect(pass 2)   -Mipa=fast(pass 2)   -Mipa=inline(pass 2)   -Mvect=cachesize:6291456   -fastsse   -Mvect=fuse   -Msmartalloc=huge   -Mprefetch=distance:8   -Mprefetch=t0   -Mfprelaxed   -tp barcelona-64   -Bstatic_pgi 
459.GemsFDTD:  -march=barcelona   -Ofast   -LNO:fission=2   -LNO:simd=2   -LNO:prefetch_ahead=1   -CG:load_exe=0   -CG:prefer_lru_reg=off   -OPT:malloc_alg=1   -Wl,-T/usr/share/libhugetlbfs/ldscripts/elf_x86_64.xBDT   -L/usr/lib64 -lhugetlbfs 
465.tonto:  -march=barcelona   -Ofast   -OPT:alias=no_f90_pointer_alias   -LNO:blocking=off   -CG:load_exe=1   -IPA:plimit=525   -OPT:malloc_alg=1   -Wl,-T/usr/share/libhugetlbfs/ldscripts/elf_x86_64.xBDT   -L/usr/lib64 -lhugetlbfs 

Benchmarks using both Fortran and C:

435.gromacs:  -march=barcelona   -Ofast   -OPT:rsqrt=2   -OPT:malloc_alg=1   -Wl,-T/usr/share/libhugetlbfs/ldscripts/elf_x86_64.xBDT   -L/usr/lib64 -lhugetlbfs 
436.cactusADM:  -Mvect=cachesize:6291456   -fastsse   -Mconcur   -Msmartalloc=huge   -Mfprelaxed   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 
454.calculix:  -Mpfi=indirect(pass 1)   -Mpfo=indirect(pass 2)   -Mipa=fast(pass 2)   -Mipa=inline(pass 2)   -Mvect=cachesize:6291456   -fastsse   -Msmartalloc=huge   -Mprefetch=t0   -Mpre   -Mfprelaxed   -tp barcelona-64   -Bstatic_pgi 
481.wrf:  -march=barcelona   -Ofast   -LNO:blocking=off   -LNO:prefetch_ahead=10   -LANG:copyinout=off   -IPA:callee_limit=5000   -GRA:prioritize_by_density=on   -OPT:malloc_alg=1   -m3dnow   -Wl,-T/usr/share/libhugetlbfs/ldscripts/elf_x86_64.xBDT   -L/usr/lib64 -lhugetlbfs 

Peak Other Flags

C benchmarks:

 -Mipa=jobs:4(pass 2) 

C++ benchmarks:

444.namd:  -Mipa=jobs:4(pass 2) 

Fortran benchmarks (except as noted below):

 -Mipa=jobs:4(pass 2) 
416.gamess:  No flags used 
459.GemsFDTD:  No flags used 
465.tonto:  No flags used 

Benchmarks using both Fortran and C (except as noted below):

 -Mipa=jobs:4(pass 2) 
435.gromacs:  No flags used 
481.wrf:  No flags used 

The flags files that were used to format this result can be browsed at
http://www.spec.org/cpu2006/flags/pgi72_linux_flags.20090713.html,
http://www.spec.org/cpu2006/flags/CPU2006_flags.20090710.html,
http://www.spec.org/cpu2006/flags/amd-platform-amd909gh.html.

You can also download the XML flags sources by saving the following links:
http://www.spec.org/cpu2006/flags/pgi72_linux_flags.20090713.xml,
http://www.spec.org/cpu2006/flags/CPU2006_flags.20090710.xml,
http://www.spec.org/cpu2006/flags/amd-platform-amd909gh.xml.