SPEC® CFP2006 Result

Copyright 2006-2016 Standard Performance Evaluation Corporation

IBM Corporation

IBM System x3655 (AMD Opteron 2352)

CPU2006 license: 11
Test date: Jun-2008
Test sponsor: IBM Corporation
Hardware Availability: Jul-2008
Tested by: Advanced Micro Devices
Software Availability: Jun-2008
Hardware
CPU Name: AMD Opteron 2352
CPU Characteristics:
CPU MHz: 2100
FPU: Integrated
CPU(s) enabled: 8 cores, 2 chips, 4 cores/chip
CPU(s) orderable: 1,2 chips
Primary Cache: 64 KB I + 64 KB D on chip per core
Secondary Cache: 512 KB I+D on chip per core
L3 Cache: 2 MB I+D on chip per chip
Other Cache: None
Memory: 16 GB (8 x 2 GB, DDR2-667 CL5 Reg Dual Rank)
Disk Subsystem: 1 x 73.4 GB SAS, 15000 RPM
Other Hardware: None
Software
Operating System: SuSE Linux Enterprise Server 10 (x86_64) SP1, Kernel 2.6.16.46-0.12-smp
Compiler: PGI Server Complete Version 7.2; PathScale Compiler Suite Version 3.2
Auto Parallel: No
File System: ext3
System State: Runlevel 3 (Full multiuser with network)
Base Pointers: 64-bit
Peak Pointers: 32/64-bit
Other Software: None

Results Table

Results appear in the order in which they were run. Bold underlined text indicates a median measurement.

              Base                                                    Peak
Benchmark     Copies  Seconds  Ratio  Seconds  Ratio  Seconds  Ratio  Copies  Seconds  Ratio  Seconds  Ratio  Seconds  Ratio
410.bwaves         8     1561   69.7     1544   70.4     1552   70.0       8     1453   74.8     1455   74.7     1459   74.5
416.gamess         8     1521    103     1513    104     1519    103       8     1397    112     1401    112     1395    112
433.milc           8     1294   56.8     1294   56.8     1293   56.8       8     1270   57.8     1271   57.8     1271   57.8
434.zeusmp         8      893   81.6      891   81.7      896   81.2       8      893   81.6      891   81.7      896   81.2
435.gromacs        8      688   83.0      689   83.0      689   82.9       8      566    101      566    101      565    101
436.cactusADM      8     1124   85.1     1123   85.1     1124   85.1       8      994   96.1      986   96.9      990   96.5
437.leslie3d       8     1565   48.1     1564   48.1     1567   48.0       8     1454   51.7     1453   51.8     1454   51.7
444.namd           8      827   77.6      826   77.7      826   77.7       8      725   88.5      724   88.7      724   88.7
447.dealII         8      879    104      880    104      875    105       8      624    147      628    146      623    147
450.soplex         8     1302   51.2     1300   51.3     1295   51.5       8     1278   52.2     1280   52.1     1285   51.9
453.povray         8      401    106      400    106      400    106       8      345    123      344    124      344    124
454.calculix       8      620    106      621    106      624    106       8      527    125      528    125      529    125
459.GemsFDTD       8     1930   44.0     1933   43.9     1938   43.8       8     1730   49.1     1734   48.9     1732   49.0
465.tonto          8      907   86.8      911   86.4      908   86.7       8      781    101      775    102      775    102
470.lbm            8     2420   45.4     2358   46.6     2359   46.6       8     2346   46.8     2347   46.8     2348   46.8
481.wrf            8     1115   80.1     1117   80.0     1118   79.9       8     1049   85.1     1049   85.2     1045   85.6
482.sphinx3        8     1873   83.2     1873   83.3     1872   83.3       8     1755   88.9     1754   88.9     1754   88.9
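
The median marking mentioned in the table note does not survive in plain text, but the medians can be recovered from the three ratios per benchmark. SPEC defines the overall rate metric as the geometric mean of those per-benchmark medians. The sketch below recomputes both from the base ratios in the table. The function names are illustrative and not part of any SPEC tool, and because the published ratios are rounded, the result may differ slightly from the official overall figure, which this excerpt does not include.

```python
from math import exp, log
from statistics import median

# Base ratios for each benchmark, in run order, copied from the results table.
BASE_RATIOS = {
    "410.bwaves": (69.7, 70.4, 70.0),
    "416.gamess": (103, 104, 103),
    "433.milc": (56.8, 56.8, 56.8),
    "434.zeusmp": (81.6, 81.7, 81.2),
    "435.gromacs": (83.0, 83.0, 82.9),
    "436.cactusADM": (85.1, 85.1, 85.1),
    "437.leslie3d": (48.1, 48.1, 48.0),
    "444.namd": (77.6, 77.7, 77.7),
    "447.dealII": (104, 104, 105),
    "450.soplex": (51.2, 51.3, 51.5),
    "453.povray": (106, 106, 106),
    "454.calculix": (106, 106, 106),
    "459.GemsFDTD": (44.0, 43.9, 43.8),
    "465.tonto": (86.8, 86.4, 86.7),
    "470.lbm": (45.4, 46.6, 46.6),
    "481.wrf": (80.1, 80.0, 79.9),
    "482.sphinx3": (83.2, 83.3, 83.3),
}


def median_ratios(ratios_by_benchmark):
    """Pick the median of the three runs for every benchmark."""
    return {name: median(runs) for name, runs in ratios_by_benchmark.items()}


def geometric_mean(values):
    """Geometric mean, computed via logarithms to avoid a huge product."""
    values = list(values)
    return exp(sum(log(v) for v in values) / len(values))


base_medians = median_ratios(BASE_RATIOS)
base_geomean = geometric_mean(base_medians.values())

print(f"410.bwaves median base ratio: {base_medians['410.bwaves']}")
print(f"Geometric mean of median base ratios: {base_geomean:.1f}")
```

The same two functions apply unchanged to the peak columns.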

Operating System Notes

 'numactl' was used to bind copies to the cores
 Environment variable PGI_HUGE_PAGES set to 150
 'ulimit -s unlimited' was used to remove the stack size limit
 'ulimit -l 2097152' was used to set the limit on memory locked by each process
 Set vm/nr_hugepages=1200 in /etc/sysctl.conf
 mount -t hugetlbfs nodev /mnt/hugepages
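
The huge-page settings above fit together arithmetically, which the short sketch below works through. It rests on assumptions the report does not state: a 2 MB huge-page size (the usual default on x86_64 Linux), PGI_HUGE_PAGES acting as a per-process cap, and 'ulimit -l' being expressed in kilobytes as bash reports it. Whether the pool of 1200 pages was deliberately sized as 8 copies x 150 pages is an inference, not something the notes say.

```python
# Values taken from the Operating System Notes and the results table.
NR_HUGEPAGES = 1200         # vm/nr_hugepages in /etc/sysctl.conf
PGI_HUGE_PAGES = 150        # environment variable; assumed to be a per-process cap
COPIES = 8                  # copies run per benchmark
MEMLOCK_LIMIT_KB = 2097152  # 'ulimit -l' value; assumed to be in kilobytes

# Assumption: 2 MB huge pages (not stated in the report).
HUGE_PAGE_MB = 2

pool_mb = NR_HUGEPAGES * HUGE_PAGE_MB              # total reserved huge-page pool
per_copy_mb = PGI_HUGE_PAGES * HUGE_PAGE_MB        # most one copy may take
pages_if_all_copies_max_out = PGI_HUGE_PAGES * COPIES
memlock_gb = MEMLOCK_LIMIT_KB / (1024 * 1024)      # locked-memory limit per process

print(f"Huge-page pool: {pool_mb} MB")
print(f"Per-copy huge-page cap: {per_copy_mb} MB")
print(f"Pages needed if all copies reach the cap: {pages_if_all_copies_max_out}")
print(f"Locked-memory limit: {memlock_gb} GB")
```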

Base Compiler Invocation

C benchmarks:

 pgcc 

C++ benchmarks:

 pgcpp 

Fortran benchmarks:

 pgf95 

Benchmarks using both Fortran and C:

 pgcc   pgf95 

Base Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64   -Mnomain 
436.cactusADM:  -DSPEC_CPU_LP64   -Mnomain 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
447.dealII:  -DSPEC_CPU_LP64 
450.soplex:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64   -Mnomain 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_CASE_FLAG   -DSPEC_CPU_LINUX 
482.sphinx3:  -DSPEC_CPU_LP64 

Base Optimization Flags

C benchmarks:

 -fastsse   -Msmartalloc=huge:150   -Mfprelaxed   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 

C++ benchmarks:

 -fastsse   -Msmartalloc=huge:150   -Mfprelaxed   --zc_eh   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 

Fortran benchmarks:

 -fastsse   -Mfprelaxed   -Msmartalloc=huge:150   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 

Benchmarks using both Fortran and C:

 -fastsse   -Msmartalloc=huge:150   -Mfprelaxed   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 

Base Other Flags

C benchmarks:

 -Mipa=jobs:4 

C++ benchmarks:

 -Mipa=jobs:4 

Fortran benchmarks:

 -Mipa=jobs:4 

Benchmarks using both Fortran and C:

 -Mipa=jobs:4 

Peak Compiler Invocation

C benchmarks (except as noted below):

 pgcc 
470.lbm:  pathcc 

C++ benchmarks (except as noted below):

 pathCC 
444.namd:  pgcpp 

Fortran benchmarks (except as noted below):

 pathf95 
410.bwaves:  pgf95 
434.zeusmp:  pgf95 
437.leslie3d:  pgf95 

Benchmarks using both Fortran and C (except as noted below):

 pgcc   pgf95 
436.cactusADM:  pathcc   pathf95 
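
The "except as noted below" convention amounts to a per-language default plus per-benchmark overrides. A minimal sketch of that lookup, using only the compilers listed above, follows. The dictionary and function names are hypothetical and do not come from the SPEC tools.

```python
# Per-language peak defaults, as listed in this section.
DEFAULT_PEAK_COMPILERS = {
    "C": ("pgcc",),
    "C++": ("pathCC",),
    "Fortran": ("pathf95",),
    "Fortran+C": ("pgcc", "pgf95"),
}

# Benchmarks whose peak build departs from the default for their language.
PEAK_OVERRIDES = {
    "470.lbm": ("pathcc",),
    "444.namd": ("pgcpp",),
    "410.bwaves": ("pgf95",),
    "434.zeusmp": ("pgf95",),
    "437.leslie3d": ("pgf95",),
    "436.cactusADM": ("pathcc", "pathf95"),
}


def peak_compilers(benchmark, language):
    """Return the override for a benchmark if one exists, else the language default."""
    return PEAK_OVERRIDES.get(benchmark, DEFAULT_PEAK_COMPILERS[language])


print(peak_compilers("433.milc", "C"))
print(peak_compilers("470.lbm", "C"))
print(peak_compilers("436.cactusADM", "Fortran+C"))
```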

Peak Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64   -Mnomain 
436.cactusADM:  -DSPEC_CPU_LP64   -fno-second-underscore 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64   -Mnomain 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LP64   -DSPEC_CPU_CASE_FLAG   -DSPEC_CPU_LINUX 
482.sphinx3:  -DSPEC_CPU_LP64 

Peak Optimization Flags

C benchmarks:

433.milc:  -fastsse   -Msmartalloc=huge:150   -Msafeptr   -Mfprelaxed   -Mipa=inline   -Mipa=arg   -Mipa=const   -Mipa=ptr   -Mipa=shape   -tp barcelona-64   -Bstatic_pgi 
470.lbm:  -march=barcelona   -Ofast   -CG:sse_cse_regs=0   -CG:locs_shallow_depth=1   -m3dnow 
482.sphinx3:  -Mpfi=indirect(pass 1)   -Mpfo=indirect(pass 2)   -Mipa=fast(pass 2)   -Mipa=inline(pass 2)   -fastsse   -Mfprelaxed   -Msmartalloc   -tp barcelona-64   -Bstatic_pgi 

C++ benchmarks:

444.namd:  -Mpfi(pass 1)   -Mpfo(pass 2)   -Mipa=fast(pass 2)   -Mipa=inline(pass 2)   -fastsse   -Munroll=n:4   -Munroll=m:8   -Msmartalloc=huge:150   -Mnodepchk   -Mfprelaxed   --zc_eh   -tp barcelona-64   -Bstatic_pgi 
447.dealII:  -march=barcelona   -Ofast   -static   -INLINE:aggressive=on   -fno-exceptions   -m32 
450.soplex:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -O3   -TENV:frame_pointer=off   -LNO:prefetch=1   -OPT:malloc_alg=1   -CG:load_exe=0   -m32 
453.povray:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast 

Fortran benchmarks:

410.bwaves:  -Mpfi(pass 1)   -Mpfo(pass 2)   -Mipa=fast(pass 2)   -Mipa=inline(pass 2)   -fastsse   -Msmartalloc   -Mprefetch=distance:12   -Mprefetch=nta   -Mpre   -Mfprelaxed   -tp barcelona-64   -Bstatic_pgi 
416.gamess:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -O2   -OPT:Ofast   -OPT:ro=3   -OPT:unroll_size=256 
434.zeusmp:  basepeak = yes 
437.leslie3d:  -Mpfi=indirect(pass 1)   -Mpfo=indirect(pass 2)   -Mipa=fast(pass 2)   -Mipa=inline(pass 2)   -fastsse   -Mvect=fuse   -Msmartalloc=huge:150   -Mprefetch=distance:8   -Mprefetch=t0   -Mfprelaxed   -tp barcelona-64   -Bstatic_pgi 
459.GemsFDTD:  -march=barcelona   -Ofast   -LNO:fission=2   -LNO:simd=2   -LNO:prefetch_ahead=1   -CG:load_exe=0 
465.tonto:  -march=barcelona   -Ofast   -OPT:alias=no_f90_pointer_alias   -LNO:blocking=off   -CG:load_exe=1   -IPA:plimit=525 

Benchmarks using both Fortran and C:

435.gromacs:  -fastsse   -Msmartalloc=huge:150   -Mfprelaxed   -Mfpapprox=rsqrt   -Mipa=fast   -Mipa=inline   -tp barcelona-64   -Bstatic_pgi 
436.cactusADM:  -march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -LNO:blocking=off 
454.calculix:  -Mpfi=indirect(pass 1)   -Mpfo=indirect(pass 2)   -Mipa=fast(pass 2)   -Mipa=inline(pass 2)   -fastsse   -Msmartalloc=huge:150   -Mprefetch=t0   -Mpre   -Mfprelaxed   -tp barcelona-64   -Bstatic_pgi 
481.wrf:  -fastsse   -Mvect=noaltcode   -Msmartalloc   -Mprefetch=distance:8   -Mfprelaxed   -tp barcelona-64   -Bstatic_pgi 
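
Several peak entries use feedback-directed optimization, where flags tagged "(pass 1)" or "(pass 2)" belong to only one of the two compile passes. The sketch below splits such a flag line into the flags for a given pass. It assumes that untagged flags apply to both passes and that flags are separated by two or more spaces, as they are in this listing. The helper name is illustrative, not part of the SPEC tools.

```python
import re

# Matches a trailing "(pass N)" annotation on a flag.
PASS_SUFFIX = re.compile(r"\(pass (\d+)\)$")


def flags_for_pass(flag_line, pass_number):
    """Return the flags that apply to the given compile pass.

    Assumption: a flag without a "(pass N)" tag is used in every pass.
    """
    selected = []
    # Flags are separated by runs of spaces; a single space stays inside a flag
    # (for example "-fb_create fbdata").
    for token in re.split(r"\s{2,}", flag_line.strip()):
        match = PASS_SUFFIX.search(token)
        if match is None:
            selected.append(token)
        elif int(match.group(1)) == pass_number:
            selected.append(token[:match.start()].rstrip())
    return selected


# Peak flags for 450.soplex, copied from the listing above.
SOPLEX = (
    "-march=barcelona   -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   "
    "-O3   -TENV:frame_pointer=off   -LNO:prefetch=1   -OPT:malloc_alg=1   "
    "-CG:load_exe=0   -m32"
)

print("pass 1:", " ".join(flags_for_pass(SOPLEX, 1)))
print("pass 2:", " ".join(flags_for_pass(SOPLEX, 2)))
```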

Peak Other Flags

C benchmarks (except as noted below):

 -Mipa=jobs:4(pass 2) 
470.lbm:  No flags used 

C++ benchmarks:

444.namd:  -Mipa=jobs:4(pass 2) 

Fortran benchmarks (except as noted below):

 -Mipa=jobs:4(pass 2) 
416.gamess:  No flags used 
459.GemsFDTD:  No flags used 
465.tonto:  No flags used 

Benchmarks using both Fortran and C (except as noted below):

 -Mipa=jobs:4(pass 2) 
436.cactusADM:  No flags used 
481.wrf:  No flags used 

The flags file that was used to format this result can be browsed at
http://www.spec.org/cpu2006/flags/amd421GH-flags.20090713.html.

You can also download the XML flags source by saving the following link:
http://www.spec.org/cpu2006/flags/amd421GH-flags.20090713.xml.