SPEC® CFP2006 Result

Copyright 2006-2014 Standard Performance Evaluation Corporation

Hewlett-Packard Company

ProLiant BL465c Gen8
(2.50 GHz AMD Opteron 6380)

CPU2006 license: 3 Test date: Sep-2012
Test sponsor: Hewlett-Packard Company Hardware Availability: Dec-2012
Tested by: Hewlett-Packard Company Software Availability: Jun-2012
Benchmark results graph
Hardware
CPU Name: AMD Opteron 6380
CPU Characteristics: AMD Turbo CORE technology up to 3.40 GHz
CPU MHz: 2500
FPU: Integrated
CPU(s) enabled: 32 cores, 2 chips, 16 cores/chip
CPU(s) orderable: 1,2 chips
Primary Cache: 512 KB I on chip per chip,
64 KB I shared / 2 cores;
16 KB D on chip per core
Secondary Cache: 16 MB I+D on chip per chip, 2 MB shared / 2 cores
L3 Cache: 16 MB I+D on chip per chip, 8 MB shared / 8 cores
Other Cache: None
Memory: 128 GB (16 x 8 GB 2Rx4 PC3-12800R-11, ECC)
Disk Subsystem: 2 x 200 GB SSD SAS, RAID 1
Other Hardware: None
Software
Operating System: Red Hat Enterprise Linux Server release 6.3,
Kernel 2.6.32-279.el6.x86_64
Compiler: C/C++/Fortran: Version 4.5.2 of
x86 Open64 Compiler Suite (from AMD)
Auto Parallel: No
File System: ext3
System State: Run level 3 (multi-user)
Base Pointers: 64-bit
Peak Pointers: 32/64-bit
Other Software: None

Results Table

Benchmark Base Peak
Copies Seconds Ratio Seconds Ratio Seconds Ratio Copies Seconds Ratio Seconds Ratio Seconds Ratio
Results appear in the order in which they were run. Bold underlined text indicates a median measurement.
410.bwaves 32 1353 321 1353 321 1353 321 32 1330 327 1330 327 1330 327
416.gamess 32 1799 348 1799 348 1810 346 32 1656 378 1653 379 1655 379
433.milc 32 1088 270 1089 270 1088 270 32 929 316 930 316 930 316
434.zeusmp 32 661 440 664 439 664 438 32 649 449 644 452 650 448
435.gromacs 32 543 421 542 421 543 421 32 439 521 438 521 440 520
436.cactusADM 32 732 523 734 521 733 521 32 668 573 669 572 670 571
437.leslie3d 32 1355 222 1355 222 1355 222 32 1039 289 1041 289 1057 285
444.namd 32 756 339 755 340 755 340 32 635 404 638 402 634 405
447.dealII 32 492 744 488 750 492 743 32 451 811 447 820 453 809
450.soplex 32 994 268 994 268 994 269 32 910 293 909 294 909 294
453.povray 32 367 464 368 463 367 463 32 326 523 319 533 319 534
454.calculix 32 392 673 393 672 393 672 32 379 697 377 700 376 701
459.GemsFDTD 32 1654 205 1655 205 1656 205 32 1448 235 1447 235 1447 235
465.tonto 32 783 402 776 406 777 405 32 708 445 721 436 716 440
470.lbm 32 1052 418 1002 439 1003 438 32 1002 439 1007 437 1003 439
481.wrf 32 904 396 901 397 907 394 32 904 395 902 396 902 396
482.sphinx3 32 1816 343 1813 344 1815 344 32 1383 451 1381 452 1423 438

Submit Notes

The config file option 'submit' was used.
'numactl' was used to bind copies to the cores.
See the configuration file for details.

Operating System Notes

'ulimit -s unlimited' was used to set environment stack size
'ulimit -l 2097152'  was used to set environment locked pages in memory limit

Set transparent_hugepage=never as a boot parameter in /boot/grub/menu.lst

Set vm/nr_hugepages=28672 in /etc/sysctl.conf
mount -t hugetlbfs nodev /mnt/hugepages

Platform Notes

 BIOS configuration:
  HP Power Profile set to Maximum Performace
  Thermal Configuration set to Maximum Cooling
  Collaborative Power Control set to Disable
  Minimum Processor Idle Power C1e State to Enabled
  Processor Power and Utilization Monitoring set to Disable

General Notes

Environment variables set by runspec before the start of the run:
HUGETLB_LIMIT = "896"
LD_LIBRARY_PATH = "/cpu2006/amd1206-rate-libs-revA/32:/cpu2006/amd1206-rate-libs-revA/64"

The x86 Open64 Compiler Suite is only available from (and supported by) AMD at
http://developer.amd.com/cpu/open64

Binaries were compiled on a system with 2x AMD Opteron 6386SE chips + 128GB Memory using RHEL 6.3

Base Compiler Invocation

C benchmarks:

 opencc 

C++ benchmarks:

 openCC 

Fortran benchmarks:

 openf95 

Benchmarks using both Fortran and C:

 opencc   openf95 

Base Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64 
436.cactusADM:  -DSPEC_CPU_LP64   -fno-second-underscore 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
447.dealII:  -DSPEC_CPU_LP64 
450.soplex:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LINUX   -DSPEC_CPU_CASE_FLAG   -DSPEC_CPU_LP64   -fno-second-underscore 
482.sphinx3:  -DSPEC_CPU_LP64 

Base Optimization Flags

C benchmarks:

 -Ofast   -OPT:malloc_alg=1   -HP:bd=2m:heap=2m   -IPA:plimit=8000   -IPA:small_pu=100   -mso   -march=bdver1 

C++ benchmarks:

 -Ofast   -static   -CG:load_exe=0   -OPT:malloc_alg=1   -INLINE:aggressive=on   -HP:bd=2m:heap=2m   -D__OPEN64_FAST_SET   -march=bdver1 

Fortran benchmarks:

 -Ofast   -LNO:blocking=off   -LNO:simd_peel_align=on   -OPT:rsqrt=2   -OPT:unroll_size=256   -HP:bd=2m:heap=2m   -mso   -march=bdver1 

Benchmarks using both Fortran and C:

 -Ofast   -OPT:malloc_alg=1   -HP:bd=2m:heap=2m   -IPA:plimit=8000   -IPA:small_pu=100   -mso   -march=bdver1   -LNO:blocking=off   -LNO:simd_peel_align=on   -OPT:rsqrt=2   -OPT:unroll_size=256 

Peak Compiler Invocation

C benchmarks:

 opencc 

C++ benchmarks:

 openCC 

Fortran benchmarks:

 openf95 

Benchmarks using both Fortran and C:

 opencc   openf95 

Peak Portability Flags

410.bwaves:  -DSPEC_CPU_LP64 
416.gamess:  -DSPEC_CPU_LP64 
433.milc:  -DSPEC_CPU_LP64 
434.zeusmp:  -DSPEC_CPU_LP64 
435.gromacs:  -DSPEC_CPU_LP64 
436.cactusADM:  -DSPEC_CPU_LP64   -fno-second-underscore 
437.leslie3d:  -DSPEC_CPU_LP64 
444.namd:  -DSPEC_CPU_LP64 
453.povray:  -DSPEC_CPU_LP64 
454.calculix:  -DSPEC_CPU_LP64 
459.GemsFDTD:  -DSPEC_CPU_LP64 
465.tonto:  -DSPEC_CPU_LP64 
470.lbm:  -DSPEC_CPU_LP64 
481.wrf:  -DSPEC_CPU_LINUX   -DSPEC_CPU_CASE_FLAG   -DSPEC_CPU_LP64   -fno-second-underscore 

Peak Optimization Flags

C benchmarks:

433.milc:  -Ofast   -CG:movnti=1   -CG:locs_best=on   -HP:bdt=2m:heap=2m   -IPA:plimit=7000   -IPA:callee_limit=1200   -OPT:struct_array_copy=2   -OPT:alias=field_sensitive   -mso   -march=bdver1 
470.lbm:  -Ofast   -CG:cmp_peep=on   -OPT:keep_ext=on   -HP:bdt=2m:heap=2m   -IPA:plimit=8000   -IPA:small_pu=100   -march=bdver1   -mso 
482.sphinx3:  -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -m32   -IPA:plimit=1000   -OPT:malloc_alg=2   -CG:cmp_peep=on   -CG:p2align=0   -CG:load_exe=1   -CG:dsched=on   -INLINE:aggressive=on   -LNO:prefetch=2   -LNO:prefetch_ahead=4   -mso   -march=bdver2 

C++ benchmarks:

444.namd:  -Ofast   -IPA:plimit=3000   -LNO:ignore_feedback=off   -CG:local_sched_alg=0   -CG:load_exe=0   -OPT:unroll_size=256   -fno-exceptions   -HP:bdt=2m:heap=2m   -LNO:if_select_conv=1   -OPT:alias=disjoint   -LNO:psimd_iso_unroll=ON   -march=bdver1 
447.dealII:  -Ofast   -D__OPEN64_FAST_SET   -static   -INLINE:aggressive=on   -LNO:opt=1   -LNO:simd=2   -fno-emit-exceptions   -m32   -OPT:unroll_times_max=8   -OPT:unroll_size=256   -OPT:unroll_level=2   -HP:bdt=2m:heap=2m   -GRA:unspill=on   -CG:cmp_peep=on   -CG:movext_icmp=off   -TENV:frame_pointer=off   -march=bdver1 
450.soplex:  -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -O3   -LNO:ignore_feedback=off   -INLINE:aggressive=on   -OPT:RO=1   -OPT:IEEE_arith=3   -OPT:IEEE_NaN_Inf=off   -OPT:fold_unsigned_relops=on   -fno-exceptions   -CG:p2align=0   -m32   -mno-fma4   -HP:bdt=2m:heap=2m   -WOPT:sib=on   -march=bdver1 
453.povray:  -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -CG:pre_local_sched=off   -CG:p2align=0   -CG:p2align_split=on   -CG:dsched=on   -INLINE:aggressive=on   -HP:bd=2m:heap=2m   -OPT:transform=2   -OPT:alias=disjoint   -WOPT:aggcm=0   -march=bdver2 

Fortran benchmarks:

410.bwaves:  -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -OPT:Ofast   -OPT:treeheight=on   -LNO:blocking=off   -LNO:ignore_feedback=off   -LNO:fu=4   -LNO:loop_model_simd=on   -LNO:simd_rm_unity_remainder=on   -WOPT:aggstr=0   -HP:bdt=2m:heap=2m   -CG:cmp_peep=on   -march=bdver1 
416.gamess:  -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -LNO:fu=6   -LNO:blocking=0   -LNO:simd=2   -OPT:ro=3   -OPT:recip=on   -CG:local_sched_alg=1   -HP:bdt=2m:heap=2m   -WOPT:sib=on   -march=bdver1 
434.zeusmp:  -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -LNO:blocking=off   -LNO:interchange=off   -IPA:plimit=1500   -HP:bdt=2m:heap=2m   -march=bdver1 
437.leslie3d:  -Ofast   -CG:pre_minreg_level=2   -LNO:simd=0   -LNO:fusion=2   -HP:bdt=2m:heap=2m   -mso   -march=bdver1 
459.GemsFDTD:  -Ofast   -IPA:plimit=1500   -OPT:unroll_size=1024   -OPT:unroll_times_max=16   -LNO:fission=2   -CG:local_sched_alg=2   -HP   -march=bdver1 
465.tonto:  -Ofast   -OPT:alias=no_f90_pointer_alias   -LNO:blocking=off   -CG:load_exe=1   -CG:local_sched_alg=3   -IPA:plimit=525   -HP:bdt=2m:heap=2m   -march=bdver1 

Benchmarks using both Fortran and C:

435.gromacs:  -Ofast   -OPT:rsqrt=2   -HP:bdt=2m:heap=2m   -CG:local_sched_alg=2   -CG:load_exe=3   -GRA:unspill=on   -march=bdver1   -LNO:simd=3 
436.cactusADM:  -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -LNO:blocking=off   -LNO:prefetch=2   -LNO:pf2=0   -LNO:prefetch_ahead=4   -HP   -CG:locs_shallow_depth=1   -CG:load_exe=0   -CG:dsched=on   -WOPT:sib=on   -march=bdver1 
454.calculix:  -Ofast   -OPT:unroll_size=256   -OPT:alias=disjoint   -GRA:optimize_boundary=on   -CG:dsched=on   -HP:bdt=2m:heap=2m   -march=bdver1 
481.wrf:  -Ofast   -LNO:blocking=off   -LANG:copyinout=off   -IPA:callee_limit=5000   -GRA:prioritize_by_density=on   -HP   -WOPT:sib=on   -march=bdver1 

The flags files that were used to format this result can be browsed at
http://www.spec.org/cpu2006/flags/x86-open64-452-flags-rate-revA-II.html,
http://www.spec.org/cpu2006/flags/HP-Platform-Flags-AMD-V1.2-A.20121205.html.

You can also download the XML flags sources by saving the following links:
http://www.spec.org/cpu2006/flags/x86-open64-452-flags-rate-revA-II.xml,
http://www.spec.org/cpu2006/flags/HP-Platform-Flags-AMD-V1.2-A.20121205.xml.