SPEC® CINT2006 Result

Copyright 2006-2014 Standard Performance Evaluation Corporation

Hewlett-Packard Company

ProLiant DL385p Gen8
(3.20 GHz AMD Opteron 6328)

CPU2006 license: 3 Test date: Apr-2014
Test sponsor: Hewlett-Packard Company Hardware Availability: Nov-2012
Tested by: Hewlett-Packard Company Software Availability: Nov-2013
Benchmark results graph
Hardware
CPU Name: AMD Opteron 6328
CPU Characteristics: AMD Turbo CORE technology up to 3.80 GHz
CPU MHz: 3200
FPU: Integrated
CPU(s) enabled: 16 cores, 2 chips, 8 cores/chip
CPU(s) orderable: 1,2 chips
Primary Cache: 256 KB I on chip per chip,
64 KB I shared / 2 cores;
16 KB D on chip per core
Secondary Cache: 8 MB I+D on chip per chip, 2 MB shared / 2 cores
L3 Cache: 16 MB I+D on chip per chip, 8 MB shared / 4 cores
Other Cache: None
Memory: 128 GB (16 x 8 GB 2Rx4 PC3-12800R-11, ECC)
Disk Subsystem: 1 x 400 GB SSD SAS, RAID 0
Other Hardware: None
Software
Operating System: Red Hat Enterprise Linux Server release 6.5,
Kernel 2.6.32-431.el6.x86_64
Compiler: C/C++: Version 4.5.2 of x86 Open64
Compiler Suite (from AMD)
Auto Parallel: No
File System: ext3
System State: Run level 3 (multi-user)
Base Pointers: 32/64-bit
Peak Pointers: 32/64-bit
Other Software: SmartHeap 10.0 32-bit Library for Linux

Results Table

Benchmark Base Peak
Copies Seconds Ratio Seconds Ratio Seconds Ratio Copies Seconds Ratio Seconds Ratio Seconds Ratio
Results appear in the order in which they were run. Bold underlined text indicates a median measurement.
400.perlbench 16 683 229 683 229 686 228 16 576 272 574 272 574 272
401.bzip2 16 764 202 772 200 782 197 16 729 212 729 212 738 209
403.gcc 16 516 249 517 249 514 251 16 464 278 460 280 462 279
429.mcf 16 479 305 479 304 479 304 16 275 531 274 532 273 534
445.gobmk 16 723 232 723 232 722 232 16 659 255 659 255 659 255
456.hmmer 16 298 500 306 489 299 499 16 266 561 268 557 266 561
458.sjeng 16 866 224 863 224 866 224 16 835 232 834 232 834 232
462.libquantum 16 127 2600 128 2590 129 2580 16 127 2600 128 2590 129 2580
464.h264ref 16 1275 278 1240 285 1248 284 16 1014 349 1014 349 1017 348
471.omnetpp 16 379 264 378 264 380 263 16 379 264 378 264 380 263
473.astar 16 504 223 502 224 501 224 16 415 271 415 271 416 270
483.xalancbmk 16 314 351 316 350 315 350 16 310 357 310 356 310 356

Submit Notes

The config file option 'submit' was used.
'numactl' was used to bind copies to the cores.
See the configuration file for details.

Operating System Notes

'ulimit -s unlimited' was used to set environment stack size
'ulimit -l 2097152'  was used to set environment locked pages in memory limit

Set transparent_hugepage=never as a boot parameter in /boot/grub/menu.lst

Set vm/nr_hugepages=14336 in /etc/sysctl.conf
mount -t hugetlbfs nodev /mnt/hugepages

Platform Notes

 BIOS Configuration:
  HP Power Profile set to Maximum Performance
  AMD-VI (IOMMU) set to Enabled
  Minimum Processor Idle Power State set to C1E State (AMD C1 Clock Ramping)
  Thermal Configuration set to Increased Cooling
  Processor Power and Utilization Monitoring set to Disabled
  Memory Refrest Rate set to 1x Refresh

General Notes

Environment variables set by runspec before the start of the run:
HUGETLB_LIMIT = "896"
LD_LIBRARY_PATH = "/cpu2006/amd1206-rate-libs-revA/32:/cpu2006/amd1206-rate-libs-revA/64"

The x86 Open64 Compiler Suite is only available from (and supported by) AMD at
http://developer.amd.com/cpu/open64

Binaries were compiled on a system with 2x AMD Opteron 6386SE chips + 128GB Memory using RHEL 6.3

Base Compiler Invocation

C benchmarks:

 opencc 

C++ benchmarks:

 openCC 

Base Portability Flags

400.perlbench:  -DSPEC_CPU_LP64   -DSPEC_CPU_LINUX_X64 
401.bzip2:  -DSPEC_CPU_LP64 
403.gcc:  -DSPEC_CPU_LP64 
429.mcf:  -DSPEC_CPU_LP64 
445.gobmk:  -DSPEC_CPU_LP64 
456.hmmer:  -DSPEC_CPU_LP64 
458.sjeng:  -DSPEC_CPU_LP64 
462.libquantum:  -DSPEC_CPU_LP64   -DSPEC_CPU_LINUX 
464.h264ref:  -DSPEC_CPU_LP64 
483.xalancbmk:  -DSPEC_CPU_LINUX 

Base Optimization Flags

C benchmarks:

 -Ofast   -CG:local_sched_alg=1   -INLINE:aggressive=ON   -IPA:plimit=8000   -IPA:small_pu=100   -HP:bd=2m:heap=2m   -mso   -LNO:prefetch=2   -march=bdver1 

C++ benchmarks:

 -Ofast   -m32   -INLINE:aggressive=on   -CG:cmp_peep=on   -D__OPEN64_FAST_SET   -march=bdver1   -L/root/work/libraries/SmartHeap-10/lib -lsmartheap 

Peak Compiler Invocation

C benchmarks:

 opencc 

C++ benchmarks:

 openCC 

Peak Portability Flags

400.perlbench:  -DSPEC_CPU_LP64   -DSPEC_CPU_LINUX_X64 
401.bzip2:  -DSPEC_CPU_LP64 
445.gobmk:  -DSPEC_CPU_LP64 
456.hmmer:  -DSPEC_CPU_LP64 
458.sjeng:  -DSPEC_CPU_LP64 
462.libquantum:  -DSPEC_CPU_LP64   -DSPEC_CPU_LINUX 
464.h264ref:  -DSPEC_CPU_LP64 
473.astar:  -DSPEC_CPU_LP64 
483.xalancbmk:  -DSPEC_CPU_LINUX 

Peak Optimization Flags

C benchmarks:

400.perlbench:  -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -LNO:prefetch=2   -LNO:opt=0   -IPA:plimit=20000   -OPT:unroll_times_max=8   -OPT:unroll_size=256   -OPT:unroll_level=2   -OPT:keep_ext=on   -WOPT:if_conv=0   -WOPT:sib=on   -CG:local_sched_alg=1   -CG:unroll_fb_req=on   -CG:movext_icmp=off   -HP:bd=2m:heap=2m   -march=bdver1   -GRA:aggr_loop_splitting=off   -GRA:loop_splitting=off 
401.bzip2:  -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -O3   -LNO:prefetch=2   -LNO:pf2=0   -OPT:alias=disjoint   -OPT:goto=off   -CG:local_sched_alg=1   -HP:bdt=2m:heap=2m   -march=bdver2 
403.gcc:  -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -LNO:trip_count=256   -CG:cmp_peep=on   -CG:pre_minreg_level=2   -m32   -HP:bdt=2m:heap=2m   -GRA:unspill=on   -IPA:small_pu=200   -WOPT:sib=on   -march=bdver2   -mno-fma4 
429.mcf:  -O3   -OPT:unroll_times_max=5   -ipa   -INLINE:aggressive=on   -CG:gcm=off   -CG:dsched=on   -GRA:prioritize_by_density=on   -m32   -HP:bdt=2m:heap=2m   -mso   -march=bdver1 
445.gobmk:  -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -OPT:unroll_size=256   -OPT:unroll_times_max=8   -OPT:keep_ext=on   -IPA:plimit=750   -IPA:min_hotness=300   -IPA:pu_reorder=1   -LNO:ignore_feedback=off   -WOPT:if_conv=2   -HP:bd=2m:heap=2m   -march=bdver1 
456.hmmer:  -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -LNO:prefetch=2   -OPT:alias=disjoint   -OPT:unroll_times_max=16   -OPT:unroll_size=512   -OPT:unroll_level=2   -OPT:keep_ext=on   -CG:cflow=0   -CG:cmp_peep=on   -CG:pre_local_sched=off   -HP:bdt=2m:heap=2m   -CG:p2align=0   -CG:load_exe=3   -CG:dsched=on   -march=bdver1 
458.sjeng:  -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -CG:ptr_load_use=0   -CG:divrem_opt=on   -CG:movext_icmp=off   -CG:locs_best=on   -LNO:full_unroll=10   -IPA:pu_reorder=2   -HP:heap=2m:bd=2m   -WOPT:sib=on   -march=bdver1 
462.libquantum:  basepeak = yes 
464.h264ref:  -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -O3   -OPT:unroll_size=256   -OPT:unroll_times_max=2   -IPA:plimit=20000   -OPT:alias=disjoint   -CG:ptr_load_use=0   -CG:local_sched_alg=1   -HP:bdt=2m:heap=2m   -march=bdver1 

C++ benchmarks:

471.omnetpp:  basepeak = yes 
473.astar:  -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -WOPT:if_conv=0   -WOPT:sib=on   -CG:divrem_opt=on   -CG:p2align=1   -CG:dsched=on   -GRA:optimize_boundary=on   -OPT:alias=disjoint   -INLINE:aggressive=on   -IPA:small_pu=3000   -IPA:plimit=3000   -HP:bdt=2m:heap=2m   -march=bdver1 
483.xalancbmk:  -Ofast   -LNO:prefetch=2   -OPT:unroll_size=512   -OPT:unroll_times_max=8   -D__OPEN64_FAST_SET   -INLINE:aggressive=on   -m32   -CG:cmp_peep=on   -CG:local_sched=off   -CG:p2align=1   -GRA:unspill=on   -TENV:frame_pointer=off   -fno-emit-exceptions   -march=bdver2   -mno-fma4   -L/root/work/libraries/SmartHeap-10/lib -lsmartheap 

The flags files that were used to format this result can be browsed at
http://www.spec.org/cpu2006/flags/x86-open64-452-flags-rate-revA-III.html,
http://www.spec.org/cpu2006/flags/HP-Platform-Flags-AMD-V1.2-revC.html.

You can also download the XML flags sources by saving the following links:
http://www.spec.org/cpu2006/flags/x86-open64-452-flags-rate-revA-III.xml,
http://www.spec.org/cpu2006/flags/HP-Platform-Flags-AMD-V1.2-revC.xml.