SPEC® CINT2006 Result

Copyright 2006-2014 Standard Performance Evaluation Corporation

Supermicro (Test Sponsor: Advanced Micro Devices)

Supermicro A+ Server 1022G-NTF,
AMD Opteron 6378

CPU2006 license: 49 Test date: Sep-2012
Test sponsor: Advanced Micro Devices Hardware Availability: Nov-2012
Tested by: Advanced Micro Devices Software Availability: Aug-2012
Benchmark results graph
Hardware
CPU Name: AMD Opteron 6378
CPU Characteristics: AMD Turbo CORE technology up to 3.30 GHz
CPU MHz: 2400
FPU: Integrated
CPU(s) enabled: 32 cores, 2 chips, 16 cores/chip
CPU(s) orderable: 1,2 chips
Primary Cache: 512 KB I on chip per chip,
64 KB I shared / 2 cores;
16 KB D on chip per core
Secondary Cache: 16 MB I+D on chip per chip, 2 MB shared / 2 cores
L3 Cache: 16 MB I+D on chip per chip, 8 MB shared / 8 cores
Other Cache: None
Memory: 128 GB (16 x 8 GB 2Rx4 PC3-12800R-11, ECC)
Disk Subsystem: 1 x 250 GB SATA, 7200 RPM
Other Hardware: None
Software
Operating System: Red Hat Enterprise Linux Server release 6.3,
Kernel 2.6.32-279.el6.x86_64
Compiler: C/C++: Version 4.5.2 of x86 Open64 Compiler Suite
(from AMD)
Auto Parallel: No
File System: ext3
System State: Run level 3 (Full multiuser with network)
Base Pointers: 32/64-bit
Peak Pointers: 32/64-bit
Other Software: SmartHeap 10.0 32-bit Library for Linux

Results Table

Benchmark Base Peak
Copies Seconds Ratio Seconds Ratio Seconds Ratio Copies Seconds Ratio Seconds Ratio Seconds Ratio
Results appear in the order in which they were run. Bold underlined text indicates a median measurement.
400.perlbench 32 896 349 938 333 902 347 32 758 413 785 398 756 413
401.bzip2 32 1042 296 1067 289 1053 293 32 996 310 979 316 983 314
403.gcc 32 732 352 715 360 718 359 32 642 401 642 401 644 400
429.mcf 32 674 433 672 434 673 433 32 369 791 370 789 372 785
445.gobmk 32 957 351 958 351 958 350 32 873 384 874 384 874 384
456.hmmer 32 410 729 410 728 411 727 32 363 823 365 819 364 821
458.sjeng 32 1151 336 1149 337 1151 337 32 1105 350 1106 350 1106 350
462.libquantum 32 171 3880 172 3850 171 3880 32 171 3880 172 3850 171 3880
464.h264ref 32 1701 416 1654 428 1654 428 32 1357 522 1356 522 1357 522
471.omnetpp 32 532 376 533 376 532 376 32 532 376 533 376 532 376
473.astar 32 709 317 706 318 708 317 32 568 395 568 396 571 394
483.xalancbmk 32 438 504 439 503 439 503 32 433 510 434 509 435 508

Submit Notes

The config file option 'submit' was used.
'numactl' was used to bind copies to the cores.
See the configuration file for details.

Operating System Notes

'ulimit -s unlimited' was used to set environment stack size
'ulimit -l 2097152'  was used to set environment locked pages in memory limit

Set transparent_hugepage=never as a boot parameter in /boot/grub/menu.lst

Set vm/nr_hugepages=28672 in /etc/sysctl.conf
mount -t hugetlbfs nodev /mnt/hugepages

General Notes

Environment variables set by runspec before the start of the run:
HUGETLB_LIMIT = "896"
LD_LIBRARY_PATH = "/root/work/cpu2006v1.2/amd1206-rate-libs-revA/32:/root/work/cpu2006v1.2/amd1206-rate-libs-revA/64"

The x86 Open64 Compiler Suite is only available from (and supported by) AMD at
http://developer.amd.com/cpu/open64

Binaries were compiled on a system with 2x AMD Opteron 6386SE chips + 128GB Memory using RHEL 6.3

Base Compiler Invocation

C benchmarks:

 opencc 

C++ benchmarks:

 openCC 

Base Portability Flags

400.perlbench:  -DSPEC_CPU_LP64   -DSPEC_CPU_LINUX_X64 
401.bzip2:  -DSPEC_CPU_LP64 
403.gcc:  -DSPEC_CPU_LP64 
429.mcf:  -DSPEC_CPU_LP64 
445.gobmk:  -DSPEC_CPU_LP64 
456.hmmer:  -DSPEC_CPU_LP64 
458.sjeng:  -DSPEC_CPU_LP64 
462.libquantum:  -DSPEC_CPU_LP64   -DSPEC_CPU_LINUX 
464.h264ref:  -DSPEC_CPU_LP64 
483.xalancbmk:  -DSPEC_CPU_LINUX 

Base Optimization Flags

C benchmarks:

 -Ofast   -CG:local_sched_alg=1   -INLINE:aggressive=ON   -IPA:plimit=8000   -IPA:small_pu=100   -HP:bd=2m:heap=2m   -mso   -LNO:prefetch=2   -march=bdver1 

C++ benchmarks:

 -Ofast   -m32   -INLINE:aggressive=on   -CG:cmp_peep=on   -D__OPEN64_FAST_SET   -march=bdver1   -L/root/work/libraries/SmartHeap-10/lib -lsmartheap 

Peak Compiler Invocation

C benchmarks:

 opencc 

C++ benchmarks:

 openCC 

Peak Portability Flags

400.perlbench:  -DSPEC_CPU_LP64   -DSPEC_CPU_LINUX_X64 
401.bzip2:  -DSPEC_CPU_LP64 
445.gobmk:  -DSPEC_CPU_LP64 
456.hmmer:  -DSPEC_CPU_LP64 
458.sjeng:  -DSPEC_CPU_LP64 
462.libquantum:  -DSPEC_CPU_LP64   -DSPEC_CPU_LINUX 
464.h264ref:  -DSPEC_CPU_LP64 
473.astar:  -DSPEC_CPU_LP64 
483.xalancbmk:  -DSPEC_CPU_LINUX 

Peak Optimization Flags

C benchmarks:

400.perlbench:  -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -LNO:prefetch=2   -LNO:opt=0   -IPA:plimit=20000   -OPT:unroll_times_max=8   -OPT:unroll_size=256   -OPT:unroll_level=2   -OPT:keep_ext=on   -WOPT:if_conv=0   -WOPT:sib=on   -CG:local_sched_alg=1   -CG:unroll_fb_req=on   -CG:movext_icmp=off   -HP:bd=2m:heap=2m   -march=bdver1   -GRA:aggr_loop_splitting=off   -GRA:loop_splitting=off 
401.bzip2:  -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -O3   -LNO:prefetch=2   -LNO:pf2=0   -OPT:alias=disjoint   -OPT:goto=off   -CG:local_sched_alg=1   -HP:bdt=2m:heap=2m   -march=bdver2 
403.gcc:  -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -LNO:trip_count=256   -CG:cmp_peep=on   -CG:pre_minreg_level=2   -m32   -HP:bdt=2m:heap=2m   -GRA:unspill=on   -IPA:small_pu=200   -WOPT:sib=on   -march=bdver2   -mno-fma4 
429.mcf:  -O3   -OPT:unroll_times_max=5   -ipa   -INLINE:aggressive=on   -CG:gcm=off   -CG:dsched=on   -GRA:prioritize_by_density=on   -m32   -HP:bdt=2m:heap=2m   -mso   -march=bdver1 
445.gobmk:  -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -OPT:unroll_size=256   -OPT:unroll_times_max=8   -OPT:keep_ext=on   -IPA:plimit=750   -IPA:min_hotness=300   -IPA:pu_reorder=1   -LNO:ignore_feedback=off   -WOPT:if_conv=2   -HP:bd=2m:heap=2m   -march=bdver1 
456.hmmer:  -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -LNO:prefetch=2   -OPT:alias=disjoint   -OPT:unroll_times_max=16   -OPT:unroll_size=512   -OPT:unroll_level=2   -OPT:keep_ext=on   -CG:cflow=0   -CG:cmp_peep=on   -CG:pre_local_sched=off   -HP:bdt=2m:heap=2m   -CG:p2align=0   -CG:load_exe=3   -CG:dsched=on   -march=bdver1 
458.sjeng:  -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -CG:ptr_load_use=0   -CG:divrem_opt=on   -CG:movext_icmp=off   -CG:locs_best=on   -LNO:full_unroll=10   -IPA:pu_reorder=2   -HP:heap=2m:bd=2m   -WOPT:sib=on   -march=bdver1 
462.libquantum:  basepeak = yes 
464.h264ref:  -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -O3   -OPT:unroll_size=256   -OPT:unroll_times_max=2   -IPA:plimit=20000   -OPT:alias=disjoint   -CG:ptr_load_use=0   -CG:local_sched_alg=1   -HP:bdt=2m:heap=2m   -march=bdver1 

C++ benchmarks:

471.omnetpp:  basepeak = yes 
473.astar:  -fb_create fbdata(pass 1)   -fb_opt fbdata(pass 2)   -Ofast   -WOPT:if_conv=0   -WOPT:sib=on   -CG:divrem_opt=on   -CG:p2align=1   -CG:dsched=on   -GRA:optimize_boundary=on   -OPT:alias=disjoint   -INLINE:aggressive=on   -IPA:small_pu=3000   -IPA:plimit=3000   -HP:bdt=2m:heap=2m   -march=bdver1 
483.xalancbmk:  -Ofast   -LNO:prefetch=2   -OPT:unroll_size=512   -OPT:unroll_times_max=8   -D__OPEN64_FAST_SET   -INLINE:aggressive=on   -m32   -CG:cmp_peep=on   -CG:local_sched=off   -CG:p2align=1   -GRA:unspill=on   -TENV:frame_pointer=off   -fno-emit-exceptions   -march=bdver2   -mno-fma4   -L/root/work/libraries/SmartHeap-10/lib -lsmartheap 

The flags file that was used to format this result can be browsed at
http://www.spec.org/cpu2006/flags/x86-open64-452-flags-rate-revA-II.html.

You can also download the XML flags source by saving the following link:
http://www.spec.org/cpu2006/flags/x86-open64-452-flags-rate-revA-II.xml.