SPEC(R) CFP2006 Summary Tyan Tyan YR190-B8228, AMD Opteron 4386 Test Sponsor: Advanced Micro Devices Tue Sep 18 07:56:48 2012 CPU2006 License: 49 Test date: Sep-2012 Test sponsor: Advanced Micro Devices Hardware availability: Dec-2012 Tested by: Advanced Micro Devices Software availability: Aug-2012 Base Base Base Peak Peak Peak Benchmarks Copies Run Time Rate Copies Run Time Rate -------------- ------ --------- --------- ------ --------- --------- 410.bwaves 16 1362 160 S 16 1329 164 S 410.bwaves 16 1360 160 * 16 1334 163 S 410.bwaves 16 1352 161 S 16 1330 163 * 416.gamess 16 1454 215 S 16 1325 236 S 416.gamess 16 1453 216 * 16 1325 236 * 416.gamess 16 1432 219 S 16 1324 237 S 433.milc 16 1094 134 * 16 944 156 S 433.milc 16 1095 134 S 16 943 156 S 433.milc 16 1094 134 S 16 943 156 * 434.zeusmp 16 626 233 S 16 599 243 * 434.zeusmp 16 618 235 * 16 603 242 S 434.zeusmp 16 617 236 S 16 598 243 S 435.gromacs 16 437 262 * 16 353 324 S 435.gromacs 16 436 262 S 16 354 323 * 435.gromacs 16 437 261 S 16 355 322 S 436.cactusADM 16 727 263 S 16 661 289 S 436.cactusADM 16 723 264 S 16 661 289 * 436.cactusADM 16 724 264 * 16 661 289 S 437.leslie3d 16 1369 110 * 16 1046 144 S 437.leslie3d 16 1370 110 S 16 1046 144 * 437.leslie3d 16 1369 110 S 16 1046 144 S 444.namd 16 600 214 * 16 510 252 * 444.namd 16 599 214 S 16 510 252 S 444.namd 16 608 211 S 16 510 252 S 447.dealII 16 419 437 * 16 376 487 S 447.dealII 16 415 441 S 16 373 491 S 447.dealII 16 426 430 S 16 373 491 * 450.soplex 16 1003 133 S 16 918 145 * 450.soplex 16 1004 133 S 16 919 145 S 450.soplex 16 1004 133 * 16 918 145 S 453.povray 16 295 288 S 16 258 330 S 453.povray 16 295 288 * 16 258 329 * 453.povray 16 295 289 S 16 259 329 S 454.calculix 16 322 410 S 16 306 431 S 454.calculix 16 322 410 * 16 308 429 * 454.calculix 16 321 411 S 16 310 425 S 459.GemsFDTD 16 1675 101 * 16 1463 116 * 459.GemsFDTD 16 1676 101 S 16 1464 116 S 459.GemsFDTD 16 1675 101 S 16 1463 116 S 465.tonto 16 663 238 S 16 596 264 S 465.tonto 16 656 240 * 16 599 263 * 465.tonto 16 654 241 S 16 599 263 S 470.lbm 16 1013 217 S 16 1013 217 S 470.lbm 16 1012 217 S 16 1012 217 S 470.lbm 16 1012 217 * 16 1012 217 * 481.wrf 16 906 197 S 16 903 198 * 481.wrf 16 904 198 * 16 902 198 S 481.wrf 16 903 198 S 16 905 198 S 482.sphinx3 16 1785 175 S 16 1271 245 S 482.sphinx3 16 1787 174 * 16 1284 243 S 482.sphinx3 16 1790 174 S 16 1282 243 * ============================================================================== 410.bwaves 16 1360 160 * 16 1330 163 * 416.gamess 16 1453 216 * 16 1325 236 * 433.milc 16 1094 134 * 16 943 156 * 434.zeusmp 16 618 235 * 16 599 243 * 435.gromacs 16 437 262 * 16 354 323 * 436.cactusADM 16 724 264 * 16 661 289 * 437.leslie3d 16 1369 110 * 16 1046 144 * 444.namd 16 600 214 * 16 510 252 * 447.dealII 16 419 437 * 16 373 491 * 450.soplex 16 1004 133 * 16 918 145 * 453.povray 16 295 288 * 16 258 329 * 454.calculix 16 322 410 * 16 308 429 * 459.GemsFDTD 16 1675 101 * 16 1463 116 * 465.tonto 16 656 240 * 16 599 263 * 470.lbm 16 1012 217 * 16 1012 217 * 481.wrf 16 904 198 * 16 903 198 * 482.sphinx3 16 1787 174 * 16 1282 243 * SPECfp(R)_rate_base2006 206 SPECfp_rate2006 232 HARDWARE -------- CPU Name: AMD Opteron 4386 CPU Characteristics: AMD Turbo CORE technology up to 3.80 GHz CPU MHz: 3100 FPU: Integrated CPU(s) enabled: 16 cores, 2 chips, 8 cores/chip CPU(s) orderable: 1,2 chips Primary Cache: 256 KB I on chip per chip, 64 KB I shared / 2 cores; 16 KB D on chip per core Secondary Cache: 8 MB I+D on chip per chip, 2 MB shared / 2 cores L3 Cache: 8 MB I+D on chip per chip Other Cache: None Memory: 64 GB (4 x 16 GB 2Rx4 PC3-12800R-11, ECC) Disk Subsystem: 1 x 128 GB SSD Other Hardware: None SOFTWARE -------- Operating System: Red Hat Enterprise Linux Server release 6.3, Kernel 2.6.32-279.el6.x86_64 Compiler: C/C++/Fortran: Version 4.5.2 of x86 Open64 Compiler Suite (from AMD) Auto Parallel: No File System: ext3 System State: Run level 3 (Full multiuser with network) Base Pointers: 64-bit Peak Pointers: 32/64-bit Other Software: None Submit Notes ------------ The config file option 'submit' was used. 'numactl' was used to bind copies to the cores. See the configuration file for details. Operating System Notes ---------------------- 'ulimit -s unlimited' was used to set environment stack size 'ulimit -l 2097152' was used to set environment locked pages in memory limit Set transparent_hugepage=never as a boot parameter in /boot/grub/menu.lst Set vm/nr_hugepages=14336 in /etc/sysctl.conf mount -t hugetlbfs nodev /mnt/hugepages General Notes ------------- Environment variables set by runspec before the start of the run: HUGETLB_LIMIT = "896" LD_LIBRARY_PATH = "/root/work/cpu2006v1.2/amd1206-rate-libs-revA/32:/root/work/cpu2006v1.2/amd1206-rate-libs-revA/64" The x86 Open64 Compiler Suite is only available from (and supported by) AMD at http://developer.amd.com/cpu/open64 Binaries were compiled on a system with 2x AMD Opteron 6386SE chips + 128GB Memory using RHEL 6.3 Base Compiler Invocation ------------------------ C benchmarks: opencc C++ benchmarks: openCC Fortran benchmarks: openf95 Benchmarks using both Fortran and C: opencc openf95 Base Portability Flags ---------------------- 410.bwaves: -DSPEC_CPU_LP64 416.gamess: -DSPEC_CPU_LP64 433.milc: -DSPEC_CPU_LP64 434.zeusmp: -DSPEC_CPU_LP64 435.gromacs: -DSPEC_CPU_LP64 436.cactusADM: -DSPEC_CPU_LP64 -fno-second-underscore 437.leslie3d: -DSPEC_CPU_LP64 444.namd: -DSPEC_CPU_LP64 447.dealII: -DSPEC_CPU_LP64 450.soplex: -DSPEC_CPU_LP64 453.povray: -DSPEC_CPU_LP64 454.calculix: -DSPEC_CPU_LP64 459.GemsFDTD: -DSPEC_CPU_LP64 465.tonto: -DSPEC_CPU_LP64 470.lbm: -DSPEC_CPU_LP64 481.wrf: -DSPEC_CPU_LINUX -DSPEC_CPU_CASE_FLAG -DSPEC_CPU_LP64 -fno-second-underscore 482.sphinx3: -DSPEC_CPU_LP64 Base Optimization Flags ----------------------- C benchmarks: -Ofast -OPT:malloc_alg=1 -HP:bd=2m:heap=2m -IPA:plimit=8000 -IPA:small_pu=100 -mso -march=bdver1 C++ benchmarks: -Ofast -static -CG:load_exe=0 -OPT:malloc_alg=1 -INLINE:aggressive=on -HP:bd=2m:heap=2m -D__OPEN64_FAST_SET -march=bdver1 Fortran benchmarks: -Ofast -LNO:blocking=off -LNO:simd_peel_align=on -OPT:rsqrt=2 -OPT:unroll_size=256 -HP:bd=2m:heap=2m -mso -march=bdver1 Benchmarks using both Fortran and C: -Ofast -OPT:malloc_alg=1 -HP:bd=2m:heap=2m -IPA:plimit=8000 -IPA:small_pu=100 -mso -march=bdver1 -LNO:blocking=off -LNO:simd_peel_align=on -OPT:rsqrt=2 -OPT:unroll_size=256 Peak Compiler Invocation ------------------------ C benchmarks: opencc C++ benchmarks: openCC Fortran benchmarks: openf95 Benchmarks using both Fortran and C: opencc openf95 Peak Portability Flags ---------------------- 410.bwaves: -DSPEC_CPU_LP64 416.gamess: -DSPEC_CPU_LP64 433.milc: -DSPEC_CPU_LP64 434.zeusmp: -DSPEC_CPU_LP64 435.gromacs: -DSPEC_CPU_LP64 436.cactusADM: -DSPEC_CPU_LP64 -fno-second-underscore 437.leslie3d: -DSPEC_CPU_LP64 444.namd: -DSPEC_CPU_LP64 453.povray: -DSPEC_CPU_LP64 454.calculix: -DSPEC_CPU_LP64 459.GemsFDTD: -DSPEC_CPU_LP64 465.tonto: -DSPEC_CPU_LP64 470.lbm: -DSPEC_CPU_LP64 481.wrf: -DSPEC_CPU_LINUX -DSPEC_CPU_CASE_FLAG -DSPEC_CPU_LP64 -fno-second-underscore Peak Optimization Flags ----------------------- C benchmarks: 433.milc: -Ofast -CG:movnti=1 -CG:locs_best=on -HP:bdt=2m:heap=2m -IPA:plimit=7000 -IPA:callee_limit=1200 -OPT:struct_array_copy=2 -OPT:alias=field_sensitive -mso -march=bdver1 470.lbm: basepeak = yes 482.sphinx3: -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -Ofast -m32 -IPA:plimit=1000 -OPT:malloc_alg=2 -CG:cmp_peep=on -CG:p2align=0 -CG:load_exe=1 -CG:dsched=on -INLINE:aggressive=on -LNO:prefetch=2 -LNO:prefetch_ahead=4 -mso -march=bdver2 C++ benchmarks: 444.namd: -Ofast -IPA:plimit=3000 -LNO:ignore_feedback=off -CG:local_sched_alg=0 -CG:load_exe=0 -OPT:unroll_size=256 -fno-exceptions -HP:bdt=2m:heap=2m -LNO:if_select_conv=1 -OPT:alias=disjoint -LNO:psimd_iso_unroll=ON -march=bdver1 447.dealII: -Ofast -D__OPEN64_FAST_SET -static -INLINE:aggressive=on -LNO:opt=1 -LNO:simd=2 -fno-emit-exceptions -m32 -OPT:unroll_times_max=8 -OPT:unroll_size=256 -OPT:unroll_level=2 -HP:bdt=2m:heap=2m -GRA:unspill=on -CG:cmp_peep=on -CG:movext_icmp=off -TENV:frame_pointer=off -march=bdver1 450.soplex: -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -O3 -LNO:ignore_feedback=off -INLINE:aggressive=on -OPT:RO=1 -OPT:IEEE_arith=3 -OPT:IEEE_NaN_Inf=off -OPT:fold_unsigned_relops=on -fno-exceptions -CG:p2align=0 -m32 -mno-fma4 -HP:bdt=2m:heap=2m -WOPT:sib=on -march=bdver1 453.povray: -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -Ofast -CG:pre_local_sched=off -CG:p2align=0 -CG:p2align_split=on -CG:dsched=on -INLINE:aggressive=on -HP:bd=2m:heap=2m -OPT:transform=2 -OPT:alias=disjoint -WOPT:aggcm=0 -march=bdver2 Fortran benchmarks: 410.bwaves: -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -Ofast -OPT:Ofast -OPT:treeheight=on -LNO:blocking=off -LNO:ignore_feedback=off -LNO:fu=4 -LNO:loop_model_simd=on -LNO:simd_rm_unity_remainder=on -WOPT:aggstr=0 -HP:bdt=2m:heap=2m -CG:cmp_peep=on -march=bdver1 416.gamess: -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -Ofast -LNO:fu=6 -LNO:blocking=0 -LNO:simd=2 -OPT:ro=3 -OPT:recip=on -CG:local_sched_alg=1 -HP:bdt=2m:heap=2m -WOPT:sib=on -march=bdver1 434.zeusmp: -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -Ofast -LNO:blocking=off -LNO:interchange=off -IPA:plimit=1500 -HP:bdt=2m:heap=2m -march=bdver1 437.leslie3d: -Ofast -CG:pre_minreg_level=2 -LNO:simd=0 -LNO:fusion=2 -HP:bdt=2m:heap=2m -mso -march=bdver1 459.GemsFDTD: -Ofast -IPA:plimit=1500 -OPT:unroll_size=1024 -OPT:unroll_times_max=16 -LNO:fission=2 -CG:local_sched_alg=2 -HP -march=bdver1 465.tonto: -Ofast -OPT:alias=no_f90_pointer_alias -LNO:blocking=off -CG:load_exe=1 -CG:local_sched_alg=3 -IPA:plimit=525 -HP:bdt=2m:heap=2m -march=bdver1 Benchmarks using both Fortran and C: 435.gromacs: -Ofast -OPT:rsqrt=2 -HP:bdt=2m:heap=2m -CG:local_sched_alg=2 -CG:load_exe=3 -GRA:unspill=on -march=bdver1 -LNO:simd=3 436.cactusADM: -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -Ofast -LNO:blocking=off -LNO:prefetch=2 -LNO:pf2=0 -LNO:prefetch_ahead=4 -HP -CG:locs_shallow_depth=1 -CG:load_exe=0 -CG:dsched=on -WOPT:sib=on -march=bdver1 454.calculix: -Ofast -OPT:unroll_size=256 -OPT:alias=disjoint -GRA:optimize_boundary=on -CG:dsched=on -HP:bdt=2m:heap=2m -march=bdver1 481.wrf: -Ofast -LNO:blocking=off -LANG:copyinout=off -IPA:callee_limit=5000 -GRA:prioritize_by_density=on -HP -WOPT:sib=on -march=bdver1 The flags file that was used to format this result can be browsed at http://www.spec.org/cpu2006/flags/x86-open64-452-flags-rate-revA-II.html You can also download the XML flags source by saving the following link: http://www.spec.org/cpu2006/flags/x86-open64-452-flags-rate-revA-II.xml SPEC and SPECfp are registered trademarks of the Standard Performance Evaluation Corporation. All other brand and product names appearing in this result are trademarks or registered trademarks of their respective holders. ----------------------------------------------------------------------------- For questions about this result, please contact the tester. For other inquiries, please contact webmaster@spec.org. Copyright 2006-2014 Standard Performance Evaluation Corporation Tested with SPEC CPU2006 v1.2. Report generated on Thu Jul 24 13:23:54 2014 by CPU2006 ASCII formatter v6932. Originally published on 4 December 2012.