SPEC(R) CFP2006 Summary Tyan Tyan YR190B8228, AMD Opteron 4164 EE Test Sponsor: Advanced Micro Devices Thu Dec 2 11:59:31 2010 CPU2006 License: 49 Test date: Dec-2010 Test sponsor: Advanced Micro Devices Hardware availability: Aug-2010 Tested by: Advanced Micro Devices Software availability: May-2010 Base Base Base Peak Peak Peak Benchmarks Ref. Run Time Ratio Ref. Run Time Ratio -------------- ------ --------- --------- ------ --------- --------- 410.bwaves 13590 231 58.8 S 13590 139 97.7 S 410.bwaves 13590 231 58.8 * 13590 139 97.8 S 410.bwaves 13590 230 59.0 S 13590 139 97.8 * 416.gamess 19580 1687 11.6 S 19580 1544 12.7 * 416.gamess 19580 1687 11.6 * 19580 1542 12.7 S 416.gamess 19580 1685 11.6 S 19580 1545 12.7 S 433.milc 9180 598 15.4 S 9180 443 20.7 S 433.milc 9180 599 15.3 * 9180 443 20.7 * 433.milc 9180 602 15.3 S 9180 441 20.8 S 434.zeusmp 9100 297 30.7 S 9100 287 31.7 * 434.zeusmp 9100 297 30.6 S 9100 286 31.8 S 434.zeusmp 9100 297 30.6 * 9100 287 31.7 S 435.gromacs 7140 734 9.72 S 7140 568 12.6 S 435.gromacs 7140 737 9.69 S 7140 569 12.6 * 435.gromacs 7140 735 9.72 * 7140 569 12.6 S 436.cactusADM 11950 170 70.2 * 11950 116 103 S 436.cactusADM 11950 169 70.5 S 11950 115 104 S 436.cactusADM 11950 172 69.6 S 11950 116 103 * 437.leslie3d 9400 642 14.6 * 9400 600 15.7 S 437.leslie3d 9400 644 14.6 S 9400 594 15.8 S 437.leslie3d 9400 641 14.7 S 9400 596 15.8 * 444.namd 8020 864 9.28 S 8020 787 10.2 S 444.namd 8020 862 9.31 S 8020 785 10.2 S 444.namd 8020 864 9.29 * 8020 785 10.2 * 447.dealII 11440 620 18.4 S 11440 547 20.9 S 447.dealII 11440 622 18.4 S 11440 547 20.9 * 447.dealII 11440 621 18.4 * 11440 548 20.9 S 450.soplex 8340 707 11.8 S 8340 629 13.3 S 450.soplex 8340 709 11.8 * 8340 625 13.3 * 450.soplex 8340 711 11.7 S 8340 625 13.3 S 453.povray 5320 397 13.4 S 5320 383 13.9 S 453.povray 5320 394 13.5 * 5320 382 13.9 * 453.povray 5320 393 13.5 S 5320 381 14.0 S 454.calculix 8250 503 16.4 S 8250 472 17.5 S 454.calculix 8250 501 16.5 S 8250 471 17.5 * 454.calculix 8250 502 16.4 * 8250 471 17.5 S 459.GemsFDTD 10610 367 28.9 S 10610 352 30.2 S 459.GemsFDTD 10610 366 29.0 S 10610 352 30.1 * 459.GemsFDTD 10610 366 29.0 * 10610 352 30.1 S 465.tonto 9840 675 14.6 S 9840 638 15.4 S 465.tonto 9840 675 14.6 * 9840 638 15.4 S 465.tonto 9840 674 14.6 S 9840 638 15.4 * 470.lbm 13740 684 20.1 S 13740 83.3 165 S 470.lbm 13740 681 20.2 S 13740 82.8 166 S 470.lbm 13740 681 20.2 * 13740 83.2 165 * 481.wrf 11170 411 27.2 S 11170 411 27.2 S 481.wrf 11170 414 27.0 S 11170 414 27.0 S 481.wrf 11170 411 27.1 * 11170 411 27.1 * 482.sphinx3 19490 1025 19.0 S 19490 982 19.9 S 482.sphinx3 19490 1029 18.9 * 19490 989 19.7 * 482.sphinx3 19490 1033 18.9 S 19490 993 19.6 S ============================================================================== 410.bwaves 13590 231 58.8 * 13590 139 97.8 * 416.gamess 19580 1687 11.6 * 19580 1544 12.7 * 433.milc 9180 599 15.3 * 9180 443 20.7 * 434.zeusmp 9100 297 30.6 * 9100 287 31.7 * 435.gromacs 7140 735 9.72 * 7140 569 12.6 * 436.cactusADM 11950 170 70.2 * 11950 116 103 * 437.leslie3d 9400 642 14.6 * 9400 596 15.8 * 444.namd 8020 864 9.29 * 8020 785 10.2 * 447.dealII 11440 621 18.4 * 11440 547 20.9 * 450.soplex 8340 709 11.8 * 8340 625 13.3 * 453.povray 5320 394 13.5 * 5320 382 13.9 * 454.calculix 8250 502 16.4 * 8250 471 17.5 * 459.GemsFDTD 10610 366 29.0 * 10610 352 30.1 * 465.tonto 9840 675 14.6 * 9840 638 15.4 * 470.lbm 13740 681 20.2 * 13740 83.2 165 * 481.wrf 11170 411 27.1 * 11170 411 27.1 * 482.sphinx3 19490 1029 18.9 * 19490 989 19.7 * SPECfp(R)_base2006 19.2 SPECfp2006 24.7 HARDWARE -------- CPU Name: AMD Opteron 4164 EE CPU Characteristics: CPU MHz: 1800 FPU: Integrated CPU(s) enabled: 12 cores, 2 chips, 6 cores/chip CPU(s) orderable: 1,2 chips Primary Cache: 64 KB I + 64 KB D on chip per core Secondary Cache: 512 KB I+D on chip per core L3 Cache: 6 MB I+D on chip per chip Other Cache: None Memory: 32 GB (4 x 8 GB 2Rx4 PC3-10600R-9, ECC) Disk Subsystem: 1 x 128 GB SATA SSD Crucial RealSSD C300 CTFDDAC128MAG-1G1 Other Hardware: None SOFTWARE -------- Operating System: SUSE Linux Enterprise Server 11 (x86_64), Kernel 2.6.27.19-5-default Compiler: x86 Open64 4.2.3.2 Compiler Suite (from AMD) Auto Parallel: Yes File System: ext3 System State: Run level 3 (Full multiuser with network) Base Pointers: 64-bit Peak Pointers: 32/64-bit Other Software: None Submit Notes ------------ The config file option 'submit' was used. 'numactl' was used to bind copies to the cores. See the configuration file for details. Operating System Notes ---------------------- 'ulimit -s unlimited' was used to set environment stack size 'ulimit -l 2097152' was used to set environment locked pages in memory limit Set vm/nr_hugepages=2000 in /etc/sysctl.conf mount -t hugetlbfs nodev /mnt/hugepages powersave -f was used to set the CPU frequency to its maximum. Binaries were compiled on SLES10 SP2 with binutils 2.18 General Notes ------------- Environment variables set by runspec before the start of the run: LD_LIBRARY_PATH = "/root/work/cpu2006/amd1002-speed-libs-revA/64:/root/work/cpu2006/amd1002-speed-libs-revA/32" O64_OMP_AFFINITY_MAP = "0,1,2,3,4,5,6,7,8,9,10,11" O64_OMP_SPIN_USER_LOCK = "true" The x86 Open64 Compiler Suite is only available from (and supported by) AMD at http://developer.amd.com/cpu/open64 Base Compiler Invocation ------------------------ C benchmarks: opencc C++ benchmarks: openCC Fortran benchmarks: openf95 Benchmarks using both Fortran and C: opencc openf95 Base Portability Flags ---------------------- 410.bwaves: -DSPEC_CPU_LP64 416.gamess: -DSPEC_CPU_LP64 433.milc: -DSPEC_CPU_LP64 434.zeusmp: -DSPEC_CPU_LP64 435.gromacs: -DSPEC_CPU_LP64 436.cactusADM: -DSPEC_CPU_LP64 -fno-second-underscore 437.leslie3d: -DSPEC_CPU_LP64 444.namd: -DSPEC_CPU_LP64 447.dealII: -DSPEC_CPU_LP64 450.soplex: -DSPEC_CPU_LP64 453.povray: -DSPEC_CPU_LP64 454.calculix: -DSPEC_CPU_LP64 459.GemsFDTD: -DSPEC_CPU_LP64 465.tonto: -DSPEC_CPU_LP64 470.lbm: -DSPEC_CPU_LP64 481.wrf: -DSPEC_CPU_LP64 -DSPEC_CPU_LINUX -DSPEC_CPU_CASE_FLAG -fno-second-underscore 482.sphinx3: -DSPEC_CPU_LP64 Base Optimization Flags ----------------------- C benchmarks: -march=barcelona -Ofast -HP:bdt=2m:heap=2m C++ benchmarks: -march=barcelona -Ofast -static -INLINE:aggressive=on -HP:bdt=2m:heap=2m Fortran benchmarks: -march=barcelona -Ofast -apo -LNO:parallel_overhead=10000 -LNO:fusion_peeling_limit=0 -HP:bdt=2m:heap=2m Benchmarks using both Fortran and C: -march=barcelona -Ofast -HP:bdt=2m:heap=2m -apo -LNO:parallel_overhead=10000 -LNO:fusion_peeling_limit=0 Peak Compiler Invocation ------------------------ C benchmarks: opencc C++ benchmarks: openCC Fortran benchmarks: openf95 Benchmarks using both Fortran and C: opencc openf95 Peak Portability Flags ---------------------- 410.bwaves: -DSPEC_CPU_LP64 416.gamess: -DSPEC_CPU_LP64 433.milc: -DSPEC_CPU_LP64 434.zeusmp: -DSPEC_CPU_LP64 435.gromacs: -DSPEC_CPU_LP64 436.cactusADM: -DSPEC_CPU_LP64 -fno-second-underscore 437.leslie3d: -DSPEC_CPU_LP64 444.namd: -DSPEC_CPU_LP64 453.povray: -DSPEC_CPU_LP64 454.calculix: -DSPEC_CPU_LP64 459.GemsFDTD: -DSPEC_CPU_LP64 465.tonto: -DSPEC_CPU_LP64 470.lbm: -DSPEC_CPU_LP64 481.wrf: -DSPEC_CPU_LP64 -DSPEC_CPU_LINUX -DSPEC_CPU_CASE_FLAG -fno-second-underscore 482.sphinx3: -DSPEC_CPU_LP64 Peak Optimization Flags ----------------------- C benchmarks: 433.milc: -march=barcelona -Ofast -apo -CG:movnti=1 -CG:local_sched_alg=1 -CG:locs_shallow_depth=1 -CG:compute_to=on -HP:bdt=2m:heap=2m -LNO:prefetch=3 470.lbm: -march=barcelona -Ofast -mso -apo -CG:sse_cse_regs=0 -LNO:prefetch_ahead=4 -CG:locs_shallow_depth=1 -CG:cmp_peep=on -CG:compute_to=on -OPT:unroll_times_max=8 -OPT:unroll_size=256 -OPT:unroll_level=2 -OPT:keep_ext=on -OPT:alias=restricted -m3dnow -IPA:inline=off 482.sphinx3: -march=barcelona -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -Ofast -OPT:malloc_alg=2 -CG:sse_cse_regs=0 -CG:locs_shallow_depth=1 -CG:cmp_peep=on -CG:local_sched_alg=1 -INLINE:aggressive=on C++ benchmarks: 444.namd: -march=barcelona -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -Ofast -LNO:ignore_feedback=off -CG:local_sched_alg=2 -CG:load_exe=0 -CG:compute_to=on -OPT:unroll_size=256 -fno-exceptions -HP:bdt=2m:heap=2m 447.dealII: -march=barcelona -Ofast -static -INLINE:aggressive=on -LNO:opt=0 -fno-emit-exceptions -m32 -OPT:unroll_times_max=8 -OPT:unroll_size=256 -OPT:unroll_level=2 -HP:bdt=2m:heap=2m -GRA:unspill=on -CG:cmp_peep=on -TENV:frame_pointer=off 450.soplex: -march=barcelona -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -O3 -INLINE:aggressive=on -OPT:IEEE_arith=3 -OPT:IEEE_NaN_Inf=off -OPT:fold_unsigned_relops=on -CG:load_exe=0 -fno-exceptions -m32 -HP:bdt=2m:heap=2m 453.povray: -march=barcelona -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -Ofast -INLINE:aggressive=on -HP:bdt=2m:heap=2m Fortran benchmarks: 410.bwaves: -march=barcelona -Ofast -apo -OPT:malloc_alg=2 -CG:use_prefetchnta=on -CG:cmp_peep=on -LNO:blocking=off -LNO:prefetch=3 -LNO:prefetch_ahead=5 -LNO:ignore_feedback=off -LNO:apo_use_feedback=on -WOPT:aggstr=0 416.gamess: -march=barcelona -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -O3 -LNO:fu=6 -LNO:blocking=0 -LNO:prefetch=0 -OPT:Ofast -OPT:ro=3 -OPT:unroll_size=256 -HP:bdt=2m:heap=2m 434.zeusmp: -march=barcelona -Ofast -apo -LNO:blocking=off -LNO:interchange=off -LNO:fusion_peeling_limit=0 -OPT:treeheight=on -OPT:unroll_size=256 -CG:cmp_peep=on -CG:compute_to=on -GRA:prioritize_by_density=on -HP:bdt=2m:heap=2m 437.leslie3d: -march=barcelona -Ofast -apo -OPT:unroll_size=256 -LNO:prefetch_ahead=4 -LNO:parallel_overhead=32768 -GRA:prioritize_by_density=on -m3dnow -HP:bdt=2m:heap=2m 459.GemsFDTD: -march=barcelona -Ofast -apo -LNO:fission=2 -LNO:prefetch_ahead=1 -CG:load_exe=0 -CG:local_sched_alg=1 -HP 465.tonto: -march=barcelona -Ofast -apo -OPT:alias=no_f90_pointer_alias -LNO:blocking=off -CG:load_exe=1 -IPA:plimit=525 -HP Benchmarks using both Fortran and C: 435.gromacs: -march=barcelona -Ofast -apo -OPT:rsqrt=2 -HP:bdt=2m:heap=2m 436.cactusADM: -march=barcelona -fb_create fbdata(pass 1) -fb_opt fbdata(pass 2) -Ofast -apo -LANG:heap_allocation_threshold=1000 -LNO:prefetch_ahead=1 -HP:bdt=2m:heap=2m 454.calculix: -march=barcelona -Ofast -LNO:prefetch_ahead=30 -CG:load_exe=0 -CG:ptr_load_use=0 -CG:local_sched_alg=2 -CG:compute_to=on -WOPT:unroll=2 -GRA:optimize_boundary=on -HP:bdt=2m:heap=2m -apo 481.wrf: basepeak = yes The flags files that were used to format this result can be browsed at http://www.spec.org/cpu2006/flags/x86-open64-423-flags-speed-revA.20101207.html http://www.spec.org/cpu2006/flags/amd-platform-speed-revA.html You can also download the XML flags sources by saving the following links: http://www.spec.org/cpu2006/flags/x86-open64-423-flags-speed-revA.20101207.xml http://www.spec.org/cpu2006/flags/amd-platform-speed-revA.xml SPEC and SPECfp are registered trademarks of the Standard Performance Evaluation Corporation. All other brand and product names appearing in this result are trademarks or registered trademarks of their respective holders. ----------------------------------------------------------------------------- For questions about this result, please contact the tester. For other inquiries, please contact webmaster@spec.org. Copyright 2006-2014 Standard Performance Evaluation Corporation Tested with SPEC CPU2006 v1.1. Report generated on Wed Jul 23 15:20:03 2014 by CPU2006 ASCII formatter v6932. Originally published on 3 February 2011.