SPEC(R) CFP2006 Summary IBM Corporation IBM PowerLinux 7R4 (4.0 GHz, 32 core, RHEL, GCC) Sat May 18 13:02:42 2013 CPU2006 License: 11 Test date: May-2013 Test sponsor: IBM Corporation Hardware availability: Aug-2013 Tested by: IBM Corporation Software availability: May-2013 Base Base Base Peak Peak Peak Benchmarks Copies Run Time Rate Copies Run Time Rate -------------- ------ --------- --------- ------ --------- --------- 410.bwaves 128 1874 928 S 32 441 986 * 410.bwaves 128 1870 930 S 32 441 987 S 410.bwaves 128 1872 929 * 32 441 985 S 416.gamess 128 2346 1070 * 128 2284 1100 S 416.gamess 128 2341 1070 S 128 2244 1120 S 416.gamess 128 2347 1070 S 128 2251 1110 * 433.milc 128 3739 314 S 32 902 326 * 433.milc 128 3730 315 S 32 903 325 S 433.milc 128 3738 314 * 32 901 326 S 434.zeusmp 128 1019 1140 S 128 977 1190 S 434.zeusmp 128 1008 1160 S 128 978 1190 S 434.zeusmp 128 1015 1150 * 128 977 1190 * 435.gromacs 128 717 1280 S 128 698 1310 S 435.gromacs 128 714 1280 S 128 701 1300 * 435.gromacs 128 715 1280 * 128 706 1290 S 436.cactusADM 128 901 1700 S 64 389 1970 S 436.cactusADM 128 907 1690 S 64 388 1970 * 436.cactusADM 128 902 1700 * 64 388 1970 S 437.leslie3d 128 1964 613 S 32 417 721 S 437.leslie3d 128 1966 612 * 32 416 723 S 437.leslie3d 128 1968 611 S 32 417 721 * 444.namd 128 783 1310 S 128 783 1310 S 444.namd 128 780 1320 S 128 780 1320 S 444.namd 128 782 1310 * 128 782 1310 * 447.dealII 128 606 2420 * 128 610 2400 S 447.dealII 128 611 2400 S 128 599 2440 S 447.dealII 128 601 2440 S 128 600 2440 * 450.soplex 128 2009 531 S 128 2009 531 S 450.soplex 128 1545 691 S 128 1545 691 S 450.soplex 128 1550 689 * 128 1550 689 * 453.povray 128 490 1390 S 128 417 1630 S 453.povray 128 493 1380 S 128 418 1630 * 453.povray 128 493 1380 * 128 418 1630 S 454.calculix 128 778 1360 S 128 762 1390 S 454.calculix 128 779 1360 * 128 764 1380 S 454.calculix 128 782 1350 S 128 762 1390 * 459.GemsFDTD 128 2892 470 S 64 1391 488 S 459.GemsFDTD 128 2895 469 * 64 1390 488 S 459.GemsFDTD 128 2896 469 S 64 1391 488 * 465.tonto 128 971 1300 S 128 781 1610 S 465.tonto 128 969 1300 * 128 784 1610 S 465.tonto 128 967 1300 S 128 782 1610 * 470.lbm 128 3314 531 S 32 345 1270 * 470.lbm 128 3314 531 * 32 347 1270 S 470.lbm 128 3310 531 S 32 345 1270 S 481.wrf 128 1348 1060 S 96 997 1080 S 481.wrf 128 1330 1080 S 96 998 1070 * 481.wrf 128 1342 1070 * 96 1001 1070 S 482.sphinx3 128 2244 1110 S 64 858 1450 S 482.sphinx3 128 2287 1090 S 64 861 1450 * 482.sphinx3 128 2256 1110 * 64 866 1440 S ============================================================================== 410.bwaves 128 1872 929 * 32 441 986 * 416.gamess 128 2346 1070 * 128 2251 1110 * 433.milc 128 3738 314 * 32 902 326 * 434.zeusmp 128 1015 1150 * 128 977 1190 * 435.gromacs 128 715 1280 * 128 701 1300 * 436.cactusADM 128 902 1700 * 64 388 1970 * 437.leslie3d 128 1966 612 * 32 417 721 * 444.namd 128 782 1310 * 128 782 1310 * 447.dealII 128 606 2420 * 128 600 2440 * 450.soplex 128 1550 689 * 128 1550 689 * 453.povray 128 493 1380 * 128 418 1630 * 454.calculix 128 779 1360 * 128 762 1390 * 459.GemsFDTD 128 2895 469 * 64 1391 488 * 465.tonto 128 969 1300 * 128 782 1610 * 470.lbm 128 3314 531 * 32 345 1270 * 481.wrf 128 1342 1070 * 96 998 1070 * 482.sphinx3 128 2256 1110 * 64 861 1450 * SPECfp(R)_rate_base2006 983 SPECfp_rate2006 1110 HARDWARE -------- CPU Name: POWER7+ CPU Characteristics: Intelligent Energy Optimization enabled, up to 4.431 GHz CPU MHz: 4060 FPU: Integrated CPU(s) enabled: 32 cores, 8 chips, 4 cores/chip, 4 threads/core CPU(s) orderable: 16, 32 cores Primary Cache: 32 KB I + 32 KB D on chip per core Secondary Cache: 256 KB I+D on chip per core L3 Cache: 10 MB I+D on chip per core Other Cache: None Memory: 256 GB (64 x 4 GB) DDR3 1066 MHz Disk Subsystem: 1 x 300 GB SAS SFF 15K RPM Other Hardware: None SOFTWARE -------- Operating System: Red Hat Enterprise Linux Server release 6.4 (ppc64) kernel 2.6.32-358.6.1.el6.ppc64 Compiler: C/C++/Fortran: Version 4.7.3 of IBM Advance Toolchain 6.0-4 gcc/g++/gfortran compiler Auto Parallel: No File System: ext4 System State: Run level 3 (multi-user) Base Pointers: 32-bit Peak Pointers: 32/64-bit Other Software: -IBM Advance Toolchain 6.0-4 -IBM Mathematical Acceleration Subsystem (MASS) libraries 7.1.0.2 -Post-Link Optimization for Linux on POWER, version 5.6.2-1 Compiler Invocation Notes ------------------------- For more information about IBM Advance Toolchain, including support, see ftp://ftp.unicamp.br/pub/linuxpatch/toolchain/at/redhat/RHEL6/at6.0/release_notes.at6.0-6.0-4.html Peak Tuning Notes ----------------- Post-Link optimization tool used for: 410.bwaves with options -O4 -omullX for optimization phase, and -imullX for instrumentation phase 416.gamess with options -O4 -omullX for optimization phase, and -imullX for instrumentation phase 429.mcf with options -O4 -omullX for optimization phase, and -imullX for instrumentation phase 433.milc with options -O4 -omullX for optimization phase, and -imullX for instrumentation phase 434.zeusmp with options -O4 -omullX for optimization phase, and -imullX for instrumentation phase 437.leslie3d with options -O4 -omullX for optimization phase, and -imullX for instrumentation phase 453.povray with options -O4 -omullX for optimization phase, and -imullX for instrumentation phase 454.calculix with options -O4 -omullX for optimization phase, and -imullX for instrumentation phase 459.GemsFDTD with options -O4 -omullX for optimization phase, and -imullX for instrumentation phase 465.tonto with options -O4 -omullX for optimization phase, and -imullX for instrumentation phase 470.lbm with options -O4 -omullX for optimization phase, and -imullX for instrumentation phase 481.wrf with options -O4 -omullX for optimization phase, and -imullX for instrumentation phase Submit Notes ------------ The config file option 'submit' was used to assign benchmark copy to specific kernel thread using the "numactl" command (see flags file for details). Operating System Notes ---------------------- ulimit -s (stack) set to 1048576. Large pages reserved as follows by root user: echo 8448 > /proc/sys/vm/nr_hugepages The Mathematical Acceleration Subsystem libraries are shipped with IBM XL C/C++ version 12.1 and IBM XL Fortran version 14.1 compiler products. crashkernel was set to 256 MB in /etc/yaboot.conf file. General Notes ------------- Environment variables set by runspec before the start of the run: HUGETLB_ELFMAP = "RW" HUGETLB_MORECORE = "yes" HUGETLB_VERBOSE = "0" TCMALLOC_MEMFS_MALLOC_PATH = "/libhugetlbfs" XLFRTEOPTS = "intrinthds=1" Base Compiler Invocation ------------------------ C benchmarks: /opt/at6.0/bin/gcc C++ benchmarks: /opt/at6.0/bin/g++ Fortran benchmarks: /opt/at6.0/bin/gfortran Benchmarks using both Fortran and C: /opt/at6.0/bin/gcc /opt/at6.0/bin/gfortran Base Portability Flags ---------------------- 447.dealII: -DSPEC_CPU_LINUX 481.wrf: -DSPEC_CPU_CASE_FLAG -DSPEC_CPU_LINUX_PPC 482.sphinx3: -fsigned-char Base Optimization Flags ----------------------- C benchmarks: -ffast-math -O3 -mcpu=power7 -mtune=power7 -mrecip=rsqrt -fpeel-loops -funroll-loops -mpopcntd -m32 -fvect-cost-model -mveclibabi=mass -Wl,-q -Wl,-Map=link.map,--cref -L /opt/at6.0/lib -L /opt/ibmcmp/xlmass/7.1/lib -Wl,-rpath,/opt/at6.0/lib -Wl,-rpath,/opt/ibmcmp/xlmass/7.1/lib -lhugetlbfs -lmassvp7 -lmass_simdp7 -lmass C++ benchmarks: -ffast-math -O3 -mcpu=power7 -mtune=power7 -mrecip=rsqrt -fpeel-loops -funroll-loops -mpopcntd -m32 -fvect-cost-model -mveclibabi=mass --param max-inline-insns-auto=200 -fno-associative-math -flto -fwhole-program -fuse-linker-plugin -Wl,-q -Wl,-Map=link.map,--cref -L /opt/at6.0/lib -L /opt/ibmcmp/xlmass/7.1/lib -Wl,-rpath,/opt/at6.0/lib -Wl,-rpath,/opt/ibmcmp/xlmass/7.1/lib -lhugetlbfs -lmassvp7 -lmass_simdp7 -lmass Fortran benchmarks: -ffast-math -O3 -mcpu=power7 -mtune=power7 -mrecip=rsqrt -fpeel-loops -funroll-loops -mpopcntd -m32 -fvect-cost-model -mveclibabi=mass -Wl,-q -Wl,-Map=link.map,--cref -L /opt/at6.0/lib -L /opt/ibmcmp/xlmass/7.1/lib -Wl,-rpath,/opt/at6.0/lib -Wl,-rpath,/opt/ibmcmp/xlmass/7.1/lib -lhugetlbfs -lmassvp7 -lmass_simdp7 -lmass Benchmarks using both Fortran and C: -ffast-math -O3 -mcpu=power7 -mtune=power7 -mrecip=rsqrt -fpeel-loops -funroll-loops -mpopcntd -m32 -fvect-cost-model -mveclibabi=mass -Wl,-q -Wl,-Map=link.map,--cref -L /opt/at6.0/lib -L /opt/ibmcmp/xlmass/7.1/lib -Wl,-rpath,/opt/at6.0/lib -Wl,-rpath,/opt/ibmcmp/xlmass/7.1/lib -lhugetlbfs -lmassvp7 -lmass_simdp7 -lmass Peak Compiler Invocation ------------------------ C benchmarks: /opt/at6.0/bin/gcc C++ benchmarks: /opt/at6.0/bin/g++ Fortran benchmarks: /opt/at6.0/bin/gfortran Benchmarks using both Fortran and C: /opt/at6.0/bin/gcc /opt/at6.0/bin/gfortran Peak Portability Flags ---------------------- 436.cactusADM: -DSPEC_CPU_LP64 447.dealII: -DSPEC_CPU_LINUX 459.GemsFDTD: -DSPEC_CPU_LP64 470.lbm: -DSPEC_CPU_LP64 481.wrf: -DSPEC_CPU_CASE_FLAG -DSPEC_CPU_LINUX_PPC -DSPEC_CPU_LP64 482.sphinx3: -fsigned-char Peak Optimization Flags ----------------------- C benchmarks: 433.milc: -fprofile-generate(pass 1) -fprofile-use(pass 2) -ffast-math -O3 -mcpu=power7 -mtune=power7 -mrecip=rsqrt -fpeel-loops -funroll-loops -mpopcntd -m32 -fvect-cost-model -mveclibabi=mass -flto -fwhole-program -fuse-linker-plugin -Wl,-q -Wl,-Map=link.map,--cref -L /opt/at6.0/lib -L /opt/ibmcmp/xlmass/7.1/lib -Wl,-rpath,/opt/at6.0/lib -Wl,-rpath,/opt/ibmcmp/xlmass/7.1/lib -lhugetlbfs -lmassvp7 -lmass_simdp7 -lmass 470.lbm: -fprofile-generate(pass 1) -fprofile-use(pass 2) -ffast-math -O3 -mcpu=power7 -mtune=power7 -mrecip=rsqrt -fpeel-loops -funroll-loops -mpopcntd -m64 -mcmodel=medium -fvect-cost-model -mveclibabi=mass -flto -fwhole-program -fuse-linker-plugin -Wl,-q -Wl,-Map=link.map,--cref -L /opt/at6.0/lib64 -L /opt/ibmcmp/xlmass/7.1/lib64 -Wl,-rpath,/opt/at6.0/lib64 -Wl,-rpath,/opt/ibmcmp/xlmass/7.1/lib64 -Wl,-dynamic-linker,/opt/at6.0/lib64/ld64.so.1 -lhugetlbfs -lmassvp7_64 -lmass_simdp7_64 -lmass_64 482.sphinx3: -fprofile-generate(pass 1) -fprofile-use(pass 2) -ffast-math -O3 -mcpu=power7 -mtune=power7 -mrecip=rsqrt -fpeel-loops -funroll-loops -mpopcntd -m32 -fvect-cost-model -mveclibabi=mass -Wl,-q -Wl,-Map=link.map,--cref -L /opt/at6.0/lib -L /opt/ibmcmp/xlmass/7.1/lib -Wl,-rpath,/opt/at6.0/lib -Wl,-rpath,/opt/ibmcmp/xlmass/7.1/lib -lhugetlbfs -lmassvp7 -lmass_simdp7 -lmass C++ benchmarks: 444.namd: basepeak = yes 447.dealII: -ffast-math -O3 -mcpu=power7 -mtune=power7 -mrecip=rsqrt -fpeel-loops -funroll-loops -mpopcntd -m32 -fvect-cost-model -mveclibabi=mass -flto -fwhole-program -fuse-linker-plugin --param max-inline-insns-auto=200 -fno-associative-math -Wl,-q -Wl,-Map=link.map,--cref -L /opt/at6.0/lib -L /opt/ibmcmp/xlmass/7.1/lib -Wl,-rpath,/opt/at6.0/lib -Wl,-rpath,/opt/ibmcmp/xlmass/7.1/lib -lhugetlbfs -lmassvp7 -lmass_simdp7 -lmass -ltcmalloc -lstdc++ -lpthread 450.soplex: basepeak = yes 453.povray: -fprofile-generate(pass 1) -fprofile-use(pass 2) -ffast-math -O3 -mcpu=power7 -mtune=power7 -mrecip=rsqrt -fpeel-loops -funroll-loops -mpopcntd -m32 -fvect-cost-model -mveclibabi=mass -flto -fwhole-program -fuse-linker-plugin --param max-inline-insns-auto=200 -fno-associative-math -Wl,-q -Wl,-Map=link.map,--cref -L /opt/at6.0/lib -L /opt/ibmcmp/xlmass/7.1/lib -Wl,-rpath,/opt/at6.0/lib -Wl,-rpath,/opt/ibmcmp/xlmass/7.1/lib -lhugetlbfs -lmassvp7 -lmass_simdp7 -lmass -ltcmalloc -lstdc++ -lpthread Fortran benchmarks: 410.bwaves: -fprofile-generate(pass 1) -fprofile-use(pass 2) -ffast-math -O3 -mcpu=power7 -mtune=power7 -mrecip=rsqrt -fpeel-loops -funroll-loops -mpopcntd -m64 -mcmodel=medium -fvect-cost-model -mveclibabi=mass -flto -fwhole-program -fuse-linker-plugin -Wl,-q -Wl,-Map=link.map,--cref -L /opt/at6.0/lib64 -L /opt/ibmcmp/xlmass/7.1/lib64 -Wl,-rpath,/opt/at6.0/lib64 -Wl,-rpath,/opt/ibmcmp/xlmass/7.1/lib64 -Wl,-dynamic-linker,/opt/at6.0/lib64/ld64.so.1 -lhugetlbfs -lmassvp7_64 -lmass_simdp7_64 -lmass_64 416.gamess: -fprofile-generate(pass 1) -fprofile-use(pass 2) -ffast-math -O3 -mcpu=power7 -mtune=power7 -mrecip=rsqrt -fpeel-loops -funroll-loops -mpopcntd -m32 -fvect-cost-model -mveclibabi=mass -Wl,-q -Wl,-Map=link.map,--cref -L /opt/at6.0/lib -L /opt/ibmcmp/xlmass/7.1/lib -Wl,-rpath,/opt/at6.0/lib -Wl,-rpath,/opt/ibmcmp/xlmass/7.1/lib -lhugetlbfs -lmassvp7 -lmass_simdp7 -lmass -ltcmalloc -lstdc++ -lpthread 434.zeusmp: -fprofile-generate(pass 1) -fprofile-use(pass 2) -ffast-math -O3 -mcpu=power7 -mtune=power7 -mrecip=rsqrt -fpeel-loops -funroll-loops -mpopcntd -m32 -fvect-cost-model -mveclibabi=mass -flto -fwhole-program -fuse-linker-plugin -Wl,-q -Wl,-Map=link.map,--cref -L /opt/at6.0/lib -L /opt/ibmcmp/xlmass/7.1/lib -Wl,-rpath,/opt/at6.0/lib -Wl,-rpath,/opt/ibmcmp/xlmass/7.1/lib -lhugetlbfs -lmassvp7 -lmass_simdp7 -lmass -ltcmalloc -lstdc++ -lpthread 437.leslie3d: -fprofile-generate(pass 1) -fprofile-use(pass 2) -ffast-math -O3 -mcpu=power7 -mtune=power7 -mrecip=rsqrt -fpeel-loops -funroll-loops -mpopcntd -m32 -fvect-cost-model -mveclibabi=mass -flto -fwhole-program -fuse-linker-plugin -Wl,-q -Wl,-Map=link.map,--cref -L /opt/at6.0/lib -L /opt/ibmcmp/xlmass/7.1/lib -Wl,-rpath,/opt/at6.0/lib -Wl,-rpath,/opt/ibmcmp/xlmass/7.1/lib -lhugetlbfs -lmassvp7 -lmass_simdp7 -lmass 459.GemsFDTD: -fprofile-generate(pass 1) -fprofile-use(pass 2) -ffast-math -O3 -mcpu=power7 -mtune=power7 -mrecip=rsqrt -fpeel-loops -funroll-loops -mpopcntd -m64 -mcmodel=medium -fvect-cost-model -mveclibabi=mass -Wl,-q -Wl,-Map=link.map,--cref -L /opt/at6.0/lib64 -L /opt/ibmcmp/xlmass/7.1/lib64 -Wl,-rpath,/opt/at6.0/lib64 -Wl,-rpath,/opt/ibmcmp/xlmass/7.1/lib64 -Wl,-dynamic-linker,/opt/at6.0/lib64/ld64.so.1 -lhugetlbfs -lmassvp7_64 -lmass_simdp7_64 -lmass_64 465.tonto: Same as 416.gamess Benchmarks using both Fortran and C: 435.gromacs: -fprofile-generate(pass 1) -fprofile-use(pass 2) -ffast-math -O3 -mcpu=power7 -mtune=power7 -mrecip=rsqrt -fpeel-loops -funroll-loops -mpopcntd -m32 -fvect-cost-model -mveclibabi=mass -Wl,-q -Wl,-Map=link.map,--cref -L /opt/at6.0/lib -L /opt/ibmcmp/xlmass/7.1/lib -Wl,-rpath,/opt/at6.0/lib -Wl,-rpath,/opt/ibmcmp/xlmass/7.1/lib -lhugetlbfs -lmassvp7 -lmass_simdp7 -lmass 436.cactusADM: -fprofile-generate(pass 1) -fprofile-use(pass 2) -ffast-math -O3 -mcpu=power7 -mtune=power7 -mrecip=rsqrt -fpeel-loops -funroll-loops -mpopcntd -m64 -mcmodel=medium -fvect-cost-model -mveclibabi=mass -Wl,-q -Wl,-Map=link.map,--cref -L /opt/at6.0/lib64 -L /opt/ibmcmp/xlmass/7.1/lib64 -Wl,-rpath,/opt/at6.0/lib64 -Wl,-rpath,/opt/ibmcmp/xlmass/7.1/lib64 -Wl,-dynamic-linker,/opt/at6.0/lib64/ld64.so.1 -lhugetlbfs -lmassvp7_64 -lmass_simdp7_64 -lmass_64 454.calculix: -ffast-math -O3 -mcpu=power7 -mtune=power7 -mrecip=rsqrt -fpeel-loops -funroll-loops -mpopcntd -m32 -fvect-cost-model -mveclibabi=mass -flto -fwhole-program -fuse-linker-plugin -Wl,-q -Wl,-Map=link.map,--cref -L /opt/at6.0/lib -L /opt/ibmcmp/xlmass/7.1/lib -Wl,-rpath,/opt/at6.0/lib -Wl,-rpath,/opt/ibmcmp/xlmass/7.1/lib -lhugetlbfs -lmassvp7 -lmass_simdp7 -lmass -ltcmalloc -lstdc++ -lpthread 481.wrf: -ffast-math -O3 -mcpu=power7 -mtune=power7 -mrecip=rsqrt -fpeel-loops -funroll-loops -mpopcntd -m64 -mcmodel=medium -fvect-cost-model -mveclibabi=mass -Wl,-q -Wl,-Map=link.map,--cref -L /opt/at6.0/lib64 -L /opt/ibmcmp/xlmass/7.1/lib64 -Wl,-rpath,/opt/at6.0/lib64 -Wl,-rpath,/opt/ibmcmp/xlmass/7.1/lib64 -Wl,-dynamic-linker,/opt/at6.0/lib64/ld64.so.1 -lhugetlbfs -lmassvp7_64 -lmass_simdp7_64 -lmass_64 The flags file that was used to format this result can be browsed at http://www.spec.org/cpu2006/flags/IBM-Linux-AT.20130813.html You can also download the XML flags source by saving the following link: http://www.spec.org/cpu2006/flags/IBM-Linux-AT.20130813.xml SPEC and SPECfp are registered trademarks of the Standard Performance Evaluation Corporation. All other brand and product names appearing in this result are trademarks or registered trademarks of their respective holders. ----------------------------------------------------------------------------- For questions about this result, please contact the tester. For other inquiries, please contact webmaster@spec.org. Copyright 2006-2014 Standard Performance Evaluation Corporation Tested with SPEC CPU2006 v1.2. Report generated on Thu Jul 24 16:16:33 2014 by CPU2006 ASCII formatter v6932. Originally published on 13 August 2013.