| CPU2006 license: | 11 | Test date: | Aug-2011 |
|---|---|---|---|
| Test sponsor: | IBM Corporation | Hardware Availability: | Oct-2011 |
| Tested by: | IBM Corporation | Software Availability: | Jul-2011 |
| Hardware | |
|---|---|
| CPU Name: | POWER7 |
| CPU Characteristics: | Intelligent Energy Optimization enabled, up to 3.780 GHz |
| CPU MHz: | 3444 |
| FPU: | Integrated |
| CPU(s) enabled: | 96 cores, 16 chips, 6 cores/chip, 4 threads/core |
| CPU(s) orderable: | 24,48,72,96 cores |
| Primary Cache: | 32 KB I + 32 KB D on chip per core |
| Secondary Cache: | 256 KB I+D on chip per core |
| L3 Cache: | 4 MB I+D on chip per core |
| Other Cache: | None |
| Memory: | 1 TB (64 x 16 GB) DDR3 1066 MHz |
| Disk Subsystem: | 10 x 146.8 GB Raid0 SAS SFF 15K RPM |
| Other Hardware: | None |
| Software | |
|---|---|
| Operating System: | SUSE Linux Enterprise Server 11 SP1 (ppc64), Kernel 2.6.32.12-0.7-ppc64 |
| Compiler: | C/C++: Version 11.1 of IBM XL C/C++ for Linux; Fortran: Version 13.1 of IBM XL Fortran for Linux |
| Auto Parallel: | No |
| File System: | ext2 |
| System State: | Run level 3 (multi-user) |
| Base Pointers: | 32-bit |
| Peak Pointers: | 32/64-bit |
| Other Software: | -IBM Post-Link Optimization for Linux on POWER, version 5.6.0-4 -MicroQuill SmartHeap 9 -Apache C++ Standard Library V4.2.1 |
| Benchmark | Base | Peak | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Copies | Seconds | Ratio | Seconds | Ratio | Seconds | Ratio | Copies | Seconds | Ratio | Seconds | Ratio | Seconds | Ratio | |
| Results appear in the order in which they were run. Bold underlined text indicates a median measurement. | ||||||||||||||
| 410.bwaves | 384 | 1958 | 2670 | 1969 | 2650 | 1970 | 2650 | 384 | 1958 | 2670 | 1958 | 2670 | 1956 | 2670 |
| 416.gamess | 384 | 2534 | 2970 | 2538 | 2960 | 2544 | 2960 | 384 | 2381 | 3160 | 2383 | 3160 | 2382 | 3160 |
| 433.milc | 384 | 1259 | 2800 | 1240 | 2840 | 1245 | 2830 | 96 | 299 | 2950 | 296 | 2970 | 297 | 2960 |
| 434.zeusmp | 384 | 1303 | 2680 | 1301 | 2690 | 1304 | 2680 | 384 | 1100 | 3180 | 1098 | 3180 | 1100 | 3180 |
| 435.gromacs | 384 | 1135 | 2420 | 1137 | 2410 | 1129 | 2430 | 384 | 881 | 3110 | 880 | 3120 | 884 | 3100 |
| 436.cactusADM | 384 | 1245 | 3690 | 1247 | 3680 | 1244 | 3690 | 192 | 525 | 4370 | 523 | 4390 | 525 | 4370 |
| 437.leslie3d | 384 | 1912 | 1890 | 1911 | 1890 | 1920 | 1880 | 96 | 435 | 2070 | 435 | 2080 | 434 | 2080 |
| 444.namd | 384 | 757 | 4070 | 755 | 4080 | 754 | 4090 | 384 | 749 | 4110 | 750 | 4110 | 744 | 4140 |
| 447.dealII | 384 | 623 | 7050 | 633 | 6940 | 633 | 6940 | 384 | 632 | 6950 | 639 | 6880 | 632 | 6960 |
| 450.soplex | 384 | 1914 | 1670 | 1911 | 1680 | 1919 | 1670 | 384 | 1847 | 1730 | 1846 | 1740 | 1847 | 1730 |
| 453.povray | 384 | 597 | 3420 | 602 | 3390 | 597 | 3420 | 384 | 494 | 4140 | 490 | 4170 | 496 | 4120 |
| 454.calculix | 384 | 1115 | 2840 | 1126 | 2810 | 1119 | 2830 | 384 | 1109 | 2860 | 1097 | 2890 | 1090 | 2910 |
| 459.GemsFDTD | 384 | 3131 | 1300 | 3134 | 1300 | 3136 | 1300 | 384 | 3131 | 1300 | 3134 | 1300 | 3136 | 1300 |
| 465.tonto | 384 | 1461 | 2590 | 1460 | 2590 | 1458 | 2590 | 384 | 1185 | 3190 | 1187 | 3180 | 1182 | 3200 |
| 470.lbm | 384 | 1209 | 4360 | 1215 | 4340 | 1214 | 4350 | 384 | 1209 | 4360 | 1215 | 4340 | 1214 | 4350 |
| 481.wrf | 384 | 1474 | 2910 | 1494 | 2870 | 1495 | 2870 | 384 | 1441 | 2980 | 1444 | 2970 | 1443 | 2970 |
| 482.sphinx3 | 384 | 2658 | 2820 | 2689 | 2780 | 2683 | 2790 | 384 | 2647 | 2830 | 2646 | 2830 | 2639 | 2840 |
C/C++ compiler updated to July2011 PTF
Version 11.01.0000.0003
Fortran compiler updated to July2011 PTF
Version 13.01.0000.0003
IBM Post-Link optimization tool used for:
433.milc 435.gromacs 436.cactusADM 450.soplex 482.sphinx3
with options -O4 -nodp
444.namd
with options -O3 -lu -1 -nodp -sdp 9
465.tonto
with options -O4
470.lbm
with options -kr -O4 -sdp 9 -vrox -m power7
The config file option 'submit' was used to assign benchmark copy to specific kernel thread using the "numactl" command (see flags file for details).
ulimit -s (stack) set to 2097152
Large pages reserved as follows by root user:
echo 25728 > /proc/sys/vm/nr_hugepages
The following environment varibles were set before the runspec command:
export XLFRTEOPTS=intrinthds=1
export HUGETLB_VERBOSE=0
export HUGETLB_MORECORE=yes
export HUGETLB_ELFMAP=RW
447.dealII (peak): "apache_stdcxx_4_2_1" src.alt was used. 447.dealII (base): "apache_stdcxx_4_2_1" src.alt was used. The Apache C++ Standard Library V4.2.1 was installed from http://stdcxx.apache.org/download.html using: gmake BUILDTYPE=8d CONFIG=gcc.config IBM Post-Link optimization tool can be downloaded from http://www-304.ibm.com/webapp/set2/sas/f/lopdiags/sdkdownload.html
| xlc -qlanglvl=extc99 |
| xlC |
| xlf95 |
| xlc -qlanglvl=extc99 xlf95 |
| 410.bwaves: | -qfixed |
| 416.gamess: | -qfixed |
| 434.zeusmp: | -qfixed |
| 435.gromacs: | -qfixed -qextname |
| 436.cactusADM: | -qfixed -qextname |
| 437.leslie3d: | -qfixed |
| 454.calculix: | -qfixed -qextname |
| 481.wrf: | -DNOUNDERSCORE |
| 482.sphinx3: | -qchars=signed |
| -O5 -lhugetlbfs |
| -O4 -qrtti -qcpp_stdinc=/root/stdcxx421/include/ansi:/root/stdcxx421/include:/opt/ibmcmp/vacpp/11.1/include -lhugetlbfs -L/root/stdcxx421/lib -R/root/stdcxx421/lib -lstd8d |
| -O5 -qalias=nostd -lhugetlbfs |
| -O5 -qalias=nostd -lhugetlbfs |
| -qipa=noobject -qipa=threads |
| -qipa=noobject -qipa=threads |
| -qipa=noobject -qipa=threads |
| -qipa=noobject -qipa=threads |
| xlc -qlanglvl=extc99 |
| xlC |
| xlf95 |
| xlc -qlanglvl=extc99 xlf95 |
| 410.bwaves: | -qfixed |
| 416.gamess: | -qfixed |
| 434.zeusmp: | -qfixed |
| 435.gromacs: | -qfixed -qextname |
| 436.cactusADM: | -qfixed -qextname -DSPEC_CPU_LP64 |
| 437.leslie3d: | -qfixed |
| 453.povray: | -DSPEC_CPU_LP64 |
| 454.calculix: | -qfixed -qextname |
| 481.wrf: | -DNOUNDERSCORE |
| 482.sphinx3: | -qchars=signed |
| 433.milc: | -Wl,-q -O5 -lhugetlbfs |
| 470.lbm: | basepeak = yes |
| 482.sphinx3: | -Wl,-q -qpdf1(pass 1) -qpdf2(pass 2) -O4 -lhugetlbfs |
| 444.namd: | -Wl,-q -qpdf1(pass 1) -qpdf2(pass 2) -O5 -lhugetlbfs |
| 447.dealII: | -O4 -qrtti -qcpp_stdinc=/root/stdcxx421/include/ansi:/root/stdcxx421/include:/opt/ibmcmp/vacpp/11.1/include -lsmartheap -lhugetlbfs -L/root/stdcxx421/lib -R/root/stdcxx421/lib -lstd8d |
| 450.soplex: | -Wl,-q -O3 -qarch=auto -qtune=auto -lhugetlbfs |
| 453.povray: | -Wl,-q -qpdf1(pass 1) -qpdf2(pass 2) -O4 -qsimd -q64 -lsmartheap64 |
| 410.bwaves: | -qpdf1(pass 1) -qpdf2(pass 2) -O5 -q64 -lhugetlbfs |
| 416.gamess: | -qpdf1(pass 1) -qpdf2(pass 2) -O5 -qalias=nostd -lhugetlbfs |
| 434.zeusmp: | -O5 -qsmallstack=dynlenonheap -qalias=nostd -B/usr/share/libhugetlbfs/ -tl -Wl,--hugetlbfs-align |
| 437.leslie3d: | -O5 -lhugetlbfs |
| 459.GemsFDTD: | basepeak = yes |
| 465.tonto: | -Wl,-q -qpdf1(pass 1) -qpdf2(pass 2) -O5 -qsimd -lhugetlbfs |
| 435.gromacs: | -Wl,-q -qpdf1(pass 1) -qpdf2(pass 2) -O4 -qsimd -lhugetlbfs |
| 436.cactusADM: | -Wl,-q -O4 -q64 -qsimd -qnostrict -qsmallstack=dynlenonheap -qalias=nostd -lhugetlbfs |
| 454.calculix: | -qpdf1(pass 1) -qpdf2(pass 2) -O5 -lhugetlbfs |
| 481.wrf: | -O3 -qarch=auto -qtune=auto -q64 -lhugetlbfs |