                           SPEC(R) MPIM2007 Summary
                               IBM Corporation
                IBM BladeCenter JS22 Express (4 GHz, 4x4 core)
                           Sat Oct 25 23:04:17 2008

MPI2007 License: 0005                                    Test date: Oct-2008
Test sponsor: IBM Corporation                Hardware availability: Nov-2008
Tested by:    IBM Corporation                Software availability: Nov-2008

                 Base       Base       Base      Peak       Peak       Peak
Benchmarks      Ranks   Run Time      Ratio     Ranks   Run Time      Ratio
-------------- ------  ---------  ---------    ------  ---------  ---------
104.milc           32        860       1.82 S      32        860       1.82 S
104.milc           32        871       1.80 S      32        871       1.80 S
104.milc           32        869       1.80 *      32        869       1.80 *
107.leslie3d       32       1681       3.11 S      32       1692       3.08 *
107.leslie3d       32       1719       3.04 *      32       1694       3.08 S
107.leslie3d       32       1728       3.02 S      32       1686       3.10 S
113.GemsFDTD       32       1447       4.36 *      32       1447       4.36 *
113.GemsFDTD       32       1445       4.36 S      32       1445       4.36 S
113.GemsFDTD       32       1451       4.35 S      32       1451       4.35 S
115.fds4           32        873       2.23 *      32        863       2.26 S
115.fds4           32        898       2.17 S      32        859       2.27 S
115.fds4           32        873       2.24 S      32        863       2.26 *
121.pop2           32       1440       2.87 *      32       1440       2.87 *
121.pop2           32       1436       2.88 S      32       1436       2.88 S
121.pop2           32       1441       2.86 S      32       1441       2.86 S
122.tachyon        32       2056       1.36 *      32       2015       1.39 S
122.tachyon        32       2059       1.36 S      32       2022       1.38 S
122.tachyon        32       2055       1.36 S      32       2016       1.39 *
126.lammps         32       1183       2.46 S      32       1183       2.46 S
126.lammps         32       1209       2.41 S      32       1209       2.41 S
126.lammps         32       1194       2.44 *      32       1194       2.44 *
127.wrf2           32       2675       2.91 *      32       1774       4.39 *
127.wrf2           32       2670       2.92 S      32       1786       4.36 S
127.wrf2           32       2677       2.91 S      32       1771       4.40 S
128.GAPgeofem      32        667       3.10 *      32        667       3.10 *
128.GAPgeofem      32        668       3.09 S      32        668       3.09 S
128.GAPgeofem      32        667       3.10 S      32        667       3.10 S
129.tera_tf        32       2077       1.33 S      32       1510       1.83 *
129.tera_tf        32       2077       1.33 *      32       1511       1.83 S
129.tera_tf        32       2076       1.33 S      32       1501       1.84 S
130.socorro        32       1165       3.28 S      32        451       8.46 *
130.socorro        32       1166       3.27 S      32        447       8.53 S
130.socorro        32       1166       3.27 *      32        456       8.37 S
132.zeusmp2        32       1273       2.44 S      32       1273       2.44 S
132.zeusmp2        32       1325       2.34 S      32       1325       2.34 S
132.zeusmp2        32       1288       2.41 *      32       1288       2.41 *
137.lu             32       1607       2.29 S      32       1607       2.29 S
137.lu             32       1600       2.30 S      32       1600       2.30 S
137.lu             32       1604       2.29 *      32       1604       2.29 *
==============================================================================
104.milc           32        869       1.80 *      32        869       1.80 *
107.leslie3d       32       1719       3.04 *      32       1692       3.08 *
113.GemsFDTD       32       1447       4.36 *      32       1447       4.36 *
115.fds4           32        873       2.23 *      32        863       2.26 *
121.pop2           32       1440       2.87 *      32       1440       2.87 *
122.tachyon        32       2056       1.36 *      32       2016       1.39 *
126.lammps         32       1194       2.44 *      32       1194       2.44 *
127.wrf2           32       2675       2.91 *      32       1774       4.39 *
128.GAPgeofem      32        667       3.10 *      32        667       3.10 *
129.tera_tf        32       2077       1.33 *      32       1510       1.83 *
130.socorro        32       1166       3.27 *      32        451       8.46 *
132.zeusmp2        32       1288       2.41 *      32       1288       2.41 *
137.lu             32       1604       2.29 *      32       1604       2.29 *
SPECmpiM_base2007                      2.44
SPECmpiM_peak2007                                                      2.79


BENCHMARK DETAILS
-----------------
Type of System:       Heterogeneous
Total Compute Nodes:  4
Total Chips:          8
Total Cores:          16
Total Threads:        32
Total Memory:         80 GB
Base Ranks Run:       32
Minimum Peak Ranks:   32
Maximum Peak Ranks:   32

C Compiler:           IBM XL C/C++ Enterprise Edition V9 for AIX
                      Updated with the September 2008 Fix level
C++ Compiler:         IBM XL C/C++ Enterprise Edition V9 for AIX
                      Updated with the September 2008 Fix level
Fortran Compiler:     IBM XL Fortran Enterprise Edition V11.1 for AIX
                      Updated with the September 2008 Fix level
Base Pointers:        32-bit
Peak Pointers:        32/64-bit
MPI Library:          IBM Parallel Environment for AIX, Version 5 Release 1
Other MPI Info:       None
Pre-processors:       None
Other Software:       IBM Engineering and Scientific Subroutine Library (ESSL)
                      for AIX Version 4 Release 3
                      Updated with PTF Set 3


Node Description: IBM System JS22
=================================

HARDWARE
--------
Number of nodes:      1
Uses of the node:     compute, head, fileserver
Vendor:               IBM Corporation
Model:                IBM System JS22
CPU Name:             POWER6
CPU(s) orderable:     4 cores per blade
Chips enabled:        2
Cores enabled:        4
Cores per chip:       2
Threads per core:     2
CPU Characteristics:
CPU MHz:              4000
Primary Cache:        64 KB I + 64 KB D on chip per core
Secondary Cache:      4 MB I+D on chip per core
L3 Cache:             None
Other Cache:          None
Memory:               32 GB (4x8 GB) DDR2 500 MHz
Disk Subsystem:       1x146 GB SAS 15K RPM
Other Hardware:       BladeCenter-H chassis
                      Voltaire 4X InfiniBand Pass-thru Module (P/N 43W4419)
Adapter:              4X InfiniBand DDR Expansion Card (CFFh) for IBM
                      BladeCenter (P/N 43W4423)
Number of Adapters:   1
Slot Type:            PCIe x8 Gen2
Data Rate:            4x DDR 20Gbps
Ports Used:           1
Interconnect Type:    InfiniBand

SOFTWARE
--------
Adapter:              4X InfiniBand DDR Expansion Card (CFFh) for IBM
                      BladeCenter (P/N 43W4423)
Adapter Driver:       devices.pciex.b3157862.rte 6.1.2.0
Adapter Firmware:     2.3.0
Operating System:     IBM AIX V6.1 with the 6100-02 Technology Level
Local File System:    AIX/JFS2
Shared File System:   NFSv3
System State:         Multi-user
Other Software:       None

General Notes
-------------
Blade[1] runs the following commands to compose the cluster:
    mkdev -c management -s infiniband -t icm
    /usr/sbin/mkiba -a 192.1.10.1 -m 255.255.255.0 -i ib0 -A iba0 -p 1 -P 0xFFFF -M 65532 -q 4000 -k off -Q 0x1E -S up
    startsrc -s ctcas
    preprpnode mpiblade1
    mkrpdomain mpiblades mpiblade1 mpiblade2 mpiblade3 mpiblade4
    startrpdomain mpiblades
    cd /usr/lpp/ppe.poe/samples/nrt
    make
    chmod 4755 nrt_api
    shutdown -rF
    su spec
    cd mpiblades.64ranks.load
    ../nrt_api -l


Node Description: IBM System JS22
=================================

HARDWARE
--------
Number of nodes:      3
Uses of the node:     compute
Vendor:               IBM Corporation
Model:                IBM System JS22
CPU Name:             POWER6
CPU(s) orderable:     4 cores per blade
Chips enabled:        2
Cores enabled:        4
Cores per chip:       2
Threads per core:     2
CPU Characteristics:
CPU MHz:              4000
Primary Cache:        64 KB I + 64 KB D on chip per core
Secondary Cache:      4 MB I+D on chip per core
L3 Cache:             None
Other Cache:          None
Memory:               16 GB (4x4 GB) DDR2 667 MHz
Disk Subsystem:       1x146 GB SAS 15K RPM
Other Hardware:       BladeCenter-H chassis
                      Voltaire 4X InfiniBand Pass-thru Module (P/N 43W4419)
Adapter:              4X InfiniBand DDR Expansion Card (CFFh) for IBM
                      BladeCenter (P/N 43W4423)
Number of Adapters:   1
Slot Type:            PCIe x8 Gen2
Data Rate:            4x DDR 20Gbps
Ports Used:           1
Interconnect Type:    InfiniBand

SOFTWARE
--------
Adapter:              4X InfiniBand DDR Expansion Card (CFFh) for IBM
                      BladeCenter (P/N 43W4423)
Adapter Driver:       devices.pciex.b3157862.rte 6.1.2.0
Adapter Firmware:     2.3.0
Operating System:     IBM AIX V6.1 with the 6100-02 Technology Level
Local File System:    AIX/JFS2
Shared File System:   NFSv3
System State:         Multi-user
Other Software:       None

General Notes
-------------
Each blade runs the following commands to compose the cluster,
where $CLUSTER_INDEX is 2-4 for Blade[2]-Blade[4]:
    mkdev -c management -s infiniband -t icm
    /usr/sbin/mkiba -a 192.1.10.$CLUSTER_INDEX -m 255.255.255.0 -i ib0 -A iba0 -p 1 -P 0xFFFF -M 65532 -q 4000 -k off -Q 0x1E -S up
    startsrc -s ctcas
    preprpnode mpiblade1
    cd /usr/lpp/ppe.poe/samples/nrt
    make
    chmod 4755 nrt_api
    shutdown -rF
    su spec
    cd mpiblades.64ranks.load
    ../nrt_api -l


Interconnect Description: InfiniBand
====================================

HARDWARE
--------
Vendor:               IBM Corporation
Model:                4x DDR InfiniBand
Switch Model:         QLogic SilverStorm 9024
Number of Switches:   1
Number of Ports:      24
Data Rate:            4x DDR 20Gbps
Firmware:             4.2.1.1.1
Topology:             single switch
Primary Use:          MPI Communication


Interconnect Description: Ethernet
==================================

HARDWARE
--------
Vendor:               IBM Corporation
Model:                4-port Gigabit Ethernet
Switch Model:         IBM BladeCenter 4-port Gigabit Ethernet switch module
                      (P/N 26K6483)
Number of Switches:   1
Number of Ports:      18
Data Rate:            1Gbps
Firmware:             1.08
Topology:             single switch
Primary Use:          File system


Compiler Invocation Notes
-------------------------
Blade[1], with 32GB of memory and 32GB of paging space,
was used to compile the benchmarks.

Submit Notes
------------
The config file option 'submit' was used.
    submit = poe task_stride.2level.32+64rank 4 2 8 $ranks $command -procs $ranks -hostfile /spec/MapFiles/ib0hosts.8x.1-8

General Notes
-------------
Environment settings:
    All ulimits set to unlimited
    ranks = 32
    CWD = /spec/mpi2007
    MEMORY_AFFINITY = MCM
    XLFRTEOPTS = intrinthds=1
    MP_PGMMODEL = spmd
    MP_MSG_API = mpi
    MP_DEVTYPE = ib
    MP_CLOCK_SOURCE = AIX
    MP_STDINMODE = none
    MP_SHARED_MEMORY = yes
    MP_SINGLE_THREAD = yes
    MP_EUILIB = us
    NRT_WINDOW_COUNT = 1
    MP_RESD = no
    MP_PULSE = 0
    ADAPTER_USE = shared
    EUIDEVICE = sn_single
    MP_CSS_INTERRUPT = no
    MP_BUFFER_MEM = 67108864
    MP_USE_BULK_XFER = yes
    MP_BULK_MIN_MSG_SIZE = 8192
    MP_EAGER_LIMIT = 65536
    MP_WAIT_MODE = yield
    MP_INFOLEVEL = 0
    MP_LABELIO = no
    MP_STDOUTMODE = unordered
    MP_PMDLOG = no
    NRT_JOB_KEY = 64


Compiler Invocation
-------------------
C benchmarks:
    /usr/bin/mpcc_r

C++ benchmarks:
    126.lammps: /usr/bin/mpCC_r

Fortran benchmarks:
    /usr/bin/mpxlf95_r

Benchmarks using both Fortran and C:
    /usr/bin/mpcc_r /usr/bin/mpxlf95_r


Portability Flags
-----------------
107.leslie3d:  -qfixed
115.fds4:      -DSPEC_MPI_LC_NO_TRAILING_UNDERSCORE -qfixed
121.pop2:      -DSPEC_MPI_AIX
127.wrf2:      -DNOUNDERSCORE -DSPEC_MPI_AIX
130.socorro:   -DSPEC_NO_UNDERSCORE -qcpluscmt
132.zeusmp2:   -qfixed -DSPEC_SINGLE_UNDERSCORE
137.lu:        -qfixed


Base Optimization Flags
-----------------------
C benchmarks:
    -bmaxdata:0x80000000 -O5 -D_ILS_MACROS -bdatapsize:64K -bstackpsize:64K
    -btextpsize:64K

C++ benchmarks:
    126.lammps: -bmaxdata:0x80000000 -O5

Fortran benchmarks:
    -bmaxdata:0x80000000 -O4 -qstrict -qalias=nostd -qhot=level=0 -qsave
    -bdatapsize:64K -bstackpsize:64K -btextpsize:64K

Benchmarks using both Fortran and C:
    -bmaxdata:0x80000000 -O5 -D_ILS_MACROS -bdatapsize:64K -bstackpsize:64K
    -btextpsize:64K -O4 -qstrict -qalias=nostd -qhot=level=0 -qsave


Peak Optimization Flags
-----------------------
C benchmarks:
    104.milc: basepeak = yes
    122.tachyon: -O5 -lessl -D_ILS_MACROS -bdatapsize:64K -bstackpsize:64K
                 -btextpsize:64K -q64

C++ benchmarks:
    126.lammps: basepeak = yes

Fortran benchmarks:
    107.leslie3d: -O5 -bdatapsize:64K -bstackpsize:64K -btextpsize:64K
                  -bmaxdata:0x70000000
    113.GemsFDTD: basepeak = yes
    129.tera_tf: -O5 -qessl -lessl -bdatapsize:64K -bstackpsize:64K
                 -btextpsize:64K
    137.lu: basepeak = yes

Benchmarks using both Fortran and C:
    115.fds4: -O5 -lessl -D_ILS_MACROS -bdatapsize:64K -bstackpsize:64K
              -btextpsize:64K -qstrict -qalias=nostd -qhot=level=0 -qsave -q64
    121.pop2: basepeak = yes
    127.wrf2: -O5 -bmaxdata:0x80000000
    128.GAPgeofem: basepeak = yes
    130.socorro: -O5 -lessl -D_ILS_MACROS -bdatapsize:64K -bstackpsize:64K
                 -btextpsize:64K -qessl -bmaxdata:0x80000000
    132.zeusmp2: basepeak = yes


Other Flags
-----------
C benchmarks:
    -w -qsuppress=1500-036 -qipa=noobject -qipa=threads

C++ benchmarks:
    126.lammps: -w -qsuppress=1500-036 -qipa=noobject -qipa=threads

Fortran benchmarks:
    -w -qsuppress=1500-036 -qsuppress=cmpmsg -qspillsize=32648

Benchmarks using both Fortran and C:
    -w -qsuppress=1500-036 -qipa=noobject -qipa=threads -qsuppress=cmpmsg
    -qspillsize=32648


The flags files that were used to format this result can be browsed at
http://www.spec.org/mpi2007/flags/MPI2007_flags.20081105.html
http://www.spec.org/mpi2007/flags/IBM-XL.html
http://www.spec.org/mpi2007/flags/IBM-AIX.html

You can also download the XML flags sources by saving the following links:
http://www.spec.org/mpi2007/flags/MPI2007_flags.20081105.xml
http://www.spec.org/mpi2007/flags/IBM-XL.xml
http://www.spec.org/mpi2007/flags/IBM-AIX.xml


SPEC and SPEC MPI are registered trademarks of the Standard Performance
Evaluation Corporation. All other brand and product names appearing in
this result are trademarks or registered trademarks of their respective
holders.
-----------------------------------------------------------------------------
For questions about this result, please contact the tester.
For other inquiries, please contact webmaster@spec.org.
Copyright 2006-2010 Standard Performance Evaluation Corporation
Tested with SPEC MPI2007 v1.1.
Report generated on Tue Jul 22 13:34:49 2014 by MPI2007 ASCII formatter v1463.
Originally published on 5 November 2008.
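
The overall SPECmpiM_base2007 and SPECmpiM_peak2007 figures in the results table can be cross-checked from the per-run ratios. The sketch below takes the median of the three ratios for each benchmark and then the geometric mean of those medians. The names RATIOS, geometric_mean and overall are illustrative only and are not part of the SPEC tools; the inputs are the two-decimal ratios printed in the table, so the result is assumed to match the published figures only after rounding.

```python
import math
from statistics import median

# (base ratios, peak ratios) for the three runs of each benchmark,
# copied from the results table. These are rounded, as printed.
RATIOS = {
    "104.milc":      ([1.82, 1.80, 1.80], [1.82, 1.80, 1.80]),
    "107.leslie3d":  ([3.11, 3.04, 3.02], [3.08, 3.08, 3.10]),
    "113.GemsFDTD":  ([4.36, 4.36, 4.35], [4.36, 4.36, 4.35]),
    "115.fds4":      ([2.23, 2.17, 2.24], [2.26, 2.27, 2.26]),
    "121.pop2":      ([2.87, 2.88, 2.86], [2.87, 2.88, 2.86]),
    "122.tachyon":   ([1.36, 1.36, 1.36], [1.39, 1.38, 1.39]),
    "126.lammps":    ([2.46, 2.41, 2.44], [2.46, 2.41, 2.44]),
    "127.wrf2":      ([2.91, 2.92, 2.91], [4.39, 4.36, 4.40]),
    "128.GAPgeofem": ([3.10, 3.09, 3.10], [3.10, 3.09, 3.10]),
    "129.tera_tf":   ([1.33, 1.33, 1.33], [1.83, 1.83, 1.84]),
    "130.socorro":   ([3.28, 3.27, 3.27], [8.46, 8.53, 8.37]),
    "132.zeusmp2":   ([2.44, 2.34, 2.41], [2.44, 2.34, 2.41]),
    "137.lu":        ([2.29, 2.30, 2.29], [2.29, 2.30, 2.29]),
}


def geometric_mean(values):
    """Geometric mean, computed through logarithms."""
    return math.exp(sum(math.log(v) for v in values) / len(values))


def overall(column):
    """column 0 = base, 1 = peak: per-benchmark median, then geometric mean."""
    medians = [median(runs[column]) for runs in RATIOS.values()]
    return geometric_mean(medians)


if __name__ == "__main__":
    print(f"base: {overall(0):.2f}")
    print(f"peak: {overall(1):.2f}")
```

Run as a script, this prints one line for base and one for peak, which can be compared with the 2.44 and 2.79 reported in the summary rows.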