SPEC(R) MPIM2007 Summary IBM Corporation IBM Power 575 Sat Jun 28 06:18:17 2008 MPI2007 License: 0005 Test date: Jun-2008 Test sponsor: IBM Corporation Hardware availability: May-2008 Tested by: IBM Corporation Software availability: May-2008 Base Base Base Peak Peak Peak Benchmarks Ranks Run Time Ratio Ranks Run Time Ratio -------------- ------ --------- --------- ------ --------- --------- 104.milc 32 571 2.74 S 32 571 2.74 S 104.milc 32 571 2.74 * 32 571 2.74 * 104.milc 32 570 2.75 S 32 570 2.75 S 107.leslie3d 32 757 6.90 S 32 757 6.90 S 107.leslie3d 32 755 6.92 S 32 755 6.92 S 107.leslie3d 32 755 6.91 * 32 755 6.91 * 113.GemsFDTD 32 707 8.92 * 32 707 8.92 * 113.GemsFDTD 32 707 8.92 S 32 707 8.92 S 113.GemsFDTD 32 707 8.92 S 32 707 8.92 S 115.fds4 32 411 4.75 S 32 411 4.75 S 115.fds4 32 411 4.75 * 32 411 4.75 * 115.fds4 32 411 4.74 S 32 411 4.74 S 121.pop2 32 849 4.86 * 32 849 4.86 * 121.pop2 32 848 4.87 S 32 848 4.87 S 121.pop2 32 850 4.86 S 32 850 4.86 S 122.tachyon 32 1538 1.82 * 32 1538 1.82 * 122.tachyon 32 1537 1.82 S 32 1537 1.82 S 122.tachyon 32 1539 1.82 S 32 1539 1.82 S 126.lammps 32 755 3.86 S 32 755 3.86 S 126.lammps 32 755 3.86 S 32 755 3.86 S 126.lammps 32 755 3.86 * 32 755 3.86 * 127.wrf2 32 1317 5.92 S 32 1317 5.92 S 127.wrf2 32 1316 5.93 S 32 1316 5.93 S 127.wrf2 32 1317 5.92 * 32 1317 5.92 * 128.GAPgeofem 32 325 6.35 S 32 325 6.35 S 128.GAPgeofem 32 326 6.34 * 32 326 6.34 * 128.GAPgeofem 32 327 6.32 S 32 327 6.32 S 129.tera_tf 32 1225 2.26 S 32 1225 2.26 S 129.tera_tf 32 1224 2.26 * 32 1224 2.26 * 129.tera_tf 32 1224 2.26 S 32 1224 2.26 S 130.socorro 32 264 14.4 S 32 264 14.4 S 130.socorro 32 265 14.4 * 32 265 14.4 * 130.socorro 32 266 14.4 S 32 266 14.4 S 132.zeusmp2 32 941 3.30 S 32 941 3.30 S 132.zeusmp2 32 943 3.29 S 32 943 3.29 S 132.zeusmp2 32 943 3.29 * 32 943 3.29 * 137.lu 32 552 6.65 S 32 552 6.65 S 137.lu 32 553 6.65 * 32 553 6.65 * 137.lu 32 555 6.63 S 32 555 6.63 S ============================================================================== 104.milc 32 571 2.74 * 32 571 2.74 * 107.leslie3d 32 755 6.91 * 32 755 6.91 * 113.GemsFDTD 32 707 8.92 * 32 707 8.92 * 115.fds4 32 411 4.75 * 32 411 4.75 * 121.pop2 32 849 4.86 * 32 849 4.86 * 122.tachyon 32 1538 1.82 * 32 1538 1.82 * 126.lammps 32 755 3.86 * 32 755 3.86 * 127.wrf2 32 1317 5.92 * 32 1317 5.92 * 128.GAPgeofem 32 326 6.34 * 32 326 6.34 * 129.tera_tf 32 1224 2.26 * 32 1224 2.26 * 130.socorro 32 265 14.4 * 32 265 14.4 * 132.zeusmp2 32 943 3.29 * 32 943 3.29 * 137.lu 32 553 6.65 * 32 553 6.65 * SPECmpiM_base2007 4.81 SPECmpiM_peak2007 4.81 BENCHMARK DETAILS ----------------- Type of System: SMP Total Compute Nodes: 1 Total Chips: 16 Total Cores: 32 Total Threads: 32 Total Memory: 128 GB Base Ranks Run: 32 Minimum Peak Ranks: 32 Maximum Peak Ranks: 32 C Compiler: IBM XL C/C++ Enterprise Edition V9.0 Updated with the Oct2007 PTF C++ Compiler: IBM XL C/C++ Enterprise Edition V9.0 Updated with the Oct2007 PTF Fortran Compiler: IBM XL Fortran Enterprise Edition V11.1 Updated with the Oct2007 PTF Base Pointers: 64-bit Peak Pointers: 64-bit MPI Library: IBM Parallel Environment for AIX V4.3.2.2 Other MPI Info: -- Pre-processors: -- Other Software: None Node Description: IBM Power 575 =============================== HARDWARE -------- Number of nodes: 1 Uses of the node: compute, head, fileserver Vendor: IBM Corporation Model: IBM Power 575 CPU Name: POWER6 CPU(s) orderable: 32 cores Chips enabled: 16 Cores enabled: 32 Cores per chip: 2 Threads per core: 1 CPU Characteristics: CPU MHz: 4700 Primary Cache: 64 KB I + 64 KB D on chip per core Secondary Cache: 4 MB I+D on chip per core L3 Cache: 32 MB I+D off chip per chip Other Cache: None Memory: 128 GB (64x2 GB) DDR2 533 MHz Disk Subsystem: 1x146 GB SFF SAS, 10K RPM Other Hardware: None Adapter: 0 Number of Adapters: 0 Slot Type: 0 Data Rate: 0 Ports Used: 0 Interconnect Type: 0 SOFTWARE -------- Adapter: 0 Adapter Driver: 0 Adapter Firmware: -- Operating System: IBM AIX V5.3 with the 5300-08-02 Technology Level Local File System: AIX/JFS2 Shared File System: NFS over ethernet System State: Multi-user Other Software: APAR IZ26983 software update for InfiniBand adapter drivers IBM LoadLeveler for AIX V3.4.3.2 General Notes ------------- 113.GemsFDTD (base): Applied maxprocandstop src.alt 129.tera_tf (base): Applied fixbuffer src.alt 127.wrf2 (base): Applied fixcalling src.alt all ulimits set to unlimited "petaskbind.sh" script used to bind each task to a unique processor POE Environment variables set before executing benchmarks: CWD =/specmpi/mpi2007-1.0 MP_ADAPTER_USE =shared MP_EUILIB =us MP_EUIDEVICE =sn_all MP_SHARED_MEMORY =yes MP_SINGLE_THREAD =yes MP_WAIT_MODE =poll MP_EAGER_LIMIT =65536 MP_BUFFER_MEM =67108864 MP_POLLING_INTERVAL =80000000 MP_USE_BULK_XFER =yes MP_BULK_MIN_MSG_SIZE=65536 MP_STDINMODE =none MP_LABELIO =no MP_HOSTFILE =$CWD/r35.32-1node Other Environment variables MEMORY_AFFINITY =MCM LDR_CNTRL =DATAPSIZE=64K@TEXTPSIZE=64K@STACKPSIZE=64K XLFRTEOTPS =intrinthds=1 submit command uses petaskbind.sh script to bind logical processors to ranks poe $CWD/petaskbind.sh $command -procs $ranks The Gigabit ethernet switch is shared among many nodes, not just the cluster used in this benchmark. Base Compiler Invocation ------------------------ C benchmarks: /usr/bin/mpcc_r C++ benchmarks: 126.lammps: /usr/bin/mpCC_r Fortran benchmarks: /usr/bin/mpxlf95_r Benchmarks using both Fortran and C: /usr/bin/mpcc_r /usr/bin/mpxlf95_r Base Portability Flags ---------------------- 107.leslie3d: -qfixed 115.fds4: -DSPEC_MPI_LC_NO_TRAILING_UNDERSCORE -qfixed 121.pop2: -DSPEC_MPI_AIX 127.wrf2: -DNOUNDERSCORE -DSPEC_MPI_AIX 130.socorro: -DSPEC_NO_UNDERSCORE -qcpluscmt 132.zeusmp2: -qfixed -DSPEC_SINGLE_UNDERSCORE 137.lu: -qfixed Base Optimization Flags ----------------------- C benchmarks: -O4 -qarch=pwr6 -qtune=pwr6 -q64 C++ benchmarks: 126.lammps: -O4 -qarch=pwr6 -qtune=pwr6 -qstrict -q64 Fortran benchmarks: -O4 -qarch=pwr6 -qtune=pwr6 -qalias=nostd -q64 Benchmarks using both Fortran and C: -O4 -qarch=pwr6 -qtune=pwr6 -qalias=nostd -q64 Base Other Flags ---------------- C benchmarks: -w -qsuppress=1500-036 -qipa=noobject -qipa=threads C++ benchmarks: 126.lammps: -w -qsuppress=1500-036 -qipa=noobject -qipa=threads Fortran benchmarks: -w -qsuppress=1500-036 -qsuppress=cmpmsg -qipa=noobject -qipa=threads Benchmarks using both Fortran and C: -w -qsuppress=1500-036 -qsuppress=cmpmsg -qipa=noobject -qipa=threads Peak Optimization Flags ----------------------- C benchmarks: 104.milc: basepeak = yes 122.tachyon: basepeak = yes C++ benchmarks: 126.lammps: basepeak = yes Fortran benchmarks: 107.leslie3d: basepeak = yes 113.GemsFDTD: basepeak = yes 129.tera_tf: basepeak = yes 137.lu: basepeak = yes Benchmarks using both Fortran and C: 115.fds4: basepeak = yes 121.pop2: basepeak = yes 127.wrf2: basepeak = yes 128.GAPgeofem: basepeak = yes 130.socorro: basepeak = yes 132.zeusmp2: basepeak = yes The flags files that were used to format this result can be browsed at http://www.spec.org/mpi2007/flags/MPI2007_flags.20080828.html http://www.spec.org/mpi2007/flags/MPI2007_flags.0.20080828.html http://www.spec.org/mpi2007/flags/MPI2007_flags.1.html You can also download the XML flags sources by saving the following links: http://www.spec.org/mpi2007/flags/MPI2007_flags.20080828.xml http://www.spec.org/mpi2007/flags/MPI2007_flags.0.20080828.xml http://www.spec.org/mpi2007/flags/MPI2007_flags.1.xml SPEC and SPEC MPI are registered trademarks of the Standard Performance Evaluation Corporation. All other brand and product names appearing in this result are trademarks or registered trademarks of their respective holders. ----------------------------------------------------------------------------- For questions about this result, please contact the tester. For other inquiries, please contact webmaster@spec.org. Copyright 2006-2010 Standard Performance Evaluation Corporation Tested with SPEC MPI2007 v1.0. Report generated on Tue Jul 22 13:34:35 2014 by MPI2007 ASCII formatter v1463. Originally published on 27 August 2008.