SPEC(R) ACCEL(TM) ACC Summary IBM Corporation Tesla P100 IBM Power Systems S822LC for High Performance Computing (8335-GTB) Test Sponsor: NVIDIA Corporation Fri Sep 2 21:49:58 2016 ACCEL License: 019 Test date: Sep-2016 Test sponsor: NVIDIA Corporation Hardware availability: Sep-2016 Tested by: IBM Corporation Software availability: Sep-2016 Base Base Base Peak Peak Peak Benchmarks Ref. Run Time Ratio Ref. Run Time Ratio -------------- ------ --------- --------- ------ --------- --------- 303.ostencil 145 19.5 7.44 S 303.ostencil 145 19.5 7.42 S 303.ostencil 145 19.5 7.43 * 304.olbm 455 42.1 10.8 S 304.olbm 455 42.1 10.8 * 304.olbm 455 42.0 10.8 S 314.omriq 956 108 8.87 * 314.omriq 956 108 8.88 S 314.omriq 956 108 8.87 S 350.md 252 19.7 12.8 S 350.md 252 19.7 12.8 S 350.md 252 19.7 12.8 * 351.palm 370 177 2.09 S 351.palm 370 172 2.15 S 351.palm 370 176 2.10 * 352.ep 530 64.7 8.20 * 352.ep 530 64.6 8.20 S 352.ep 530 64.7 8.19 S 353.clvrleaf 445 65.2 6.83 S 353.clvrleaf 445 64.9 6.85 * 353.clvrleaf 445 64.9 6.85 S 354.cg 408 56.4 7.23 S 354.cg 408 56.4 7.23 * 354.cg 408 56.4 7.24 S 355.seismic 370 46.7 7.92 * 355.seismic 370 46.7 7.92 S 355.seismic 370 46.7 7.93 S 356.sp 276 80.0 3.45 S 356.sp 276 80.3 3.44 * 356.sp 276 80.5 3.43 S 357.csp 270 30.7 8.79 * 357.csp 270 30.8 8.77 S 357.csp 270 30.7 8.81 S 359.miniGhost 369 67.3 5.48 S 359.miniGhost 369 67.4 5.48 * 359.miniGhost 369 67.8 5.44 S 360.ilbdc 367 40.9 8.97 * 360.ilbdc 367 40.9 8.97 S 360.ilbdc 367 41.0 8.95 S 363.swim 230 56.4 4.08 S 363.swim 230 55.7 4.13 S 363.swim 230 56.0 4.10 * 370.bt 223 12.7 17.6 * 370.bt 223 12.5 17.8 S 370.bt 223 12.7 17.6 S ============================================================================== 303.ostencil 145 19.5 7.43 * 304.olbm 455 42.1 10.8 * 314.omriq 956 108 8.87 * 350.md 252 19.7 12.8 * 351.palm 370 176 2.10 * 352.ep 530 64.7 8.20 * 353.clvrleaf 445 64.9 6.85 * 354.cg 408 56.4 7.23 * 355.seismic 370 46.7 7.92 * 356.sp 276 80.3 3.44 * 357.csp 270 30.7 8.79 * 359.miniGhost 369 67.4 5.48 * 360.ilbdc 367 40.9 8.97 * 363.swim 230 56.0 4.10 * 370.bt 223 12.7 17.6 * SPECaccel_acc_base 7.16 SPECaccel_acc_peak Not Run HARDWARE -------- CPU Name: POWER8 with NVLink CPU Characteristics: CPU MHz: 3259 CPU MHz Maximum: 3857 FPU: Integrated CPU(s) enabled: 16 cores, 2 chips, 8 cores/chip, 8 threads/core CPU(s) orderable: 2 chips Primary Cache: 32 KB I + 64 KB D on chip per core Secondary Cache: 512 KB I+D on chip per core L3 Cache: 8 MB I+D on chip per chip Other Cache: 16 MB I+D off chip per 4 DIMMs Memory: 512 GB (16 x 32 GB RDIMMs) DDR4 1600 MHz Disk Subsystem: 2x 1TB SATA 6.0Gb/s 7200 RPM Other Hardware: No ACCELERATOR ----------- Accel Model Name: Tesla P100 Accel Vendor: NVIDIA Accel Name: Tesla P100 Type of Accel: GPU Accel Connection: NVLink Does Accel Use ECC: Yes Accel Description: See Notes Accel Driver: NVIDIA UNIX ppc64le Kernel Module 361.85 SOFTWARE -------- Operating System: Ubuntu 16.04.1 LTS 4.4.0-34-generic Compiler: PGI Accelerator Fortran/C/C++ Server, Release 16.9 File System: ext4 System State: Run level 5 (multi-user) Other Software: None Submit Notes ------------ The config file option 'submit' was used. Platform Notes -------------- Sysinfo program /home/user/SPECACCEL/Docs/sysinfo $Rev: 6874 $ $Date:: 2013-11-20 #$ 0953404ef7e75a5f9bbb534c6de3f831 running on gar1 Fri Sep 2 20:50:00 2016 This section contains SUT (System Under Test) info as seen by some common utilities. To remove or add to this section, see: http://www.spec.org/accel/Docs/config.html#sysinfo From /proc/cpuinfo clock : 2061.000000MHz clock : 2094.000000MHz clock : 2128.000000MHz clock : 2194.000000MHz clock : 2360.000000MHz clock : 2527.000000MHz clock : 4023.000000MHz machine : PowerNV 8335-GTB model : 8335-GTB platform : PowerNV revision : 1.0 (pvr 004c 0100) cpu : POWER8NVL (raw), altivec supported * * 0 "physical id" tags found. Perhaps this is an older system, * or a virtualized system. Not attempting to guess how to * count chips/cores for this system. * 128 "processors" cores, siblings (Caution: counting these is hw and system dependent. The following excerpts from /proc/cpuinfo might not be reliable. Use with caution.) From /proc/meminfo MemTotal: 535690880 kB HugePages_Total: 0 Hugepagesize: 16384 kB /usr/bin/lsb_release -d Ubuntu 16.04.1 LTS From /etc/*release* /etc/*version* debian_version: stretch/sid os-release: NAME="Ubuntu" VERSION="16.04.1 LTS (Xenial Xerus)" ID=ubuntu ID_LIKE=debian PRETTY_NAME="Ubuntu 16.04.1 LTS" VERSION_ID="16.04" HOME_URL="http://www.ubuntu.com/" SUPPORT_URL="http://help.ubuntu.com/" uname -a: Linux gar1 4.4.0-34-generic #53-Ubuntu SMP Wed Jul 27 16:04:07 UTC 2016 ppc64le ppc64le ppc64le GNU/Linux run-level 5 Sep 2 16:36 SPEC is set to: /home/user/SPECACCEL Filesystem Type Size Used Avail Use% Mounted on /dev/mapper/g82L--vg-root ext4 788G 231G 517G 31% / (End of data from sysinfo program) Information from pgaccelinfo CUDA Driver Version: 8000 NVRM version: NVIDIA UNIX ppc64le Kernel Module 361.85 Device Number: 0 Device Name: Tesla P100-SXM2-16GB Device Revision Number: 6.0 Global Memory Size: 17071669248 Number of Multiprocessors: 56 Concurrent Copy and Execution: Yes Total Constant Memory: 65536 Total Shared Memory per Block: 49152 Registers per Block: 65536 Warp Size: 32 Maximum Threads per Block: 1024 Maximum Block Dimensions: 1024, 1024, 64 Maximum Grid Dimensions: 2147483647 x 65535 x 65535 Maximum Memory Pitch: 2147483647B Texture Alignment: 512B Clock Rate: 1480 MHz Execution Timeout: No Integrated Device: No Can Map Host Memory: Yes Compute Mode: default Concurrent Kernels: Yes ECC Enabled: Yes Memory Clock Rate: 715 MHz Memory Bus Width: 4096 bits L2 Cache Size: 4194304 bytes Max Threads Per SMP: 2048 Async Engines: 3 Unified Addressing: Yes Managed Memory: Yes PGI Compiler Option: -ta=tesla:cc60 Base Compiler Invocation ------------------------ C benchmarks: pgcc Fortran benchmarks: pgfortran Benchmarks using both Fortran and C: pgcc pgfortran Base Optimization Flags ----------------------- C benchmarks: -fast -acc -ta=tesla:cc60 -ta=tesla:managed Fortran benchmarks: -fast -acc -ta=tesla:cc60 -ta=tesla:managed Benchmarks using both Fortran and C: 353.clvrleaf: -fast -acc -ta=tesla:cc60 -ta=tesla:managed 359.miniGhost: -fast -acc -ta=tesla:cc60 -ta=tesla:managed -Mnomain The flags file that was used to format this result can be browsed at http://www.spec.org/accel/flags/pgi2016_flags.html You can also download the XML flags source by saving the following link: http://www.spec.org/accel/flags/pgi2016_flags.xml SPEC and SPEC ACCEL are registered trademarks or trademarks of the Standard Performance Evaluation Corporation. All other brand and product names appearing in this result are trademarks or registered trademarks of their respective holders. --------------------------------------------------------------------------------- For questions about this result, please contact the tester. For other inquiries, please contact webmaster@spec.org. Copyright 2014-2016 Standard Performance Evaluation Corporation Tested with SPEC ACCEL v1.1. Report generated on Wed Sep 28 11:29:51 2016 by ACCEL ASCII formatter v1290. Originally published on 28 September 2016.