SPEC(R) ACCEL(TM) OCL Summary
Supermicro NVIDIA Tesla K20m
Supermicro X9DRG-HF
Test Sponsor: HZDR
Thu Aug 24 07:13:28 2017

ACCEL License: 65A                      Test date:             Aug-2017
Test sponsor:  HZDR                     Hardware availability: Jan-2013
Tested by:     HZDR                     Software availability: Aug-2016

                 Base       Base       Base      Peak       Peak       Peak
Benchmarks       Ref.   Run Time      Ratio      Ref.   Run Time      Ratio
-------------- ------  ---------  ---------    ------  ---------  ---------
101.tpacf         107       80.3       1.33 *     107       81.1       1.32 S
101.tpacf         107       80.2       1.33 S     107       81.4       1.31 S
101.tpacf         107       80.4       1.33 S     107       81.3       1.32 *
103.stencil       125       69.1       1.81 *     125       69.1       1.81 *
103.stencil       125       69.1       1.81 S     125       69.1       1.81 S
103.stencil       125       69.3       1.80 S     125       69.3       1.80 S
104.lbm           112       54.6       2.05 S     112       43.1       2.60 *
104.lbm           112       54.1       2.07 S     112       43.0       2.61 S
104.lbm           112       54.4       2.06 *     112       43.2       2.59 S
110.fft           111       42.2       2.63 *     111       42.2       2.63 *
110.fft           111       41.9       2.65 S     111       41.9       2.65 S
110.fft           111       42.3       2.62 S     111       42.3       2.62 S
112.spmv          147       89.6       1.64 S     147       89.5       1.64 S
112.spmv          147       89.5       1.64 *     147       89.5       1.64 *
112.spmv          147       89.5       1.64 S     147       89.1       1.65 S
114.mriq          109       26.7       4.08 *     109       26.7       4.08 *
114.mriq          109       22.5       4.84 S     109       22.5       4.84 S
114.mriq          109       26.8       4.07 S     109       26.8       4.07 S
116.histo         114        100       1.14 *     114        100       1.14 *
116.histo         114        112       1.02 S     114        112       1.02 S
116.histo         114       99.7       1.14 S     114       99.7       1.14 S
117.bfs           117       70.3       1.66 S     117       54.8       2.13 *
117.bfs           117       71.3       1.64 *     117       55.1       2.12 S
117.bfs           117       71.3       1.64 S     117       54.3       2.16 S
118.cutcp          99       46.1       2.15 *      99       46.1       2.15 *
118.cutcp          99       46.3       2.14 S      99       46.3       2.14 S
118.cutcp          99       44.5       2.23 S      99       44.5       2.23 S
120.kmeans        100       93.1       1.07 *     100       92.7       1.08 *
120.kmeans        100       93.6       1.07 S     100       88.8       1.13 S
120.kmeans        100       93.1       1.07 S     100       93.0       1.08 S
121.lavamd        109       22.7       4.80 S     109       22.7       4.80 S
121.lavamd        109       23.4       4.65 S     109       23.4       4.65 S
121.lavamd        109       23.0       4.75 *     109       23.0       4.75 *
122.cfd           126       72.4       1.74 *     126       72.2       1.75 S
122.cfd           126       73.2       1.72 S     126       73.8       1.71 S
122.cfd           126       72.4       1.74 S     126       73.7       1.71 *
123.nw            115       84.4       1.36 *     115       84.4       1.36 *
123.nw            115       84.4       1.36 S     115       84.4       1.36 S
123.nw            115       84.5       1.36 S     115       84.5       1.36 S
124.hotspot       114       51.2       2.23 S     114       51.2       2.23 S
124.hotspot       114       51.0       2.24 S     114       51.0       2.24 S
124.hotspot       114       51.0       2.23 *     114       51.0       2.23 *
125.lud           119        118       1.01 S     119        105       1.13 *
125.lud           119        116       1.02 S     119        104       1.15 S
125.lud           119        116       1.02 *     119        105       1.13 S
126.ge            155       56.8       2.73 S     155       12.9       12.0 S
126.ge            155       56.4       2.75 S     155       12.9       12.1 S
126.ge            155       56.4       2.75 *     155       12.9       12.0 *
127.srad          114       78.6       1.45 *     114       78.6       1.45 *
127.srad          114       78.5       1.45 S     114       78.5       1.45 S
127.srad          114       78.6       1.45 S     114       78.6       1.45 S
128.heartwall     106        158      0.670 S     106        158      0.670 S
128.heartwall     106        158      0.670 *     106        158      0.670 *
128.heartwall     106        158      0.671 S     106        158      0.671 S
140.bplustree     108        117      0.923 S     108        117      0.923 S
140.bplustree     108        117      0.921 *     108        117      0.921 *
140.bplustree     108        117      0.921 S     108        117      0.921 S
==============================================================================
101.tpacf         107       80.3       1.33 *     107       81.3       1.32 *
103.stencil       125       69.1       1.81 *     125       69.1       1.81 *
104.lbm           112       54.4       2.06 *     112       43.1       2.60 *
110.fft           111       42.2       2.63 *     111       42.2       2.63 *
112.spmv          147       89.5       1.64 *     147       89.5       1.64 *
114.mriq          109       26.7       4.08 *     109       26.7       4.08 *
116.histo         114        100       1.14 *     114        100       1.14 *
117.bfs           117       71.3       1.64 *     117       54.8       2.13 *
118.cutcp          99       46.1       2.15 *      99       46.1       2.15 *
120.kmeans        100       93.1       1.07 *     100       92.7       1.08 *
121.lavamd        109       23.0       4.75 *     109       23.0       4.75 *
122.cfd           126       72.4       1.74 *     126       73.7       1.71 *
123.nw            115       84.4       1.36 *     115       84.4       1.36 *
124.hotspot       114       51.0       2.23 *     114       51.0       2.23 *
125.lud           119        116       1.02 *     119        105       1.13 *
126.ge            155       56.4       2.75 *     155       12.9       12.0 *
127.srad          114       78.6       1.45 *     114       78.6       1.45 *
128.heartwall     106        158      0.670 *     106        158      0.670 *
140.bplustree     108        117      0.921 *     108        117      0.921 *
SPECaccel_ocl_base                     1.70
SPECaccel_ocl_peak                                                     1.89

HARDWARE
--------
CPU Name:             Intel Xeon E5-2609
CPU Characteristics:  No TURBO
CPU MHz:              2400
CPU MHz Maximum:      2400
FPU:                  Integrated
CPU(s) enabled:       8 cores, 2 chips, 4 cores/chip
CPU(s) orderable:     1,2 chips
Primary Cache:        32 KB I + 32 KB D on chip per core
Secondary Cache:      256 KB I+D on chip per core
L3 Cache:             10 MB I+D on chip per chip
Other Cache:          None
Memory:               64 GB (8 x 8 GB 2Rx4 PC3-12800R-11, ECC,
                      running at 1066MHz)
Disk Subsystem:       60 GB INTEL SSDSC2CW060A3
Other Hardware:       None

ACCELERATOR
-----------
Accel Model Name:     Tesla K20m
Accel Vendor:         NVIDIA
Accel Name:           NVIDIA Tesla K20m
Type of Accel:        GPU
Accel Connection:     PCIe 2.0 16x
Does Accel Use ECC:   yes
Accel Description:    NVIDIA Tesla K20m, 2688 CUDA cores, 732 MHz
                      6 GB GDDR5 RAM (Kepler Generation)
Accel Driver:         NVIDIA UNIX x86_64 Kernel Module 367.48

SOFTWARE
--------
Operating System:     Ubuntu 14.04.5 LTS
                      Ubuntu 14.04.5 LTS 4.4.0-38-generic
Compiler:             GNU Compiler C/C++ Version 6.2.0
File System:          ext3
System State:         Run level 5 (user-level)
Other Software:       NVIDIA Cuda SDK 7.0, driver version 367.48

Platform Notes
--------------
Sysinfo program /tmp/spec/1.2/Docs/sysinfo
$Rev: 6965 $ $Date:: 2015-04-21 #$ c05a7f14b1b1765e3fe1df68447e8a35
running on kepler002 Thu Aug 24 13:13:30 2017

This section contains SUT (System Under Test) info as seen by
some common utilities. To remove or add to this section, see:
  http://www.spec.org/accel/Docs/config.html#sysinfo

From /proc/cpuinfo
   model name : Intel(R) Xeon(R) CPU E5-2609 0 @ 2.40GHz
      2 "physical id"s (chips)
      8 "processors"
   cores, siblings (Caution: counting these is hw and system dependent. The
   following excerpts from /proc/cpuinfo might not be reliable. Use with
   caution.)
      cpu cores  : 4
      siblings   : 4
      physical 0: cores 0 1 2 3
      physical 1: cores 0 1 2 3
   cache size : 10240 KB

From /proc/meminfo
   MemTotal:        65949360 kB
   HugePages_Total: 0
   Hugepagesize:    2048 kB

/usr/bin/lsb_release -d
   Ubuntu 14.04.5 LTS

From /etc/*release* /etc/*version*
   debian_version: jessie/sid
   os-release:
      NAME="Ubuntu"
      VERSION="14.04.5 LTS, Trusty Tahr"
      ID=ubuntu
      ID_LIKE=debian
      PRETTY_NAME="Ubuntu 14.04.5 LTS"
      VERSION_ID="14.04"
      HOME_URL="http://www.ubuntu.com/"
      SUPPORT_URL="http://help.ubuntu.com/"
   redhat-release: Red Hat Enterprise Linux Server release 6.5 (Santiago)
   rh-release: Red Hat Enterprise Linux Server release 7.2 (Maipo)

uname -a:
   Linux kepler002 4.4.0-38-generic #57~14.04.1-Ubuntu SMP Tue Sep 6 17:20:43
   UTC 2016 x86_64 x86_64 x86_64 GNU/Linux

run-level 5 Jan 23 15:07

SPEC is set to: /tmp/spec/1.2
   Filesystem  Type  Size  Used  Avail  Use%  Mounted on
   /dev/sda1   ext3  30G   14G   15G    47%   /

Cannot run dmidecode; consider saying 'chmod +s /usr/sbin/dmidecode'

(End of data from sysinfo program)

Base Runtime Environment
------------------------
C benchmarks:
   OpenCL Platform:  NVIDIA CUDA, OpenCL 1.2 CUDA 8.0.44
   OpenCL Device #0: Tesla K20m, v 367.48

C++ benchmarks:
   OpenCL Platform:  NVIDIA CUDA, OpenCL 1.2 CUDA 8.0.44
   OpenCL Device #0: Tesla K20m, v 367.48

Base Compiler Invocation
------------------------
C benchmarks:   gcc

C++ benchmarks: g++

Base Portability Flags
----------------------
116.histo: -DSPEC_LOCAL_MEMORY_HEADROOM=2
122.cfd:   -std=gnu++98

Base Optimization Flags
-----------------------
C benchmarks:
   -O2 -I/opt/pkg/devel/cuda/7.0/include -L/opt/pkg/devel/cuda/7.0/libb64 -lOpenCL

C++ benchmarks:
   -O2 -I/opt/pkg/devel/cuda/7.0/include -L/opt/pkg/devel/cuda/7.0/libb64 -lOpenCL

Peak Runtime Environment
------------------------
C benchmarks:
   OpenCL Platform:  NVIDIA CUDA, OpenCL 1.2 CUDA 8.0.44
   OpenCL Device #0: Tesla K20m, v 367.48

C++ benchmarks:
   OpenCL Platform:  NVIDIA CUDA, OpenCL 1.2 CUDA 8.0.44
   OpenCL Device #0: Tesla K20m, v 367.48

Peak Compiler Invocation
------------------------
C benchmarks:   gcc

C++ benchmarks: g++

Peak Portability Flags
----------------------
116.histo: -DSPEC_LOCAL_MEMORY_HEADROOM=2
122.cfd:   -std=gnu++98

Peak Optimization Flags
-----------------------
C benchmarks:

   110.fft:       basepeak = yes
   114.mriq:      basepeak = yes
   116.histo:     basepeak = yes
   117.bfs:       -O2 -DSPEC_ACCEL_WG_SIZE_0_0=64 -DSPEC_ACCEL_WG_SIZE_1_0=64
                  -I/opt/pkg/devel/cuda/7.0/include
                  -L/opt/pkg/devel/cuda/7.0/libb64 -lOpenCL
   118.cutcp:     basepeak = yes
   121.lavamd:    basepeak = yes
   124.hotspot:   basepeak = yes
   127.srad:      basepeak = yes
   128.heartwall: basepeak = yes
   140.bplustree: basepeak = yes

C++ benchmarks:

   101.tpacf:     -O2 -DSPEC_ACCEL_WG_SIZE_0_0=1024
                  -I/opt/pkg/devel/cuda/7.0/include
                  -L/opt/pkg/devel/cuda/7.0/libb64 -lOpenCL
   103.stencil:   basepeak = yes
   104.lbm:       -O2 -DSPEC_ACCEL_WG_SIZE_0_0=32 -DSPEC_ACCEL_WG_SIZE_0_1=1
                  -DSPEC_ACCEL_WG_SIZE_0_2=1
                  -I/opt/pkg/devel/cuda/7.0/include
                  -L/opt/pkg/devel/cuda/7.0/libb64 -lOpenCL
   112.spmv:      -O2 -DSPEC_ACCEL_WG_SIZE_0_0=96
                  -I/opt/pkg/devel/cuda/7.0/include
                  -L/opt/pkg/devel/cuda/7.0/libb64 -lOpenCL
   120.kmeans:    -O2 -DSPEC_ACCEL_WG_SIZE_0_0=288
                  -I/opt/pkg/devel/cuda/7.0/include
                  -L/opt/pkg/devel/cuda/7.0/libb64 -lOpenCL
   122.cfd:       -O2 -DSPEC_ACCEL_WG_SIZE_3_0=288
                  -I/opt/pkg/devel/cuda/7.0/include
                  -L/opt/pkg/devel/cuda/7.0/libb64 -lOpenCL
   123.nw:        basepeak = yes
   125.lud:       -O2 -DSPEC_ACCEL_WG_SIZE_0_0=32
                  -I/opt/pkg/devel/cuda/7.0/include
                  -L/opt/pkg/devel/cuda/7.0/libb64 -lOpenCL
   126.ge:        -O2 -DSPEC_ACCEL_WG_SIZE_0_0=512 -DSPEC_ACCEL_WG_SIZE_1_0=1
                  -DSPEC_ACCEL_WG_SIZE_1_1=512
                  -I/opt/pkg/devel/cuda/7.0/include
                  -L/opt/pkg/devel/cuda/7.0/libb64 -lOpenCL

The flags file that was used to format this result can be browsed at
https://www.spec.org/accel/flags/flags-advanced.20170929.html

You can also download the XML flags source by saving the following link:
https://www.spec.org/accel/flags/flags-advanced.20170929.xml

SPEC and SPEC ACCEL are registered trademarks or trademarks of the Standard Performance Evaluation
Corporation. All other brand and product names appearing in this result are trademarks or
registered trademarks of their respective holders.
----------------------------------------------------------------------------------
For questions about this result, please contact the tester.
For other inquiries, please contact webmaster@spec.org.
Copyright 2015-2017 Standard Performance Evaluation Corporation
Tested with SPEC ACCEL v1.2.
Report generated on Fri Sep 29 13:31:59 2017 by ACCEL ASCII formatter v1290.
Originally published on 28 September 2017.
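
Cross-check of the composite scores. This report does not state how SPECaccel_ocl_base and SPECaccel_ocl_peak are derived from the per-benchmark rows. The Python sketch below assumes the usual SPEC convention: each ratio is the reference time divided by the run time, and the composite is the geometric mean of the selected (starred) median ratios. The dictionary and function names are illustrative and are not part of the SPEC tools, and the inputs are the rounded ratios from the summary rows of the results table, so the result can only be compared at two decimal places.

```python
from math import exp, log

# Median (starred) ratios copied from the summary rows of the results table.
BASE_RATIOS = {
    "101.tpacf": 1.33, "103.stencil": 1.81, "104.lbm": 2.06, "110.fft": 2.63,
    "112.spmv": 1.64, "114.mriq": 4.08, "116.histo": 1.14, "117.bfs": 1.64,
    "118.cutcp": 2.15, "120.kmeans": 1.07, "121.lavamd": 4.75, "122.cfd": 1.74,
    "123.nw": 1.36, "124.hotspot": 2.23, "125.lud": 1.02, "126.ge": 2.75,
    "127.srad": 1.45, "128.heartwall": 0.670, "140.bplustree": 0.921,
}

# Peak differs from base only for the benchmarks built with peak-specific flags.
PEAK_RATIOS = dict(BASE_RATIOS)
PEAK_RATIOS.update({
    "101.tpacf": 1.32, "104.lbm": 2.60, "117.bfs": 2.13, "120.kmeans": 1.08,
    "122.cfd": 1.71, "125.lud": 1.13, "126.ge": 12.0,
})


def ratio(ref_seconds, run_seconds):
    """Per-run ratio, assumed to be reference time over measured run time."""
    return ref_seconds / run_seconds


def geometric_mean(values):
    """Geometric mean computed in log space to avoid a large running product."""
    values = list(values)
    return exp(sum(log(v) for v in values) / len(values))


base_score = geometric_mean(BASE_RATIOS.values())
peak_score = geometric_mean(PEAK_RATIOS.values())

if __name__ == "__main__":
    print(f"101.tpacf base ratio: {ratio(107, 80.3):.2f}")
    print(f"SPECaccel_ocl_base:   {base_score:.2f}")
    print(f"SPECaccel_ocl_peak:   {peak_score:.2f}")
```

Under these assumptions the script agrees with the published 1.70 (base) and 1.89 (peak) to two decimal places. Most of the gap between the two composites comes from 126.ge, whose median ratio rises from 2.75 to 12.0 with the peak work-group sizes.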