SPEC CPU®2017 Floating Point Rate Result

Copyright 2017-2022 Standard Performance Evaluation Corporation

Dell Inc.

PowerEdge R940 (Intel Xeon Platinum 8276L, 2.20
GHz)

SPECrate®2017_fp_base = 63400

SPECrate®2017_fp_peak = Not Run

CPU2017 License: 55 Test Date: Nov-2022
Test Sponsor: Dell Inc. Hardware Availability: Apr-2018
Tested by: Dell Inc. Software Availability: May-2022

Benchmark result graphs are available in the PDF report.

Hardware
CPU Name: Intel Xeon Platinum 8276L
  Max MHz: 4000
  Nominal: 2200
Enabled: 112 cores, 4 chips, 2 threads/core
Orderable: 1,2,4 chips
Cache L1: 32 KB I + 32 KB D on chip per core
  L2: 1 MB I+D on chip per core
  L3: 38.5 MB I+D on chip per chip
  Other: None
Memory: 768 GB (24 x 32 GB 2Rx8 PC4-3200AA-R, running
at 2933)
Storage: 125 GB on tmpfs
Other: None
Software
OS: Red Hat Enterprise Linux 8.6 (Ootpa)
4.18.0-372.9.1.el8.x86_64
Compiler: C/C++: Version 2022.1 of Intel oneAPI DPC++/C++
Compiler for Linux;
Fortran: Version 2022.1 of Intel Fortran Compiler
for Linux;
Parallel: No
Firmware: Version 2.15.1 released Jun-2022
File System: tmpfs
System State: Run level 3 (multi-user)
Base Pointers: 64-bit
Peak Pointers: Not Applicable
Other: jemalloc memory allocator V5.0.1
Power Management: BIOS and OS set to prefer performance
at the cost of additional power usage.

Results Table

Benchmark Base Peak
Copies Seconds Ratio Seconds Ratio Seconds Ratio Copies Seconds Ratio Seconds Ratio Seconds Ratio
SPECrate®2017_fp_base 63400
SPECrate®2017_fp_peak Not Run
Results appear in the order in which they were run. Bold underlined text indicates a median measurement.
503.bwaves_r 224 701 3200 703 3200
507.cactuBSSN_r 224 380 747 379 748
508.namd_r 224 480 444 480 443
510.parest_r 224 2457 238 2428 241
511.povray_r 224 765 684 766 682
519.lbm_r 224 758 312 761 310
521.wrf_r 224 1120 448 1122 447
526.blender_r 224 498 685 498 685
527.cam4_r 224 581 674 578 678
538.imagick_r 224 313 1780 312 1780
544.nab_r 224 286 1320 287 1310
549.fotonik3d_r 224 1827 478 1823 479
554.roms_r 224 1625 219 1627 219

Submit Notes

 The numactl mechanism was used to bind copies to processors. The config file option 'submit'
 was used to generate numactl commands to bind each copy to a specific processor.
 For details, please see the config file.

Operating System Notes

 Stack size set to unlimited using "ulimit -s unlimited"

Environment Variables Notes

Environment variables set by runcpu before the start of the run:
LD_LIBRARY_PATH =
     "/mnt/ramdisk/cpu2017-1.1.8-ic2022.1/lib/intel64:/mnt/ramdisk/cpu2017-1.
     1.8-ic2022.1/je5.0.1-64"
MALLOC_CONF = "retain:true"

General Notes

 Binaries compiled on a system with 2x Intel Xeon Platinum 8280M CPU + 384GB RAM
 memory using Red Hat Enterprise Linux 8.4
 Transparent Huge Pages enabled by default
 Prior to runcpu invocation
 Filesystem page cache synced and cleared with:
 sync; echo 3>       /proc/sys/vm/drop_caches
 runcpu command invoked through numactl i.e.:
 numactl --interleave=all runcpu <etc>
 jemalloc, a general purpose malloc implementation
 built with the RedHat Enterprise 7.5, and the system compiler gcc 4.8.5
 sources available from jemalloc.net or https://github.com/jemalloc/jemalloc/releases

 NA: The test sponsor attests, as of date of publication, that CVE-2017-5754 (Meltdown)
 is mitigated in the system as tested and documented.
 Yes: The test sponsor attests, as of date of publication, that CVE-2017-5753 (Spectre variant 1)
 is mitigated in the system as tested and documented.
 Yes: The test sponsor attests, as of date of publication, that CVE-2017-5715 (Spectre variant 2)
 is mitigated in the system as tested and documented.

 Benchmark run from a 125 GB ramdisk created with the cmd: "mount -t tmpfs -o size=125G tmpfs /mnt/ramdisk"

Platform Notes


BIOS settings:
     Virtualization Technology : Disabled
              Sub NUMA Cluster : Enabled
       DCU Streamer Prefetcher : Disabled
           Dead Line LLC Alloc : Disabled

                System Profile : Custom
          CPU Power Management : Maximum Performance
                           C1E : Disabled
                      C States : Autonomous
           Memory Patrol Scrub : Disabled
      Energy Efficiency Policy : Performance
            PCI ASPM L1 Link
              Power Management : Disabled

 Sysinfo program /mnt/ramdisk/cpu2017-1.1.8-ic2022.1/bin/sysinfo
 Rev: r6622 of 2021-04-07 982a61ec0915b55891ef0e16acafc64d
 running on auvcetillem1m1.corp.emc.com Thu Nov 17 23:47:46 2022

 SUT (System Under Test) info as seen by some common utilities.
 For more information on this section, see
    https://www.spec.org/cpu2017/Docs/config.html#sysinfo

 From /proc/cpuinfo
    model name : Intel(R) Xeon(R) Platinum 8276L CPU @ 2.20GHz
       4  "physical id"s (chips)
       224 "processors"
    cores, siblings (Caution: counting these is hw and system dependent. The following
    excerpts from /proc/cpuinfo might not be reliable.  Use with caution.)
       cpu cores : 28
       siblings  : 56
       physical 0: cores 0 1 2 3 4 5 6 8 9 10 11 12 13 14 16 17 18 19 20 21 22 24 25 26 27
       28 29 30
       physical 1: cores 0 1 2 3 4 5 6 8 9 10 11 12 13 14 16 17 18 19 20 21 22 24 25 26 27
       28 29 30
       physical 2: cores 0 1 2 3 4 5 6 8 9 10 11 12 13 14 16 17 18 19 20 21 22 24 25 26 27
       28 29 30
       physical 3: cores 0 1 2 3 4 5 6 8 9 10 11 12 13 14 16 17 18 19 20 21 22 24 25 26 27
       28 29 30

 From lscpu from util-linux 2.32.1:
      Architecture:        x86_64
      CPU op-mode(s):      32-bit, 64-bit
      Byte Order:          Little Endian
      CPU(s):              224
      On-line CPU(s) list: 0-223
      Thread(s) per core:  2
      Core(s) per socket:  28
      Socket(s):           4
      NUMA node(s):        8
      Vendor ID:           GenuineIntel
      BIOS Vendor ID:      Intel
      CPU family:          6
      Model:               85
      Model name:          Intel(R) Xeon(R) Platinum 8276L CPU @ 2.20GHz
      BIOS Model name:     Intel(R) Xeon(R) Platinum 8276L CPU @ 2.20GHz
      Stepping:            5
      CPU MHz:             4000.000
      CPU max MHz:         4000.0000
      CPU min MHz:         1000.0000
      BogoMIPS:            4400.00
      L1d cache:           32K
      L1i cache:           32K
      L2 cache:            1024K
      L3 cache:            39424K
      NUMA node0 CPU(s):
      0,9,17,25,33,41,49,57,65,73,81,89,97,105,112,121,129,137,145,153,161,169,177,185,193
      ,201,209,217
      NUMA node1 CPU(s):
      1,8,16,24,32,40,48,56,64,72,80,88,96,104,113,120,128,136,144,152,160,168,176,184,192
      ,200,208,216
      NUMA node2 CPU(s):
      2,10,18,26,34,42,50,58,66,74,82,90,98,106,114,122,130,138,146,154,162,170,178,186,19
      4,202,210,218
      NUMA node3 CPU(s):
      3,11,19,27,35,43,51,59,67,75,83,91,99,107,115,123,131,139,147,155,163,171,179,187,19
      5,203,211,219
      NUMA node4 CPU(s):
      4,12,20,28,36,44,52,60,68,76,84,92,100,108,116,124,132,140,148,156,164,172,180,188,1
      96,204,212,220
      NUMA node5 CPU(s):
      5,13,21,29,37,45,53,61,69,77,85,93,101,109,117,125,133,141,149,157,165,173,181,189,1
      97,205,213,221
      NUMA node6 CPU(s):
      6,14,22,30,38,46,54,62,70,78,86,94,102,110,118,126,134,142,150,158,166,174,182,190,1
      98,206,214,222
      NUMA node7 CPU(s):
      7,15,23,31,39,47,55,63,71,79,87,95,103,111,119,127,135,143,151,159,167,175,183,191,1
      99,207,215,223
      Flags:               fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov
      pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp
      lm constant_tsc art arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc cpuid
      aperfmperf pni pclmulqdq dtes64 monitor ds_cpl smx est tm2 ssse3 sdbg fma cx16 xtpr
      pdcm pcid dca sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx
      f16c rdrand lahf_lm abm 3dnowprefetch cpuid_fault epb cat_l3 cdp_l3 invpcid_single
      intel_ppin ssbd mba ibrs ibpb stibp fsgsbase tsc_adjust bmi1 hle avx2 smep bmi2 erms
      invpcid rtm cqm mpx rdt_a avx512f avx512dq rdseed adx smap clflushopt clwb intel_pt
      avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves cqm_llc cqm_occup_llc
      cqm_mbm_total cqm_mbm_local dtherm ida arat pln pts pku ospke md_clear flush_l1d
      arch_capabilities

 /proc/cpuinfo cache data
    cache size : 39424 KB

 From numactl --hardware
 WARNING: a numactl 'node' might or might not correspond to a physical chip.
   available: 8 nodes (0-7)
   node 0 cpus: 0 9 17 25 33 41 49 57 65 73 81 89 97 105 112 121 129 137 145 153 161 169
   177 185 193 201 209 217
   node 0 size: 96761 MB
   node 0 free: 95608 MB
   node 1 cpus: 1 8 16 24 32 40 48 56 64 72 80 88 96 104 113 120 128 136 144 152 160 168
   176 184 192 200 208 216
   node 1 size: 94882 MB
   node 1 free: 91332 MB
   node 2 cpus: 2 10 18 26 34 42 50 58 66 74 82 90 98 106 114 122 130 138 146 154 162 170
   178 186 194 202 210 218
   node 2 size: 96761 MB
   node 2 free: 95692 MB
   node 3 cpus: 3 11 19 27 35 43 51 59 67 75 83 91 99 107 115 123 131 139 147 155 163 171
   179 187 195 203 211 219
   node 3 size: 96761 MB
   node 3 free: 95056 MB
   node 4 cpus: 4 12 20 28 36 44 52 60 68 76 84 92 100 108 116 124 132 140 148 156 164 172
   180 188 196 204 212 220
   node 4 size: 96720 MB
   node 4 free: 94953 MB
   node 5 cpus: 5 13 21 29 37 45 53 61 69 77 85 93 101 109 117 125 133 141 149 157 165 173
   181 189 197 205 213 221
   node 5 size: 96761 MB
   node 5 free: 96244 MB
   node 6 cpus: 6 14 22 30 38 46 54 62 70 78 86 94 102 110 118 126 134 142 150 158 166 174
   182 190 198 206 214 222
   node 6 size: 96761 MB
   node 6 free: 96343 MB
   node 7 cpus: 7 15 23 31 39 47 55 63 71 79 87 95 103 111 119 127 135 143 151 159 167 175
   183 191 199 207 215 223
   node 7 size: 96751 MB
   node 7 free: 96260 MB
   node distances:
   node   0   1   2   3   4   5   6   7
     0:  10  21  21  21  21  11  21  21
     1:  21  10  21  21  11  21  21  21
     2:  21  21  10  21  21  21  11  21
     3:  21  21  21  10  21  21  21  11
     4:  21  11  21  21  10  21  21  21
     5:  11  21  21  21  21  10  21  21
     6:  21  21  11  21  21  21  10  21
     7:  21  21  21  11  21  21  21  10

 From /proc/meminfo
    MemTotal:       790694756 kB
    HugePages_Total:       0
    Hugepagesize:       2048 kB

 /sbin/tuned-adm active
     Current active profile: throughput-performance

 /sys/devices/system/cpu/cpu*/cpufreq/scaling_governor has
    performance

 From /etc/*release* /etc/*version*
    os-release:
       NAME="Red Hat Enterprise Linux"
       VERSION="8.6 (Ootpa)"
       ID="rhel"
       ID_LIKE="fedora"
       VERSION_ID="8.6"
       PLATFORM_ID="platform:el8"
       PRETTY_NAME="Red Hat Enterprise Linux 8.6 (Ootpa)"
       ANSI_COLOR="0;31"
    redhat-release: Red Hat Enterprise Linux release 8.6 (Ootpa)
    system-release: Red Hat Enterprise Linux release 8.6 (Ootpa)
    system-release-cpe: cpe:/o:redhat:enterprise_linux:8::baseos

 uname -a:
    Linux auvcetillem1m1.corp.emc.com 4.18.0-372.9.1.el8.x86_64 #1 SMP Fri Apr 15 22:12:19
    EDT 2022 x86_64 x86_64 x86_64 GNU/Linux

 Kernel self-reported vulnerability status:

 CVE-2018-12207 (iTLB Multihit):                        KVM: Mitigation: VMX unsupported
 CVE-2018-3620 (L1 Terminal Fault):                     Not affected
 Microarchitectural Data Sampling:                      Mitigation: Clear CPU buffers; SMT
                                                        vulnerable
 CVE-2017-5754 (Meltdown):                              Not affected
 CVE-2018-3639 (Speculative Store Bypass):              Mitigation: Speculative Store
                                                        Bypass disabled via prctl and
                                                        seccomp
 CVE-2017-5753 (Spectre variant 1):                     Mitigation: usercopy/swapgs
                                                        barriers and __user pointer
                                                        sanitization
 CVE-2017-5715 (Spectre variant 2):                     Mitigation: Retpolines, IBPB:
                                                        conditional, IBRS_FW, STIBP:
                                                        conditional, RSB filling
 CVE-2020-0543 (Special Register Buffer Data Sampling): Not affected
 CVE-2019-11135 (TSX Asynchronous Abort):               Mitigation: Clear CPU buffers; SMT
                                                        vulnerable

 run-level 3 Nov 17 19:58

 SPEC is set to: /mnt/ramdisk/cpu2017-1.1.8-ic2022.1
    Filesystem     Type   Size  Used Avail Use% Mounted on
    tmpfs          tmpfs  125G  3.6G  122G   3% /mnt/ramdisk

 From /sys/devices/virtual/dmi/id
     Vendor:         Dell Inc.
     Product:        PowerEdge R940
     Product Family: PowerEdge
     Serial:         3MM3HH2

 Additional information from dmidecode 3.3 follows.  WARNING: Use caution when you
 interpret this section. The 'dmidecode' program reads system data which is "intended to
 allow hardware to be accurately determined", but the intent may not be met, as there are
 frequent changes to hardware, firmware, and the "DMTF SMBIOS" standard.
   Memory:
     24x 00CE063200CE M393A4G43AB3-CWE 32 GB 2 rank 3200, configured at 2933

 BIOS:
    BIOS Vendor:       Dell Inc.
    BIOS Version:      2.15.1
    BIOS Date:         06/15/2022
    BIOS Revision:     2.15

 (End of data from sysinfo program)

Compiler Version Notes

==============================================================================
C               | 519.lbm_r(base) 538.imagick_r(base) 544.nab_r(base)
------------------------------------------------------------------------------
Intel(R) oneAPI DPC++/C++ Compiler for applications running on Intel(R) 64,
  Version 2022.1.0 Build 20220316
Copyright (C) 1985-2022 Intel Corporation. All rights reserved.
------------------------------------------------------------------------------

==============================================================================
C++             | 508.namd_r(base) 510.parest_r(base)
------------------------------------------------------------------------------
Intel(R) oneAPI DPC++/C++ Compiler for applications running on Intel(R) 64,
  Version 2022.1.0 Build 20220316
Copyright (C) 1985-2022 Intel Corporation. All rights reserved.
------------------------------------------------------------------------------

==============================================================================
C++, C          | 511.povray_r(base) 526.blender_r(base)
------------------------------------------------------------------------------
Intel(R) oneAPI DPC++/C++ Compiler for applications running on Intel(R) 64,
  Version 2022.1.0 Build 20220316
Copyright (C) 1985-2022 Intel Corporation. All rights reserved.
Intel(R) oneAPI DPC++/C++ Compiler for applications running on Intel(R) 64,
  Version 2022.1.0 Build 20220316
Copyright (C) 1985-2022 Intel Corporation. All rights reserved.
------------------------------------------------------------------------------

==============================================================================
C++, C, Fortran | 507.cactuBSSN_r(base)
------------------------------------------------------------------------------
Intel(R) oneAPI DPC++/C++ Compiler for applications running on Intel(R) 64,
  Version 2022.1.0 Build 20220316
Copyright (C) 1985-2022 Intel Corporation. All rights reserved.
Intel(R) oneAPI DPC++/C++ Compiler for applications running on Intel(R) 64,
  Version 2022.1.0 Build 20220316
Copyright (C) 1985-2022 Intel Corporation. All rights reserved.
Intel(R) Fortran Compiler for applications running on Intel(R) 64, Version
  2022.1.0 Build 20220316
Copyright (C) 1985-2022 Intel Corporation. All rights reserved.
------------------------------------------------------------------------------

==============================================================================
Fortran         | 503.bwaves_r(base) 549.fotonik3d_r(base) 554.roms_r(base)
------------------------------------------------------------------------------
Intel(R) Fortran Compiler for applications running on Intel(R) 64, Version
  2022.1.0 Build 20220316
Copyright (C) 1985-2022 Intel Corporation. All rights reserved.
------------------------------------------------------------------------------

==============================================================================
Fortran, C      | 521.wrf_r(base) 527.cam4_r(base)
------------------------------------------------------------------------------
Intel(R) Fortran Compiler for applications running on Intel(R) 64, Version
  2022.1.0 Build 20220316
Copyright (C) 1985-2022 Intel Corporation. All rights reserved.
Intel(R) oneAPI DPC++/C++ Compiler for applications running on Intel(R) 64,
  Version 2022.1.0 Build 20220316
Copyright (C) 1985-2022 Intel Corporation. All rights reserved.
------------------------------------------------------------------------------

Base Compiler Invocation

C benchmarks:

 icx 

C++ benchmarks:

 icpx 

Fortran benchmarks:

 ifx 

Benchmarks using both Fortran and C:

 ifx   icx 

Benchmarks using both C and C++:

 icpx   icx 

Benchmarks using Fortran, C, and C++:

 icpx   icx   ifx 

Base Portability Flags

503.bwaves_r:  -DSPEC_LP64 
507.cactuBSSN_r:  -DSPEC_LP64 
508.namd_r:  -DSPEC_LP64 
510.parest_r:  -DSPEC_LP64 
511.povray_r:  -DSPEC_LP64 
519.lbm_r:  -DSPEC_LP64 
521.wrf_r:  -DSPEC_LP64   -DSPEC_CASE_FLAG   -convert big_endian 
526.blender_r:  -DSPEC_LP64   -DSPEC_LINUX   -funsigned-char 
527.cam4_r:  -DSPEC_LP64   -DSPEC_CASE_FLAG 
538.imagick_r:  -DSPEC_LP64 
544.nab_r:  -DSPEC_LP64 
549.fotonik3d_r:  -DSPEC_LP64 
554.roms_r:  -DSPEC_LP64 

Base Optimization Flags

C benchmarks:

 -w   -std=c11   -m64   -Wl,-z,muldefs   -xCORE-AVX512   -Ofast   -ffast-math   -flto   -mfpmath=sse   -funroll-loops   -qopt-mem-layout-trans=4   -ljemalloc   -L/usr/local/jemalloc64-5.0.1/lib 

C++ benchmarks:

 -w   -m64   -Wl,-z,muldefs   -xCORE-AVX512   -Ofast   -ffast-math   -flto   -mfpmath=sse   -funroll-loops   -qopt-mem-layout-trans=4   -ljemalloc   -L/usr/local/jemalloc64-5.0.1/lib 

Fortran benchmarks:

 -w   -m64   -Wl,-z,muldefs   -xCORE-AVX512   -Ofast   -ffast-math   -flto   -mfpmath=sse   -funroll-loops   -qopt-mem-layout-trans=4   -nostandard-realloc-lhs   -align array32byte   -auto   -ljemalloc   -L/usr/local/jemalloc64-5.0.1/lib 

Benchmarks using both Fortran and C:

 -w   -m64   -std=c11   -Wl,-z,muldefs   -xCORE-AVX512   -Ofast   -ffast-math   -flto   -mfpmath=sse   -funroll-loops   -qopt-mem-layout-trans=4   -nostandard-realloc-lhs   -align array32byte   -auto   -ljemalloc   -L/usr/local/jemalloc64-5.0.1/lib 

Benchmarks using both C and C++:

 -w   -m64   -std=c11   -Wl,-z,muldefs   -xCORE-AVX512   -Ofast   -ffast-math   -flto   -mfpmath=sse   -funroll-loops   -qopt-mem-layout-trans=4   -ljemalloc   -L/usr/local/jemalloc64-5.0.1/lib 

Benchmarks using Fortran, C, and C++:

 -w   -m64   -std=c11   -Wl,-z,muldefs   -xCORE-AVX512   -Ofast   -ffast-math   -flto   -mfpmath=sse   -funroll-loops   -qopt-mem-layout-trans=4   -nostandard-realloc-lhs   -align array32byte   -auto   -ljemalloc   -L/usr/local/jemalloc64-5.0.1/lib 

The flags files that were used to format this result can be browsed at
http://www.spec.org/cpu2017/flags/Intel-ic2022-official-linux64_revA.html,
http://www.spec.org/cpu2017/flags/Dell-Platform-Flags-PowerEdge-Intel-Xeon-v1.2.html.

You can also download the XML flags sources by saving the following links:
http://www.spec.org/cpu2017/flags/Intel-ic2022-official-linux64_revA.xml,
http://www.spec.org/cpu2017/flags/Dell-Platform-Flags-PowerEdge-Intel-Xeon-v1.2.xml.