| CPU2006 license: | 4 | Test date: | Jun-2016 |
|---|---|---|---|
| Test sponsor: | SGI | Hardware Availability: | Mar-2016 |
| Tested by: | SGI | Software Availability: | May-2016 |
| Hardware | |
|---|---|
| CPU Name: | Intel Xeon E5-2680 v4 |
| CPU Characteristics: | Intel Turbo Boost Technology up to 3.30 GHz |
| CPU MHz: | 2400 |
| FPU: | Integrated |
| CPU(s) enabled: | 28 cores, 2 chips, 14 cores/chip, 2 threads/core |
| CPU(s) orderable: | 1,2 chip |
| Primary Cache: | 32 KB I + 32 KB D on chip per core |
| Secondary Cache: | 256 KB I+D on chip per core |
| L3 Cache: | 35 MB I+D on chip per chip |
| Other Cache: | None |
| Memory: | 256 GB (8 x 32 GB 2Rx4 PC4-2400T-R) |
| Disk Subsystem: | 1 x 500 GB SATA, 10K RPM |
| Other Hardware: | None |
| Software | |
|---|---|
| Operating System: | SuSE Enterprise Linux 12 (x86_64) SP1, Kernel 3.12.57-60.35-default |
| Compiler: | C/C++: Version 16.0.0.101 of Intel C++ Studio XE for Linux; Fortran: Version 16.0.0.101 of Intel Fortran Studio XE for Linux |
| Auto Parallel: | No |
| File System: | xfs |
| System State: | Run level 3 (multi-user) |
| Base Pointers: | 32/64-bit |
| Peak Pointers: | 32/64-bit |
| Other Software: | None |
| Benchmark | Base | Peak | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Copies | Seconds | Ratio | Seconds | Ratio | Seconds | Ratio | Copies | Seconds | Ratio | Seconds | Ratio | Seconds | Ratio | |
| Results appear in the order in which they were run. Bold underlined text indicates a median measurement. | ||||||||||||||
| 410.bwaves | 56 | 1139 | 668 | 1139 | 668 | 1140 | 667 | 56 | 1139 | 668 | 1139 | 668 | 1140 | 667 |
| 416.gamess | 56 | 1022 | 1070 | 1025 | 1070 | 1024 | 1070 | 56 | 990 | 1110 | 989 | 1110 | 986 | 1110 |
| 433.milc | 56 | 805 | 639 | 805 | 639 | 805 | 639 | 56 | 805 | 639 | 805 | 639 | 805 | 639 |
| 434.zeusmp | 56 | 499 | 1020 | 499 | 1020 | 499 | 1020 | 56 | 499 | 1020 | 499 | 1020 | 499 | 1020 |
| 435.gromacs | 56 | 311 | 1290 | 313 | 1280 | 310 | 1290 | 56 | 299 | 1340 | 301 | 1330 | 300 | 1330 |
| 436.cactusADM | 56 | 591 | 1130 | 591 | 1130 | 592 | 1130 | 56 | 591 | 1130 | 591 | 1130 | 592 | 1130 |
| 437.leslie3d | 56 | 1122 | 469 | 1124 | 468 | 1121 | 469 | 56 | 1122 | 469 | 1124 | 468 | 1121 | 469 |
| 444.namd | 56 | 491 | 914 | 493 | 911 | 493 | 910 | 56 | 490 | 917 | 491 | 915 | 488 | 920 |
| 447.dealII | 56 | 383 | 1670 | 379 | 1690 | 382 | 1680 | 56 | 383 | 1670 | 379 | 1690 | 382 | 1680 |
| 450.soplex | 56 | 971 | 481 | 972 | 481 | 973 | 480 | 28 | 433 | 540 | 431 | 542 | 431 | 541 |
| 453.povray | 56 | 217 | 1380 | 217 | 1380 | 218 | 1370 | 56 | 185 | 1610 | 182 | 1630 | 182 | 1630 |
| 454.calculix | 56 | 300 | 1540 | 299 | 1550 | 298 | 1550 | 56 | 300 | 1540 | 299 | 1550 | 298 | 1550 |
| 459.GemsFDTD | 56 | 1330 | 447 | 1331 | 446 | 1329 | 447 | 56 | 1330 | 447 | 1331 | 446 | 1329 | 447 |
| 465.tonto | 56 | 549 | 1000 | 544 | 1010 | 548 | 1010 | 56 | 511 | 1080 | 510 | 1080 | 514 | 1070 |
| 470.lbm | 56 | 865 | 890 | 865 | 890 | 865 | 889 | 56 | 865 | 890 | 865 | 890 | 865 | 889 |
| 481.wrf | 56 | 785 | 797 | 783 | 799 | 783 | 799 | 56 | 785 | 797 | 783 | 799 | 783 | 799 |
| 482.sphinx3 | 56 | 1317 | 829 | 1317 | 828 | 1319 | 827 | 56 | 1317 | 829 | 1317 | 828 | 1319 | 827 |
The numactl mechanism was used to bind copies to processors. The config file option 'submit' was used to generate numactl commands to bind each copy to a specific processor. For details, please see the config file.
Stack size set to unlimited using "ulimit -s unlimited"
Environment variables set by runspec before the start of the run: LD_LIBRARY_PATH = "/store/draddatz/cpu2006/libs/32:/store/draddatz/cpu2006/libs/64:/store/draddatz/cpu2006/sh" Binaries compiled on a system with 1x Intel Core i5-4670K CPU + 32GB memory using RedHat EL 7.1 Transparent Huge Pages enabled with: echo always > /sys/kernel/mm/transparent_hugepage/enabled Filesystem page cache cleared with: echo 1 > /proc/sys/vm/drop_caches runspec command invoked through numactl i.e.: numactl --interleave=all runspec <etc>
| icc -m64 |
| icpc -m64 |
| ifort -m64 |
| icc -m64 ifort -m64 |
| 410.bwaves: | -DSPEC_CPU_LP64 |
| 416.gamess: | -DSPEC_CPU_LP64 |
| 433.milc: | -DSPEC_CPU_LP64 |
| 434.zeusmp: | -DSPEC_CPU_LP64 |
| 435.gromacs: | -DSPEC_CPU_LP64 -nofor_main |
| 436.cactusADM: | -DSPEC_CPU_LP64 -nofor_main |
| 437.leslie3d: | -DSPEC_CPU_LP64 |
| 444.namd: | -DSPEC_CPU_LP64 |
| 447.dealII: | -DSPEC_CPU_LP64 |
| 450.soplex: | -DSPEC_CPU_LP64 |
| 453.povray: | -DSPEC_CPU_LP64 |
| 454.calculix: | -DSPEC_CPU_LP64 -nofor_main |
| 459.GemsFDTD: | -DSPEC_CPU_LP64 |
| 465.tonto: | -DSPEC_CPU_LP64 |
| 470.lbm: | -DSPEC_CPU_LP64 |
| 481.wrf: | -DSPEC_CPU_LP64 -DSPEC_CPU_CASE_FLAG -DSPEC_CPU_LINUX |
| 482.sphinx3: | -DSPEC_CPU_LP64 |
| -xCORE-AVX2 -ipo -O3 -no-prec-div -opt-prefetch -auto-p32 -ansi-alias -opt-mem-layout-trans=3 |
| -xCORE-AVX2 -ipo -O3 -no-prec-div -opt-prefetch -auto-p32 -ansi-alias -opt-mem-layout-trans=3 |
| -xCORE-AVX2 -ipo -O3 -no-prec-div -opt-prefetch |
| -xCORE-AVX2 -ipo -O3 -no-prec-div -opt-prefetch -auto-p32 -ansi-alias -opt-mem-layout-trans=3 |
| icc -m64 |
| icpc -m64 | |
| 450.soplex: | icpc -m32 -L/opt/intel/compilers_and_libraries_2016/linux/compiler/lib/ia32_lin |
| ifort -m64 |
| icc -m64 ifort -m64 |
| 410.bwaves: | -DSPEC_CPU_LP64 |
| 416.gamess: | -DSPEC_CPU_LP64 |
| 433.milc: | -DSPEC_CPU_LP64 |
| 434.zeusmp: | -DSPEC_CPU_LP64 |
| 435.gromacs: | -DSPEC_CPU_LP64 -nofor_main |
| 436.cactusADM: | -DSPEC_CPU_LP64 -nofor_main |
| 437.leslie3d: | -DSPEC_CPU_LP64 |
| 444.namd: | -DSPEC_CPU_LP64 |
| 447.dealII: | -DSPEC_CPU_LP64 |
| 450.soplex: | -D_FILE_OFFSET_BITS=64 |
| 453.povray: | -DSPEC_CPU_LP64 |
| 454.calculix: | -DSPEC_CPU_LP64 -nofor_main |
| 459.GemsFDTD: | -DSPEC_CPU_LP64 |
| 465.tonto: | -DSPEC_CPU_LP64 |
| 470.lbm: | -DSPEC_CPU_LP64 |
| 481.wrf: | -DSPEC_CPU_LP64 -DSPEC_CPU_CASE_FLAG -DSPEC_CPU_LINUX |
| 482.sphinx3: | -DSPEC_CPU_LP64 |
| 433.milc: | basepeak = yes |
| 470.lbm: | basepeak = yes |
| 482.sphinx3: | basepeak = yes |
| 444.namd: | -xCORE-AVX2(pass 2) -prof-gen:threadsafe(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -par-num-threads=1(pass 1) -opt-mem-layout-trans=3(pass 2) -prof-use(pass 2) -fno-alias -auto-ilp32 |
| 447.dealII: | basepeak = yes |
| 450.soplex: | -xCORE-AVX2(pass 2) -prof-gen:threadsafe(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -par-num-threads=1(pass 1) -opt-mem-layout-trans=3(pass 2) -prof-use(pass 2) -opt-malloc-options=3 |
| 453.povray: | -xCORE-AVX2(pass 2) -prof-gen:threadsafe(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -par-num-threads=1(pass 1) -opt-mem-layout-trans=3(pass 2) -prof-use(pass 2) -unroll4 -ansi-alias |
| 410.bwaves: | basepeak = yes |
| 416.gamess: | -xCORE-AVX2(pass 2) -prof-gen:threadsafe(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -par-num-threads=1(pass 1) -prof-use(pass 2) -unroll2 -inline-level=0 -scalar-rep- |
| 434.zeusmp: | basepeak = yes |
| 437.leslie3d: | basepeak = yes |
| 459.GemsFDTD: | basepeak = yes |
| 465.tonto: | -xCORE-AVX2(pass 2) -prof-gen:threadsafe(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -par-num-threads=1(pass 1) -prof-use(pass 2) -unroll4 -auto -inline-calloc -opt-malloc-options=3 |
| 435.gromacs: | -xCORE-AVX2(pass 2) -prof-gen:threadsafe(pass 1) -ipo(pass 2) -O3(pass 2) -no-prec-div(pass 2) -par-num-threads=1(pass 1) -opt-mem-layout-trans=3(pass 2) -prof-use(pass 2) -opt-prefetch -auto-ilp32 |
| 436.cactusADM: | basepeak = yes |
| 454.calculix: | basepeak = yes |
| 481.wrf: | basepeak = yes |