CPU2006 license: | 6 | Test date: | Mar-2007 |
---|---|---|---|
Test sponsor: | Sun Microsystems | Hardware Availability: | Apr-2007 |
Tested by: | Sun Microsystems | Software Availability: | Jul-2007 |
Hardware | |
---|---|
CPU Name: | SPARC64 VI |
CPU Characteristics: | |
CPU MHz: | 2280 |
FPU: | Integrated |
CPU(s) enabled: | 128 cores, 64 chips, 2 cores/chip, 2 threads/core |
CPU(s) orderable: | 1 to 16 CMUs; each CMU contains 2 or 4 chips |
Primary Cache: | 128 KB I + 128 KB D on chip per core |
Secondary Cache: | 5 MB I+D on chip per chip |
L3 Cache: | None |
Other Cache: | None |
Memory: | 1 TB (512 x 2 GB) |
Disk Subsystem: | 673 GB RAID 1+0 created by Solaris Volume Manager with 20 x 73 GB 10,000 RPM Fujitsu MAY2073RC SAS |
Other Hardware: | None |
Software | |
---|---|
Operating System: | Solaris 10 7/07 (build s10s_u4wos_04) |
Compiler: | Sun Studio 12 (Build 44.0) |
Auto Parallel: | No |
File System: | ufs |
System State: | Default |
Base Pointers: | 32-bit |
Peak Pointers: | 32-bit |
Other Software: | None |

Results appear in the order in which they were run. Bold underlined text indicates a median measurement.

Benchmark | Base Copies | Base Run 1 Seconds | Base Run 1 Ratio | Base Run 2 Seconds | Base Run 2 Ratio | Base Run 3 Seconds | Base Run 3 Ratio | Peak Copies | Peak Run 1 Seconds | Peak Run 1 Ratio | Peak Run 2 Seconds | Peak Run 2 Ratio | Peak Run 3 Seconds | Peak Run 3 Ratio |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
410.bwaves | 255 | 2717 | 1280 | 2676 | 1290 | 2676 | 1290 | 255 | 2675 | 1300 | 2676 | 1290 | 2677 | 1290 |
416.gamess | 255 | 4041 | 1240 | 4035 | 1240 | 4051 | 1230 | 127 | 1928 | 1290 | 1922 | 1290 | 1923 | 1290 |
433.milc | 255 | 3794 | 617 | 3795 | 617 | 3794 | 617 | 255 | 3737 | 626 | 3740 | 626 | 3766 | 622 |
434.zeusmp | 255 | 2612 | 888 | 2592 | 895 | 2592 | 895 | 255 | 2612 | 888 | 2592 | 895 | 2592 | 895 |
435.gromacs | 255 | 1450 | 1260 | 1447 | 1260 | 1450 | 1260 | 127 | 672 | 1350 | 672 | 1350 | 675 | 1340 |
436.cactusADM | 255 | 1900 | 1600 | 1835 | 1660 | 1828 | 1670 | 255 | 1900 | 1600 | 1835 | 1660 | 1828 | 1670 |
437.leslie3d | 255 | 2188 | 1100 | 2186 | 1100 | 2188 | 1100 | 255 | 2148 | 1120 | 2149 | 1120 | 2155 | 1110 |
444.namd | 255 | 1631 | 1250 | 1631 | 1250 | 1632 | 1250 | 127 | 762 | 1340 | 763 | 1340 | 763 | 1340 |
447.dealII | 255 | 1698 | 1720 | 1695 | 1720 | 1695 | 1720 | 255 | 1664 | 1750 | 1680 | 1740 | 1673 | 1740 |
450.soplex | 255 | 2970 | 716 | 2899 | 733 | 2897 | 734 | 255 | 2895 | 735 | 2820 | 754 | 2809 | 757 |
453.povray | 255 | 1273 | 1070 | 1274 | 1060 | 1268 | 1070 | 127 | 406 | 1670 | 405 | 1670 | 405 | 1670 |
454.calculix | 255 | 1395 | 1510 | 1389 | 1510 | 1390 | 1510 | 255 | 1395 | 1510 | 1389 | 1510 | 1390 | 1510 |
459.GemsFDTD | 255 | 3596 | 752 | 3557 | 761 | 3584 | 755 | 255 | 3596 | 752 | 3557 | 761 | 3584 | 755 |
465.tonto | 255 | 1868 | 1340 | 1869 | 1340 | 1867 | 1340 | 127 | 871 | 1430 | 829 | 1510 | 870 | 1440 |
470.lbm | 255 | 3823 | 916 | 3835 | 914 | 3828 | 915 | 255 | 3802 | 922 | 3813 | 919 | 3814 | 919 |
481.wrf | 255 | 2036 | 1400 | 2034 | 1400 | 2030 | 1400 | 255 | 2036 | 1400 | 2034 | 1400 | 2030 | 1400 |
482.sphinx3 | 255 | 5318 | 935 | 5309 | 936 | 5309 | 936 | 255 | 5318 | 935 | 5309 | 936 | 5309 | 936 |
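
The table reports three runs per benchmark and marks the median of each. As a rough illustration of how those medians combine into an overall figure, the sketch below takes the median ratio per benchmark and then the geometric mean across benchmarks, which is how SPEC rate metrics are aggregated. The ratios are copied from the table above; because they are already rounded to three significant digits, the values computed here only approximate an officially published score. The names `RATIOS`, `geometric_mean`, and `overall` are illustrative and not part of any SPEC tool.

```python
from math import exp, log
from statistics import median

# Ratios copied from the results table: benchmark -> (base runs, peak runs).
RATIOS = {
    "410.bwaves":    ((1280, 1290, 1290), (1300, 1290, 1290)),
    "416.gamess":    ((1240, 1240, 1230), (1290, 1290, 1290)),
    "433.milc":      ((617, 617, 617),    (626, 626, 622)),
    "434.zeusmp":    ((888, 895, 895),    (888, 895, 895)),
    "435.gromacs":   ((1260, 1260, 1260), (1350, 1350, 1340)),
    "436.cactusADM": ((1600, 1660, 1670), (1600, 1660, 1670)),
    "437.leslie3d":  ((1100, 1100, 1100), (1120, 1120, 1110)),
    "444.namd":      ((1250, 1250, 1250), (1340, 1340, 1340)),
    "447.dealII":    ((1720, 1720, 1720), (1750, 1740, 1740)),
    "450.soplex":    ((716, 733, 734),    (735, 754, 757)),
    "453.povray":    ((1070, 1060, 1070), (1670, 1670, 1670)),
    "454.calculix":  ((1510, 1510, 1510), (1510, 1510, 1510)),
    "459.GemsFDTD":  ((752, 761, 755),    (752, 761, 755)),
    "465.tonto":     ((1340, 1340, 1340), (1430, 1510, 1440)),
    "470.lbm":       ((916, 914, 915),    (922, 919, 919)),
    "481.wrf":       ((1400, 1400, 1400), (1400, 1400, 1400)),
    "482.sphinx3":   ((935, 936, 936),    (935, 936, 936)),
}


def geometric_mean(values):
    """Return the geometric mean of an iterable of positive numbers."""
    values = list(values)
    return exp(sum(log(v) for v in values) / len(values))


def overall(ratios, column):
    """Geometric mean of per-benchmark median ratios (column 0 = base, 1 = peak)."""
    return geometric_mean(median(runs[column]) for runs in ratios.values())


base_score = overall(RATIOS, 0)
peak_score = overall(RATIOS, 1)
print(f"approximate base: {base_score:.0f}, approximate peak: {peak_score:.0f}")
```

Using the geometric mean keeps one unusually fast or slow benchmark from dominating the overall figure.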

Processes were bound to cores using "submit" and "pbind". The SPEC toolset was bound to processor 0.

These shell commands request use of local 4MB pages:

    export LD_PRELOAD=madv.so.1:mpss.so.1
    export MPSSHEAP=4MB
    export MPSSSTACK=4MB
    export MADV=access_lwp

'access_lwp' means that the next light weight process to touch the specified address range will access it the most heavily.

"ulimit -s 131072" was used to limit the space consumed by the stack (and therefore make more space available to the heap).

/etc/system parameters:

    autoup=300
        Causes pages older than the listed number of seconds to be written by fsflush.
    bufhwm=3000
        Memory byte limit for caching I/O buffers.
    segmap_percent=1
        Sets the maximum percentage of memory for the file system cache.
    tune_t_fsflushr=3
        Controls how many seconds elapse between runs of the page flush daemon, fsflush.

The "webconsole" service was turned off using:

    svcadm disable webconsole

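
The copy counts in the results table (255 and 127) line up with the hardware configuration once one processor is set aside for the SPEC toolset. The arithmetic below is a sketch of that reading. Treating 127 as "one copy per core, minus the core shared with the toolset" is an interpretation of the numbers and is not stated in the disclosure.

```python
# Values from the "CPU(s) enabled" row of the hardware table.
CHIPS = 64
CORES_PER_CHIP = 2
THREADS_PER_CORE = 2

cores = CHIPS * CORES_PER_CHIP               # 128 cores
hardware_threads = cores * THREADS_PER_CORE  # 256 hardware threads

# The notes say the SPEC toolset was bound to processor 0, so one
# hardware thread is assumed to be unavailable to benchmark copies.
RESERVED_FOR_TOOLSET = 1

# Assumption: 255 copies = every remaining hardware thread,
# and 127 copies = one copy per remaining core.
copies_all_threads = hardware_threads - RESERVED_FOR_TOOLSET
copies_one_per_core = cores - RESERVED_FOR_TOOLSET

print(cores, hardware_threads, copies_all_threads, copies_one_per_core)
# prints: 128 256 255 127
```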
"CMU" = CPU/Memory Unit; each holds 2 or 4 CPU chips. Memory is 8-way interleaved by filling all slots with the same capacity DIMMs. This result was measured using a Sun SPARC Enterprise M9000 Server. Note that the Fujitsu SPARC Enterprise M9000 and Sun SPARC Enterprise M9000 are electrically equivalent.
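Base Compiler Invocation | |
---|---|
C benchmarks: | cc |
C++ benchmarks: | CC |
Fortran benchmarks: | f90 |
Benchmarks using both Fortran and C: | cc f90 |
Base Optimization Flags | |
---|---|
C benchmarks: | -fast -fma=fused -xcache=128/64/2:5120/256/10 -xipo=2 -xpagesize=4M -xprefetch_level=2 -xprefetch=latx:2 -xalias_level=std -xprefetch_level=3 -xprefetch_auto_type=indirect_array_access |
C++ benchmarks: | -xdepend -library=stlport4 -fast -fma=fused -xcache=128/64/2:5120/256/10 -xipo=2 -xpagesize=4M -xprefetch_level=2 -xprefetch=latx:2 -xalias_level=compatible |
Fortran benchmarks: | -fast -fma=fused -xcache=128/64/2:5120/256/10 -xipo=2 -xpagesize=4M -xprefetch_level=2 -xprefetch=latx:2 |
Benchmarks using both Fortran and C: | -fast(cc) -fast(f90) -fma=fused -xcache=128/64/2:5120/256/10 -xipo=2 -xpagesize=4M -xprefetch_level=2 -xprefetch=latx:2 -xalias_level=std -xprefetch_level=3 -xprefetch_auto_type=indirect_array_access |
Base Other Flags | |
---|---|
C benchmarks: | -xjobs=24 -V -# |
C++ benchmarks: | -xjobs=24 -verbose=diags,version |
Fortran benchmarks: | -xjobs=24 -V -v |
Benchmarks using both Fortran and C: | -xjobs=24 -V -# -v |
Peak Compiler Invocation | |
---|---|
C benchmarks: | cc |
C++ benchmarks: | CC |
Fortran benchmarks: | f90 |
Benchmarks using both Fortran and C: | cc f90 |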
Peak Optimization Flags | |
---|---|
410.bwaves: | -fast -xcache=128/64/2:5120/256/10 -xpagesize=4M -xipo=2 -xprefetch_level=2 -fma=fused -xprefetch=latx:3 |
416.gamess: | -fast -xcache=128/64/2:5120/256/10 -xpagesize=4M -xipo=2 -xprefetch_level=2 -fma=fused |
434.zeusmp: | basepeak = yes |
437.leslie3d: | -fast -xcache=128/64/2:5120/256/10 -xpagesize=4M -xprefetch_level=3 -qoption cg -Qlp=1 -qoption cg -Qlp-fa=0 -qoption cg -Qlp-fl=1 -qoption cg -Qlp-av=448 -qoption cg -Qlp-t=4 -xprefetch=latx:3.5 |
459.GemsFDTD: | basepeak = yes |
465.tonto: | -fast -xcache=128/64/2:5120/256/10 -xpagesize=4M -xipo=2 -xprefetch=latx:12 -lfast |
435.gromacs: | -xprofile=collect:./feedback(pass 1) -xprofile=use:./feedback(pass 2) -fast(cc) -fast(f90) -xcache=128/64/2:5120/256/10 -xpagesize=4M -xipo=2 -xinline= -xarch=generic -xchip=generic -fsimple=0 -fma=fused |
436.cactusADM: | basepeak = yes |
454.calculix: | basepeak = yes |
481.wrf: | basepeak = yes |
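
Nearly every flag set above passes `-xcache=128/64/2:5120/256/10`. In Sun Studio, each colon-separated group of this option describes one cache level as size in kilobytes, line size in bytes, and associativity. The sketch below decodes the value so it can be compared with the hardware table (128 KB primary data cache, 5 MB secondary cache). `CacheLevel` and `parse_xcache` are hypothetical helper names written for this illustration; they are not part of the compiler or the SPEC tools.

```python
from typing import List, NamedTuple


class CacheLevel(NamedTuple):
    size_kb: int        # cache size in kilobytes
    line_bytes: int     # cache line size in bytes
    associativity: int  # number of ways


def parse_xcache(value: str) -> List[CacheLevel]:
    """Split an -xcache value such as '128/64/2:5120/256/10' into levels."""
    levels = []
    for group in value.split(":"):
        size_kb, line_bytes, associativity = (int(part) for part in group.split("/"))
        levels.append(CacheLevel(size_kb, line_bytes, associativity))
    return levels


levels = parse_xcache("128/64/2:5120/256/10")
for number, level in enumerate(levels, start=1):
    print(
        f"level {number}: {level.size_kb} KB, "
        f"{level.line_bytes}-byte lines, {level.associativity}-way"
    )
# prints:
# level 1: 128 KB, 64-byte lines, 2-way
# level 2: 5120 KB, 256-byte lines, 10-way
```

The second level's 5120 KB equals the 5 MB secondary cache listed under Hardware, and the first level's 128 KB matches the per-core primary data cache.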