OS Images |
os_Image_1(1)
|
Hardware Description |
hw_1
|
Number of Systems |
1
|
SW Environment |
non-virtual
|
Tuning |
BIOS Settings:
- NUMA nodes per socket = NPS2
- Determinism Control = Manual
- Determinism Slider = Power
- cTDP Control = Manual
- cTDP = 240
- Package Power Limit Control = Manual
- Package Power Limit = 240
- Memory Clock Speed = 1467MHz
- L1 Stream HW Prefetcher = Disable
- L2 Stream HW Prefetcher = Disable
|
Notes |
None
|
|
JVM Instances |
jvm_Ctr_1(1), jvm_Backend_1(8), jvm_TxInjector_1(8)
|
OS Image Description |
os_1
|
Tuning |
- cpupower -c all frequency-set -g performance
- tuned-adm profile throughput-performance
- echo 10000000 > /proc/sys/kernel/sched_min_granularity_ns
- echo 15000000 > /proc/sys/kernel/sched_wakeup_granularity_ns
- echo 1000 > /proc/sys/kernel/sched_migration_cost_ns
- echo 990000 > /proc/sys/kernel/sched_rt_runtime_us
- echo 24000000 > /proc/sys/kernel/sched_latency_ns
- echo 10000 > /proc/sys/vm/dirty_expire_centisecs
- echo 1500 > /proc/sys/vm/dirty_writeback_centisecs
- echo 40 > /proc/sys/vm/dirty_ratio
- echo 10 > /proc/sys/vm/dirty_background_ratio
- echo 10 > /proc/sys/vm/swappiness
- echo 0 > /proc/sys/kernel/numa_balancing
- echo always > /sys/kernel/mm/transparent_hugepage/defrag
- echo always > /sys/kernel/mm/transparent_hugepage/enabled
- Add cgroup_disable=memory,cpu,cpuacct,blkio,hugetlb,pids,cpuset,perf_event,freezer,devices,net_cls,net_prio to GRUB_CMDLINE_LINUX_DEFAULT
- ulimit -n 1024000
- UserTasksMax=970000
- DefaultTasksMax=970000
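
Each `echo VALUE > /proc/sys/...` line above corresponds to a sysctl key formed by dropping the `/proc/sys/` prefix and replacing `/` with `.`. The following is a minimal read-only sketch for checking a few of the listed values on a running system. The helper names `path_to_key` and `check_tunable` are hypothetical and not part of the disclosed configuration, and some scheduler tunables may not exist under `/proc/sys` on every kernel, so missing files are reported rather than treated as errors.

```shell
#!/bin/sh
# Convert a /proc/sys path to its sysctl key, e.g.
# /proc/sys/vm/swappiness -> vm.swappiness
path_to_key() {
    printf '%s\n' "${1#/proc/sys/}" | tr '/' '.'
}

# Compare the current value of a tunable with the expected one.
# Read-only: nothing is written to /proc.
check_tunable() {
    path=$1
    want=$2
    key=$(path_to_key "$path")
    if [ ! -r "$path" ]; then
        echo "$key: not present on this kernel"
        return 0
    fi
    have=$(cat "$path")
    if [ "$have" = "$want" ]; then
        echo "$key: ok ($have)"
    else
        echo "$key: have $have, want $want"
    fi
}

# A subset of the values listed in the Tuning cell above.
check_tunable /proc/sys/kernel/numa_balancing 0
check_tunable /proc/sys/vm/swappiness 10
check_tunable /proc/sys/vm/dirty_ratio 40
check_tunable /proc/sys/vm/dirty_background_ratio 10
```

The script only reads current values; applying them still requires root and the commands listed above.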
|
Notes |
None
|
Parts of Benchmark |
Controller
|
JVM Instance Description |
jvm_1
|
Command Line |
-Xms2g -Xmx2g -Xmn1536m -XX:+UseParallelOldGC -XX:ParallelGCThreads=2
|
Tuning |
None
|
Notes |
Used numactl to interleave memory on all NUMA nodes
|
Parts of Benchmark |
Backend
|
JVM Instance Description |
jvm_1
|
Command Line |
-Xms126g -Xmx126g -Xmn124g -server -XX:MetaspaceSize=30m -XX:AllocatePrefetchInstr=2 -XX:LargePageSizeInBytes=2m -XX:-UsePerfData -XX:-UseAdaptiveSizePolicy -XX:+AlwaysPreTouch -XX:-UseBiasedLocking -XX:+UseLargePages -XX:+UseParallelOldGC -XX:SurvivorRatio=70 -XX:TargetSurvivorRatio=95 -XX:ParallelGCThreads=12 -XX:MaxTenuringThreshold=15 -XX:InitialCodeCacheSize=25m -XX:+UseCompressedOops -XX:ObjectAlignmentInBytes=32 -XX:+UseTransparentHugePages
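
The heap options above interact in ways that simple arithmetic makes visible. The sketch below assumes the usual HotSpot behaviour: compressed oops are 32-bit offsets scaled by the object alignment, and `SurvivorRatio=N` sizes each survivor space as young generation / (N + 2). The variable names are hypothetical, and the JVM rounds actual region sizes to its own alignment, so the figures are approximate.

```shell
#!/bin/sh
# Values taken from the Backend command line above.
heap_gib=126          # -Xms126g / -Xmx126g
young_gib=124         # -Xmn124g
align_bytes=32        # -XX:ObjectAlignmentInBytes=32
survivor_ratio=70     # -XX:SurvivorRatio=70

# Compressed-oops limit: 2^32 * alignment bytes, i.e. 4 GiB * alignment.
max_coops_gib=$(( 4 * align_bytes ))

# Old generation is whatever the young generation leaves over.
old_gib=$(( heap_gib - young_gib ))

# Young generation = eden + 2 survivor spaces, with eden:survivor = ratio:1.
young_mib=$(( young_gib * 1024 ))
survivor_mib=$(( young_mib / (survivor_ratio + 2) ))
eden_mib=$(( young_mib - 2 * survivor_mib ))

echo "compressed-oops limit: ${max_coops_gib} GiB (heap: ${heap_gib} GiB)"
echo "old generation:        ${old_gib} GiB"
echo "each survivor space:   ~${survivor_mib} MiB"
echo "eden:                  ~${eden_mib} MiB"
```

Under these assumptions, the 32-byte alignment is what lets a 126 GiB heap stay within the compressed-oops limit, which would otherwise be 32 GiB at the default 8-byte alignment.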
|
Tuning |
None
|
Notes |
Used numactl to affinitize two Backend JVMs (2 Groups) to each NUMA node:
- Group1: numactl --cpunodebind=0 --localalloc
- Group2: numactl --cpunodebind=0 --localalloc
- Group3: numactl --cpunodebind=1 --localalloc
- Group4: numactl --cpunodebind=1 --localalloc
- Group5: numactl --cpunodebind=2 --localalloc
- Group6: numactl --cpunodebind=2 --localalloc
- Group7: numactl --cpunodebind=3 --localalloc
- Group8: numactl --cpunodebind=3 --localalloc
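
The group-to-node assignment listed above follows a simple rule: with two groups per node, group g runs on node (g - 1) / 2 using integer division. The sketch below only prints the numactl prefixes and does not launch anything. The names `num_groups`, `per_node`, and `backend_prefix` are hypothetical and are not taken from the original run scripts.

```shell
#!/bin/sh
num_groups=8   # Backend JVMs (one per group)
per_node=2     # groups pinned to each NUMA node

# Print the numactl prefix for a given 1-based group number.
backend_prefix() {
    node=$(( ($1 - 1) / per_node ))
    echo "numactl --cpunodebind=$node --localalloc"
}

g=1
while [ "$g" -le "$num_groups" ]; do
    echo "Group$g: $(backend_prefix "$g")"
    g=$(( g + 1 ))
done
```

The same mapping applies to the Transaction Injector groups, which use `--membind=<node>` in place of `--localalloc`.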
|
Parts of Benchmark |
TxInjector
|
JVM Instance Description |
jvm_1
|
Command Line |
-Xms2g -Xmx2g -Xmn1536m -XX:+UseParallelOldGC -XX:ParallelGCThreads=2
|
Tuning |
None
|
Notes |
Used numactl to affinitize two Transaction Injector JVMs (2 Groups) to each NUMA node:
- Group1: numactl --cpunodebind=0 --membind=0
- Group2: numactl --cpunodebind=0 --membind=0
- Group3: numactl --cpunodebind=1 --membind=1
- Group4: numactl --cpunodebind=1 --membind=1
- Group5: numactl --cpunodebind=2 --membind=2
- Group6: numactl --cpunodebind=2 --membind=2
- Group7: numactl --cpunodebind=3 --membind=3
- Group8: numactl --cpunodebind=3 --membind=3
|
|