|
JVM Instances |
jvm_Ctr_1(1), jvm_Backend_1(8), jvm_TxInjector_1(8)
|
OS Image Description |
os_1
|
Tuning |
- cpupower -c all frequency-set -g performance
- tuned-adm profile throughput-performance
- echo "performance" | tee /sys/devices/system/cpu/cpu*/cpufreq/scaling_governor
- echo always > /sys/kernel/mm/transparent_hugepage/defrag
- echo always > /sys/kernel/mm/transparent_hugepage/enabled
- echo 300 > /sys/kernel/mm/hugepages/hugepages-1048576kB/nr_hugepages
- echo 8000 > /sys/kernel/mm/hugepages/hugepages-2048kB/nr_hugepages
- systemctl stop systemd-update-utmp-runlevel.service
- echo 10000 > /proc/sys/kernel/sched_cfs_bandwidth_slice_us
- echo 0 > /proc/sys/kernel/sched_child_runs_first
- echo 56000000 > /sys/kernel/debug/sched/latency_ns
- echo 1000 > /sys/kernel/debug/sched/migration_cost_ns
- echo 16000000 > /sys/kernel/debug/sched/min_granularity_ns
- echo 100 > /proc/sys/kernel/sched_rr_timeslice_ms
- echo 1000000 > /proc/sys/kernel/sched_rt_period_us
- echo 990000 > /proc/sys/kernel/sched_rt_runtime_us
- echo 0 > /proc/sys/kernel/sched_schedstats
- echo 1 > /sys/kernel/debug/sched/tunable_scaling
- echo 50000000 > /sys/kernel/debug/sched/wakeup_granularity_ns
- echo 3000 > /proc/sys/vm/dirty_expire_centisecs
- echo 500 > /proc/sys/vm/dirty_writeback_centisecs
- echo 40 > /proc/sys/vm/dirty_ratio
- echo 10 > /proc/sys/vm/dirty_background_ratio
- echo 10 > /proc/sys/vm/swappiness
- echo 0 > /proc/sys/kernel/numa_balancing
- ulimit -n 1024000
- ulimit -v 800000000
- ulimit -m 800000000
- ulimit -l 800000000
- echo 274877906944 > /proc/sys/kernel/shmmax
- echo 274877906944 > /proc/sys/kernel/shmall
|
Notes |
None
|
Parts of Benchmark |
Controller
|
JVM Instance Description |
jvm_1
|
Command Line |
-server -Xms2g -Xmx2g -Xmn1536m -XX:+UseLargePages -XX:LargePageSizeInBytes=1G -XX:+UseParallelGC -XX:ParallelGCThreads=2
|
Tuning |
numactl used to interleave the controller amongst all available nodes, eg:
|
Notes |
None
|
Parts of Benchmark |
Backend
|
JVM Instance Description |
jvm_1
|
Command Line |
-XX:+UseParallelGC -XX:+UseLargePages -XX:+AlwaysPreTouch -XX:-UseAdaptiveSizePolicy -XX:MaxTenuringThreshold=15 -XX:InlineSmallCode=10k -verbose:gc -XX:-UseCountedLoopSafepoints -XX:LoopUnrollLimit=20 -server -XX:TargetSurvivorRatio=95 -XX:SurvivorRatio=28 -XX:LargePageSizeInBytes=1G -XX:MaxGCPauseMillis=500 -XX:AdaptiveSizeMajorGCDecayTimeScale=12 -XX:AdaptiveSizeDecrementScaleFactor=2 -XX:AllocatePrefetchLines=3 -XX:AllocateInstancePrefetchLines=2 -XX:AllocatePrefetchStepSize=128 -XX:AllocatePrefetchDistance=384 -Xms29g -Xmx29g -Xmn27g -XX:UseAVX=0 -XX:ParallelGCThreads=32 -XX:+UseHugeTLBFS
|
Tuning |
Used numactl to affinitize each Backend JVM to physcical cores in a NUMA node- Group1: numactl --physcpubind=0-15,128-143 --localalloc
- Group2:numactl --physcpubind=16-31,144-159 --localalloc
- Group3:numactl --physcpubind=32-47,160-175 --localalloc
- Group4:numactl --physcpubind=48-63,176-191 --localalloc
- Group5: numactl --physcpubind=64-79,192-207 --localalloc
- Group6:numactl --physcpubind=80-95,208-223 --localalloc
- Group7:numactl --physcpubind=96-111,224-239 --localalloc
- Group8:numactl --physcpubind=112-127,240-255 --localalloc
|
Notes |
None
|
Parts of Benchmark |
TxInjector
|
JVM Instance Description |
jvm_1
|
Command Line |
-server -Xms2g -Xmx2g -Xmn1536m -XX:+UseLargePages -XX:LargePageSizeInBytes=1G -XX:+UseParallelGC -XX:ParallelGCThreads=2
|
Tuning |
Used numactl to affinitize each TxInjector JVM to physcical cores in a NUMA node- Group1: numactl --physcpubind=0-15,128-143 --localalloc
- Group2:numactl --physcpubind=16-31,144-159 --localalloc
- Group3:numactl --physcpubind=32-47,160-175 --localalloc
- Group4:numactl --physcpubind=48-63,176-191 --localalloc
- Group5: numactl --physcpubind=64-79,192-207 --localalloc
- Group6:numactl --physcpubind=80-95,208-223 --localalloc
- Group7:numactl --physcpubind=96-111,224-239 --localalloc
- Group8:numactl --physcpubind=112-127,240-255 --localalloc
|
Notes |
None
|
|