JVM Instances |
jvm_Ctr_1(1), jvm_Backend_1(16), jvm_TxInjector_1(16)
|
OS Image Description |
os_1
|
Tuning |
- cpupower frequency-set -g performance
- tuned-adm profile throughput-performance
- echo 950000 > /proc/sys/kernel/sched_rt_runtime_us
- echo 20000000 > /proc/sys/kernel/sched_latency_ns
- echo 40000 > /proc/sys/kernel/sched_migration_cost_ns
- echo 200000 > /proc/sys/kernel/sched_min_granularity_ns
- echo 40000 > /proc/sys/kernel/sched_wakeup_granularity_ns
- echo 128 > /proc/sys/kernel/sched_nr_migrate
- echo 10000 > /proc/sys/vm/dirty_expire_centisecs
- echo 1500 > /proc/sys/vm/dirty_writeback_centisecs
- echo 40 > /proc/sys/vm/dirty_ratio
- echo 10 > /proc/sys/vm/dirty_background_ratio
- echo 10 > /proc/sys/vm/swappiness
- echo 0 > /proc/sys/kernel/numa_balancing
- echo always > /sys/kernel/mm/transparent_hugepage/defrag
- echo always > /sys/kernel/mm/transparent_hugepage/enabled
- ulimit -n 1024000
- UserTasksMax=970000
- DefaultTasksMax=970000
|
Notes |
None
|
Parts of Benchmark |
Controller
|
JVM Instance Description |
jvm_1
|
Command Line |
-Xms2g -Xmx2g -Xmn1536m -XX:+UseParallelGC -XX:ParallelGCThreads=2 -XX:CICompilerCount=4
|
Tuning |
numactl --interleave=all
|
Notes |
None
|
Parts of Benchmark |
Backend
|
JVM Instance Description |
jvm_1
|
Command Line |
-Xms120g -Xmx120g -Xmn110g -server -XX:MetaspaceSize=512m -XX:AllocatePrefetchInstr=2 -XX:-UsePerfData -XX:-UseAdaptiveSizePolicy -XX:+AlwaysPreTouch -XX:+UseParallelGC -XX:SurvivorRatio=100 -XX:TargetSurvivorRatio=95 -XX:ParallelGCThreads=30 -XX:MaxTenuringThreshold=15 -XX:InitialCodeCacheSize=25m -XX:InlineSmallCode=10k -XX:MaxGCPauseMillis=200 -XX:+UseCompressedOops -XX:ObjectAlignmentInBytes=32 -XX:+UseTransparentHugePages -XX:TLABAllocationWeight=55 -XX:ThreadStackSize=512 -XX:CompileThresholdScaling=120 -XX:UseAVX=0
|
Tuning |
Used numactl to affinitize two Backend JVM (2 Groups) to one NUMA node- Group1: numactl --cpunodebind=0 --localalloc
- Group2: numactl --cpunodebind=0 --localalloc
- Group3: numactl --cpunodebind=1 --localalloc
- Group4: numactl --cpunodebind=1 --localalloc
- Group5: numactl --cpunodebind=2 --localalloc
- Group6: numactl --cpunodebind=2 --localalloc
- Group7: numactl --cpunodebind=3 --localalloc
- Group8: numactl --cpunodebind=3 --localalloc
- Group9: numactl --cpunodebind=4 --localalloc
- Group10: numactl --cpunodebind=4 --localalloc
- Group11: numactl --cpunodebind=5 --localalloc
- Group12: numactl --cpunodebind=5 --localalloc
- Group13: numactl --cpunodebind=6 --localalloc
- Group14: numactl --cpunodebind=6 --localalloc
- Group15: numactl --cpunodebind=7 --localalloc
- Group16: numactl --cpunodebind=7 --localalloc
|
Notes |
None
|
Parts of Benchmark |
TxInjector
|
JVM Instance Description |
jvm_1
|
Command Line |
-Xms2g -Xmx2g -Xmn1536m -XX:+UseParallelGC -XX:ParallelGCThreads=2 -XX:CICompilerCount=4
|
Tuning |
Used numactl to affinitize two TxInjector JVM (2 Groups) to one NUMA node- Group1: numactl --cpunodebind=0 --localalloc
- Group2: numactl --cpunodebind=0 --localalloc
- Group3: numactl --cpunodebind=1 --localalloc
- Group4: numactl --cpunodebind=1 --localalloc
- Group5: numactl --cpunodebind=2 --localalloc
- Group6: numactl --cpunodebind=2 --localalloc
- Group7: numactl --cpunodebind=3 --localalloc
- Group8: numactl --cpunodebind=3 --localalloc
- Group9: numactl --cpunodebind=4 --localalloc
- Group10: numactl --cpunodebind=4 --localalloc
- Group11: numactl --cpunodebind=5 --localalloc
- Group12: numactl --cpunodebind=5 --localalloc
- Group13: numactl --cpunodebind=6 --localalloc
- Group14: numactl --cpunodebind=6 --localalloc
- Group15: numactl --cpunodebind=7 --localalloc
- Group16: numactl --cpunodebind=7 --localalloc
|
Notes |
None
|