|
JVM Instances |
jvm_Ctr_1(1), jvm_Backend_1(8), jvm_TxInjector_1(8)
|
OS Image Description |
os_1
|
Tuning |
- tuned-adm profile throughput-performance
- ulimit -n 1024000
- echo 10000 > /proc/sys/kernel/sched_cfs_bandwidth_slice_us
- echo 0 > /proc/sys/kernel/sched_child_runs_first
- echo 56000000 > /sys/kernel/debug/sched/latency_ns
- echo 1000 > /sys/kernel/debug/sched/migration_cost_ns
- echo 16000000 > /sys/kernel/debug/sched/min_granularity_ns
- echo 9 > /sys/kernel/debug/sched/nr_migrate
- echo 100 > /proc/sys/kernel/sched_rr_timeslice_ms
- echo 1000000 > /proc/sys/kernel/sched_rt_period_us
- echo 990000 > /proc/sys/kernel/sched_rt_runtime_us
- echo 0 > /proc/sys/kernel/sched_schedstats
- echo 1 > /sys/kernel/debug/sched/tunable_scaling
- echo 50000000 > /sys/kernel/debug/sched/wakeup_granularity_ns
- echo 3000 > /proc/sys/vm/dirty_expire_centisecs
- echo 500 > /proc/sys/vm/dirty_writeback_centisecs
- echo 40 > /proc/sys/vm/dirty_ratio
- echo 10 > /proc/sys/vm/dirty_background_ratio
- echo 10 > /proc/sys/vm/swappiness
- echo 0 > /proc/sys/kernel/numa_balancing
- echo always > /sys/kernel/mm/transparent_hugepage/defrag
- echo always > /sys/kernel/mm/transparent_hugepage/enabled
- ulimit -v 800000000
- ulimit -m 800000000
- ulimit -l 800000000
- echo 300 > /sys/kernel/mm/hugepages/hugepages-1048576kB/nr_hugepages
- echo 8000 > /sys/kernel/mm/hugepages/hugepages-2048kb/nr_hugepages
- systemctl stop systemed-update-utmp-runlevel.service
- echo 274877906944 > /proc/sys/kernel/shmmax
- echo 274877906944 > /proc/sys/kernel/shmall
|
Notes |
None
|
Parts of Benchmark |
Controller
|
JVM Instance Description |
jvm_1
|
Command Line |
-server -Xms2g -Xmx2g -Xmn1536m -XX:+UseLargePages -XX:LargePageSizeInBytes=1G -XX:+UseParallelGC -XX:ParallelGCThreads=2
|
Tuning |
Used numactl to interleave memory on all CPUs
|
Notes |
None
|
Parts of Benchmark |
Backend
|
JVM Instance Description |
jvm_1
|
Command Line |
-XX:+UseParallelGC -XX:+UseLargePages -XX:+AlwaysPreTouch -XX:-UseAdaptiveSizePolicy -XX:MaxTenuringThreshold=15 -XX:InlineSmallCode=10k -verbose:gc -XX:-UseCountedLoopSafepoints -XX:LoopUnrollLimit=20 -server -XX:TargetSurvivorRatio=95 -XX:SurvivorRatio=28 -XX:LargePageSizeInBytes=1G -XX:MaxGCPauseMillis=500 -XX:AdaptiveSizeMajorGCDecayTimeScale=12 -XX:AdaptiveSizeDecrementScaleFactor=2 -XX:AllocatePrefetchLines=3 -XX:AllocateInstancePrefetchLines=2 -XX:AllocatePrefetchStepSize=128 -XX:AllocatePrefetchDistance=384 -Xms29g -Xmx29g -Xmn27g -XX:UseAVX=0 -XX:ParallelGCThreads=14 -XX:+UseHugeTLBFS
|
Tuning |
Used numactl to affinitize each Backend JVM to physical cores in a NUMA node. - Group1: numactl --physcpubind=0-6,56-62 --localalloc
- Group2: numactl --physcpubind=7-13,63-69 --localalloc
- Group3: numactl --physcpubind=14-20,70-76 --localalloc
- Group4: numactl --physcpubind=21-27,77-83 --localalloc
- Group5: numactl --physcpubind=28-34,84-90 --localalloc
- Group6: numactl --physcpubind=35-41,91-97 --localalloc
- Group7: numactl --physcpubind=42-48,98-104 --localalloc
- Group8: numactl --physcpubind=49-55,105-111 --localalloc
|
Notes |
None
|
Parts of Benchmark |
TxInjector
|
JVM Instance Description |
jvm_1
|
Command Line |
-server -Xms2g -Xmx2g -Xmn1536m -XX:+UseLargePages -XX:LargePageSizeInBytes=1G -XX:+UseParallelGC -XX:ParallelGCThreads=2
|
Tuning |
Used numactl to affinitize each Injector JVM to physical cores in a NUMA node. - Group1: numactl --physcpubind=0-6,56-62 --localalloc
- Group2: numactl --physcpubind=7-13,63-69 --localalloc
- Group3: numactl --physcpubind=14-20,70-76 --localalloc
- Group4: numactl --physcpubind=21-27,77-83 --localalloc
- Group5: numactl --physcpubind=28-34,84-90 --localalloc
- Group6: numactl --physcpubind=35-41,91-97 --localalloc
- Group7: numactl --physcpubind=42-48,98-104 --localalloc
- Group8: numactl --physcpubind=49-55,105-111 --localalloc
|
Notes |
None
|
|