OS Images |
os_Image_1(1)
|
Hardware Description |
hw_1
|
Number of Systems |
1
|
SW Environment |
non-virtual
|
Tuning |
BIOS Settings:
- NUMA nodes per socket = NPS4
- Determinism Control = Manual
- Determinism Slider = Power
- L1 Stream HW Prefetcher = Disable
- L2 Stream HW Prefetcher = Disable
- xGMI Link Configuration = 4 xGMI Links
- 4 Link xGMI max speed = 32Gbps
- TDP Control = Manual
- TDP = 400
- PPT Control = Manual
- PPT = 400
|
Notes |
notes
|
|
JVM Instances |
jvm_Ctr_1(1), jvm_Backend_1(16), jvm_TxInjector_1(16)
|
OS Image Description |
os_1
|
Tuning |
- cpupower -c all frequency-set -g performance
- tuned-adm profile throughput-performance
- echo 960000 > /proc/sys/kernel/sched_rt_runtime_us
- echo 40000000 > /proc/sys/kernel/sched_latency_ns
- echo 40000 > /proc/sys/kernel/sched_migration_cost_ns
- echo 800000000 > /proc/sys/kernel/sched_min_granularity_ns
- echo 200000000 > /proc/sys/kernel/sched_wakeup_granularity_ns
- echo 9000 > /proc/sys/kernel/sched_nr_migrate
- echo 10000 > /proc/sys/vm/dirty_expire_centisecs
- echo 1500 > /proc/sys/vm/dirty_writeback_centisecs
- echo 40 > /proc/sys/vm/dirty_ratio
- echo 10 > /proc/sys/vm/dirty_background_ratio
- echo 10 > /proc/sys/vm/swappiness
- echo 0 > /proc/sys/kernel/numa_balancing
- echo 0 > /proc/sys/vm/numa_stat
- echo always > /sys/kernel/mm/transparent_hugepage/enabled
- echo always > /sys/kernel/mm/transparent_hugepage/defrag
- ulimit -n 1024000
- UserTasksMax=970000
- DefaultTasksMax=970000
|
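The OS tuning values above can be verified without changing anything on the system. The following is a minimal read-only sketch, not part of the disclosed run: `check_setting` is a hypothetical helper that compares a tunable file against the listed value. It assumes the paths exist as written above; on some newer kernels several scheduler tunables are no longer under /proc/sys/kernel, in which case the helper reports them as MISSING rather than failing.

```shell
#!/bin/sh
# Hypothetical helper: compare one tunable file against an expected value.
# Read-only; nothing is written to /proc or /sys.
check_setting() {
    path=$1
    expected=$2
    if [ ! -r "$path" ]; then
        echo "MISSING  $path"
        return 0
    fi
    actual=$(cat "$path")
    case "$path" in
        */transparent_hugepage/*)
            # THP files list all choices and bracket the active one,
            # e.g. "[always] madvise never"; keep only the bracketed word.
            actual=$(printf '%s\n' "$actual" | sed -n 's/.*\[\(.*\)\].*/\1/p')
            ;;
    esac
    if [ "$actual" = "$expected" ]; then
        echo "OK       $path = $actual"
    else
        echo "MISMATCH $path = $actual (expected $expected)"
    fi
}

# A few of the values from the tuning list above.
check_setting /proc/sys/kernel/sched_rt_runtime_us 960000
check_setting /proc/sys/vm/dirty_expire_centisecs 10000
check_setting /proc/sys/vm/dirty_ratio 40
check_setting /proc/sys/vm/swappiness 10
check_setting /proc/sys/kernel/numa_balancing 0
check_setting /sys/kernel/mm/transparent_hugepage/enabled always
check_setting /sys/kernel/mm/transparent_hugepage/defrag always
```

The remaining entries in the list can be checked the same way by adding one `check_setting` line per path.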
Notes |
None
|
Parts of Benchmark |
Controller
|
JVM Instance Description |
jvm_1
|
Command Line |
-Xms2g -Xmx2g -Xmn1536m -XX:+UseParallelGC -XX:ParallelGCThreads=2
|
Tuning |
None
|
Notes |
Used numactl to interleave memory across all NUMA nodes
|
Parts of Benchmark |
Backend
|
JVM Instance Description |
jvm_1
|
Command Line |
-Xms31g -Xmx31g -Xmn29g -XX:AllocatePrefetchInstr=2 -XX:+UseParallelGC -XX:ParallelGCThreads=8 -XX:LargePageSizeInBytes=2m -XX:-UseAdaptiveSizePolicy -XX:+AlwaysPreTouch -XX:+UseLargePages -XX:SurvivorRatio=28 -XX:TargetSurvivorRatio=95 -XX:MaxTenuringThreshold=15 -XX:InlineSmallCode=11k -XX:MaxGCPauseMillis=300 -XX:LoopUnrollLimit=200 -XX:AdaptiveSizeMajorGCDecayTimeScale=12 -XX:AdaptiveSizeDecrementScaleFactor=2 -XX:+UseTransparentHugePages -XX:TLABAllocationWeight=55 -XX:ThreadStackSize=512
|
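As a rough illustration of what the Backend heap flags imply, the arithmetic below works out the approximate generation sizes from -Xmx31g, -Xmn29g and -XX:SurvivorRatio=28. It is a sketch under two assumptions: that the Parallel collector sizes each survivor space as the young generation divided by (SurvivorRatio + 2), and that rounding to the JVM's internal alignment is ignored, so the real sizes will differ slightly. The count of 16 Backend JVMs comes from the JVM Instances row above.

```shell
#!/bin/sh
# Approximate heap layout for one Backend JVM, in MiB.
# Assumption: survivor = young / (SurvivorRatio + 2); alignment is ignored.
heap_mib=$(( 31 * 1024 ))       # -Xms31g -Xmx31g
young_mib=$(( 29 * 1024 ))      # -Xmn29g
survivor_ratio=28               # -XX:SurvivorRatio=28

old_mib=$(( heap_mib - young_mib ))
survivor_mib=$(( young_mib / (survivor_ratio + 2) ))
eden_mib=$(( young_mib - 2 * survivor_mib ))

# Combined maximum heap of all 16 Backend JVMs, in GiB.
total_backend_gib=$(( 16 * 31 ))

echo "old generation:  ${old_mib} MiB"
echo "each survivor:   ${survivor_mib} MiB (approx.)"
echo "eden:            ${eden_mib} MiB (approx.)"
echo "all Backends:    ${total_backend_gib} GiB of heap"
```

The small old generation relative to eden is consistent with the high -XX:MaxTenuringThreshold and -XX:TargetSurvivorRatio values, which keep objects in the young generation as long as possible.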
Tuning |
None
|
Notes |
Used numactl to affinitize each Backend JVM to 4 cores / 8 threads:
- Group1: --physcpubind=0-3,64-67 --localalloc
- Group2: --physcpubind=4-7,68-71 --localalloc
- Group3: --physcpubind=8-11,72-75 --localalloc
- Group4: --physcpubind=12-15,76-79 --localalloc
- Group5: --physcpubind=16-19,80-83 --localalloc
- Group6: --physcpubind=20-23,84-87 --localalloc
- Group7: --physcpubind=24-27,88-91 --localalloc
- Group8: --physcpubind=28-31,92-95 --localalloc
- Group9: --physcpubind=32-35,96-99 --localalloc
- Group10: --physcpubind=36-39,100-103 --localalloc
- Group11: --physcpubind=40-43,104-107 --localalloc
- Group12: --physcpubind=44-47,108-111 --localalloc
- Group13: --physcpubind=48-51,112-115 --localalloc
- Group14: --physcpubind=52-55,116-119 --localalloc
- Group15: --physcpubind=56-59,120-123 --localalloc
- Group16: --physcpubind=60-63,124-127 --localalloc
|
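The 16 Backend bindings above follow a regular pattern: group N takes four consecutive logical CPUs starting at (N - 1) * 4, plus the CPUs numbered 64 higher. The sketch below regenerates the list from that pattern. `backend_binding` is a hypothetical helper name, and the offset of 64 between sibling threads is inferred from the listed bindings rather than stated in the report.

```shell
#!/bin/sh
# Hypothetical helper: print the numactl options for one Backend group.
# Assumes logical CPUs n and n+64 are SMT siblings, as the list above implies.
backend_binding() {
    group=$1
    first=$(( (group - 1) * 4 ))
    last=$(( first + 3 ))
    echo "--physcpubind=${first}-${last},$(( first + 64 ))-$(( last + 64 )) --localalloc"
}

# Regenerate the full list for groups 1 through 16.
g=1
while [ "$g" -le 16 ]; do
    echo "Group${g}: $(backend_binding "$g")"
    g=$(( g + 1 ))
done
```

On the actual system, `lscpu -e` or `numactl --hardware` can confirm the CPU numbering before relying on the offset.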
Parts of Benchmark |
TxInjector
|
JVM Instance Description |
jvm_1
|
Command Line |
-Xms2g -Xmx2g -Xmn1536m -XX:+UseParallelGC -XX:ParallelGCThreads=2
|
Tuning |
None
|
Notes |
Used numactl to affinitize each Transaction Injector JVM to 2 Core/2 Threads:
- Group1: --physcpubind=0,64 --localalloc
- Group2: --physcpubind=4,68 --localalloc
- Group3: --physcpubind=8,72 --localalloc
- Group4: --physcpubind=12,76 --localalloc
- Group5: --physcpubind=16,80 --localalloc
- Group6: --physcpubind=20,84 --localalloc
- Group7: --physcpubind=24,88 --localalloc
- Group8: --physcpubind=28,92 --localalloc
- Group9: --physcpubind=32,96 --localalloc
- Group10: --physcpubind=36,100 --localalloc
- Group11: --physcpubind=40,104 --localalloc
- Group12: --physcpubind=44,108 --localalloc
- Group13: --physcpubind=48,112 --localalloc
- Group14: --physcpubind=52,116 --localalloc
- Group15: --physcpubind=56,120 --localalloc
- Group16: --physcpubind=60,124 --localalloc
|
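Each Transaction Injector binding above uses the first logical CPU of the matching Backend group plus the CPU numbered 64 higher. A minimal sketch that reproduces the list from that pattern follows; `injector_binding` is a hypothetical helper name, and the stride of 4 and offset of 64 are read off the listed values, not taken from any tool output.

```shell
#!/bin/sh
# Hypothetical helper: print the numactl options for one TxInjector group.
# Group N uses CPU (N - 1) * 4 and the CPU 64 above it, matching the list.
injector_binding() {
    group=$1
    cpu=$(( (group - 1) * 4 ))
    echo "--physcpubind=${cpu},$(( cpu + 64 )) --localalloc"
}

# Regenerate the full list for groups 1 through 16.
g=1
while [ "$g" -le 16 ]; do
    echo "Group${g}: $(injector_binding "$g")"
    g=$(( g + 1 ))
done
```

Because each injector's CPUs fall inside the CPU set of the same-numbered Backend group, the injector and its Backend share cores and, with --localalloc, allocate memory from the same local node.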
|