

SLIDE 1

Responding in a timely manner

Martin Thompson - @mjpt777

SLIDE 2
SLIDE 3

Hard Real-time

SLIDE 4
SLIDE 5

Soft Real-time

SLIDE 6
SLIDE 7

Squidgy Real-time

SLIDE 8
SLIDE 9

The Unaware

SLIDE 10
SLIDE 11
1. How to Test and Measure
2. A little bit of Theory
3. A little bit of Practice
4. Common Pitfalls
5. Useful Algorithms and Techniques
SLIDE 12

Test & Measure

SLIDE 13

System Under Test

SLIDE 14

Distributed Load Generation Agents System Under Test

SLIDE 15

Distributed Load Generation Agents System Under Test

SLIDE 16

Distributed Load Generation Agents System Under Test

SLIDE 17

Distributed Load Generation Agents System Under Test Observer

SLIDE 18

Pro Tip:

Set up a continuous performance testing environment

SLIDE 19

Pro Tip: Record Everything

SLIDE 20

Latency Histograms

SLIDE 21

Latency Histograms

Mode

SLIDE 22

Latency Histograms

Mode Median

SLIDE 23

Latency Histograms

Mode Median Mean

SLIDE 24

System: 1000 TPS, mean RT 50µs

SLIDE 25

System: 1000 TPS, mean RT 50µs What is the mean if you add in a 25ms GC pause per second?

SLIDE 26

System: 1000 TPS, mean RT 50µs What is the mean if you add in a 25ms GC pause per second?

~300µs
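
The slide's answer can be sanity-checked with a back-of-envelope model. The sketch below is mine, under one simple assumption: arrivals are uniform, so the ~25 requests landing inside a 25ms pause wait 12.5ms each on average. That model lands in the same few-hundred-microsecond range as the slide's figure; the exact number depends on how you model arrivals during the pause.

```java
// Back-of-envelope sketch (assumed model): a 25ms GC pause per second at
// 1000 TPS stalls the requests that arrive mid-pause, and their residual
// wait drags the mean far above the 50µs service time.
public class GcPauseMean {
    static double meanMicros(double tps, double serviceMicros, double pauseMicros) {
        double stalled = tps * (pauseMicros / 1_000_000.0); // requests arriving mid-pause
        double extraPerStalled = pauseMicros / 2.0;         // average residual wait
        return serviceMicros + (stalled * extraPerStalled) / tps;
    }

    public static void main(String[] args) {
        // 1000 TPS, 50µs service time, one 25ms pause per second
        System.out.printf("mean = %.1fµs%n", meanMicros(1000, 50, 25_000));
    }
}
```

A handful of stalled requests per second is enough to multiply the mean several times over, which is the slide's point.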

SLIDE 27
SLIDE 28

Forget averages, it’s all about percentiles
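
To make that concrete, here is a minimal percentile sketch (a hypothetical helper; in practice you would record into a histogram library such as HdrHistogram so that recording stays cheap). Outliers get smeared into the mean, but the upper percentiles report them directly.

```java
import java.util.Arrays;

// Minimal percentile sketch over recorded latencies (microseconds).
public class Percentiles {
    static long percentile(long[] latencies, double p) {
        long[] sorted = latencies.clone();
        Arrays.sort(sorted);
        int idx = (int) Math.ceil((p / 100.0) * sorted.length) - 1;
        return sorted[Math.max(idx, 0)];
    }

    public static void main(String[] args) {
        // 98 fast responses and two 25ms outliers
        long[] micros = new long[100];
        Arrays.fill(micros, 50);
        micros[98] = 25_000;
        micros[99] = 25_000;
        System.out.println("p50 = " + percentile(micros, 50)); // 50
        System.out.println("p99 = " + percentile(micros, 99)); // 25000
    }
}
```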

SLIDE 29

Source: Gil Tene (Azul Systems)

Coordinated Omission

SLIDE 30

Pro Tip: Don’t deceive yourself

SLIDE 31

Theory

SLIDE 32

Queuing Theory

[Chart: mean response time vs utilisation (0.0 to 1.0); response time climbs steeply as utilisation approaches 1.0]

SLIDE 33

Queuing Theory

Kendall Notation

M/D/1

SLIDE 34

Queuing Theory

r = s(2 − ρ) / (2(1 − ρ))

r = mean response time
s = service time
ρ = utilisation

SLIDE 35

Queuing Theory

r = s(2 − ρ) / (2(1 − ρ))

r = mean response time
s = service time
ρ = utilisation
Note: ρ = λ × s, i.e. mean arrival rate λ divided by the service rate 1/s
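
A short sketch evaluating the M/D/1 formula at a few utilisation levels (values illustrative) shows why the response-time curve in the chart turns into a hockey stick:

```java
// M/D/1 mean response time: r = s(2 − ρ) / (2(1 − ρ)).
public class MD1 {
    static double responseTime(double serviceTime, double utilisation) {
        return serviceTime * (2.0 - utilisation) / (2.0 * (1.0 - utilisation));
    }

    public static void main(String[] args) {
        // Response time as a multiple of the service time s
        for (double rho : new double[] {0.1, 0.5, 0.9, 0.99}) {
            System.out.printf("utilisation %.2f -> r = %.2f x s%n",
                rho, responseTime(1.0, rho));
        }
    }
}
```

At 50% utilisation the response time is only 1.5× the service time; at 90% it is already 5.5×, which is why the next Pro Tip is about capacity headroom.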

SLIDE 36

Queuing Theory

[Chart: mean response time vs utilisation (0.0 to 1.0); response time climbs steeply as utilisation approaches 1.0]

SLIDE 37

Pro Tip:

Ensure that you have sufficient capacity

SLIDE 38

Queuing Theory

Little’s Law: L = λ * W

L = mean queue length
λ = mean arrival rate
W = mean time in system
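
Little's Law in code (numbers illustrative, not from the talk). It also runs backwards: for a given arrival rate, bounding the queue length bounds the mean time in system, which is the basis of the next Pro Tip.

```java
// Little's Law: L = λ × W, and rearranged, W = L / λ.
public class Littles {
    static double meanInSystem(double arrivalRate, double meanTimeSeconds) {
        return arrivalRate * meanTimeSeconds; // L = λ × W
    }

    static double meanTimeForQueueBound(double arrivalRate, double queueLength) {
        return queueLength / arrivalRate;     // W = L / λ
    }

    public static void main(String[] args) {
        // 1000 requests/s spending 50ms each -> ~50 in flight
        System.out.println(meanInSystem(1000, 0.05));
        // Bounding the queue at 10 with 1000/s arrivals caps W at 10ms
        System.out.println(meanTimeForQueueBound(1000, 10));
    }
}
```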

SLIDE 39

Pro Tip:

Bound queues to meet response time SLAs

SLIDE 40

Can we go parallel to speedup?

SLIDE 41

Amdahl’s Law

[Diagram: a sequential process with stages A and B laid out on a time axis]

SLIDE 42

Amdahl’s Law

[Diagram: the sequential process vs a parallel process where stage A is split across workers while B stays serial]

SLIDE 43

Amdahl’s Law

[Diagram: the sequential process vs parallelising A, then also parallelising B; the remaining serial fraction bounds the achievable speedup]

SLIDE 44

Amdahl's Law

SLIDE 45

Universal Scalability Law

C(N) = N / (1 + α(N − 1) + β N (N − 1))

C = capacity or throughput
N = number of processors
α = contention penalty
β = coherence penalty
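
A sketch comparing the USL with Amdahl's Law (the α and β values here are illustrative, not measurements). With β = 0 the USL reduces to Amdahl's Law; any non-zero coherence penalty makes speedup peak and then fall as N grows, which is what the chart on the next slide shows.

```java
// Universal Scalability Law: C(N) = N / (1 + α(N − 1) + βN(N − 1)).
public class Usl {
    static double capacity(int n, double alpha, double beta) {
        return n / (1.0 + alpha * (n - 1) + beta * n * (n - 1));
    }

    public static void main(String[] args) {
        for (int n : new int[] {1, 16, 64, 256}) {
            System.out.printf("N=%3d  Amdahl=%6.2f  USL=%6.2f%n",
                n, capacity(n, 0.05, 0.0),      // β = 0: pure Amdahl
                   capacity(n, 0.05, 0.0001));  // small coherence penalty
        }
    }
}
```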

SLIDE 46

Universal Scalability Law

[Chart: speedup vs processors (1 to 1024, log scale); the Amdahl curve plateaus while the USL curve peaks and then declines]

SLIDE 47

What about the service time?

SLIDE 48

Order of Algorithms

SLIDE 49

Practice

SLIDE 50
SLIDE 51
SLIDE 52
SLIDE 53
SLIDE 54
SLIDE 55

Pitfalls

SLIDE 56

Modern Processors

P & C States???

Hyperthreading? SMIs?

SLIDE 57

Non-Uniform Memory Architecture (NUMA)

[Diagram: dual-socket NUMA topology: cores C1…Cn per socket with private L1/L2 and a shared L3, memory controllers to local DRAM, QPI links between sockets, and PCI-e 3 lanes for IO]

Approximate access costs (assuming a 3GHz processor):
Registers/Buffers: <1ns
L1: ~4 cycles, ~1ns
L2: ~12 cycles, ~3ns
L3: ~40 cycles, ~15ns (~60 cycles, ~20ns for a dirty hit)
DRAM: ~65ns
QPI hop to the remote socket: ~40ns

SLIDE 58

Virtual Memory Management

Transparent Huge Pages
Page Flushing & IO Scheduling
vm.min_free_kbytes
Swap???

SLIDE 59

Safepoints in the JVM

Garbage Collection, De-optimisation, Biased Locking, Stack traces, etc.

SLIDE 60

Virtualization

System Calls

SLIDE 61

Notification

public class SomethingUseful
{
    // Lots of useful stuff

    public void handOffSomeWork()
    {
        // prepare for handoff
        synchronized (this)
        {
            someObject.notify();
        }
    }
}

SLIDE 62

Notification

public class SomethingUseful
{
    // Lots of useful stuff

    public void handOffSomeWork()
    {
        // prepare for handoff
        synchronized (this)
        {
            someObject.notify(); // holds this's monitor, but notify() requires
        }                        // someObject's monitor: unless someObject is
    }                            // this, it throws IllegalMonitorStateException
}
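
One way to repair the hand-off, sketched under the assumption that a consumer thread waits on the same monitor (class and field names are mine): notify on the object whose monitor you actually hold, and guard the wait with a condition so spurious wake-ups are harmless.

```java
// Corrected hand-off sketch: lock, condition flag, and notify all use
// the same monitor object.
public class WorkHandOff {
    private final Object lock = new Object();
    private boolean workReady = false;

    public void handOffSomeWork() {
        synchronized (lock) {    // must own lock's monitor to notify on it
            workReady = true;
            lock.notify();
        }
    }

    public void awaitWork() throws InterruptedException {
        synchronized (lock) {
            while (!workReady) { // guard against spurious wake-ups
                lock.wait();
            }
            workReady = false;
        }
    }
}
```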

SLIDE 63

Law of Leaky Abstractions

“All non-trivial abstractions, to some extent, are leaky.”

  • Joel Spolsky
SLIDE 64

Law of Leaky Abstractions

“The detail of underlying complexity cannot be ignored.”

SLIDE 65

Mechanical Sympathy

SLIDE 66

Responding in the presence of failure

SLIDE 67

Algorithms & Techniques

SLIDE 68

Clean Room Experiments

  • sufficient CPUs
  • intel_idle.max_cstate=0
  • cpufreq
  • isolcpus
  • numactl, cgroups, affinity
  • “Washed” SSDs
  • network buffer sizing
  • jHiccup
  • tune your stack!
  • Mechanical Sympathy
SLIDE 69

Profiling

SLIDE 70

Pro Tip:

Incorporate telemetry and histograms

SLIDE 71

Smart Batching

[Chart: latency vs load; the typical curve rises with load, while smart batching makes a near-flat curve possible]

SLIDE 72

Smart Batching

Producers

SLIDE 73

Smart Batching

[Diagram: producers feeding a batcher, which amortises the expensive costs across each batch]

SLIDE 74

Pro Tip:

Amortise the Expensive Costs
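
A minimal smart-batching sketch (class names and the "expensive op" counter are mine): a single draining consumer takes everything queued since its last pass and pays the expensive cost (a syscall, flush, or network write) once per batch instead of once per item, so the batch size adapts naturally to load.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Queue;
import java.util.concurrent.ConcurrentLinkedQueue;

// Smart batching: drain all pending items, pay the expensive cost once.
public class SmartBatcher {
    private final Queue<String> queue = new ConcurrentLinkedQueue<>();
    private int expensiveOps = 0; // e.g. flushes/syscalls performed

    public void offer(String item) {
        queue.add(item);
    }

    // Called in a loop by the single consumer thread.
    public int drainAndProcess() {
        List<String> batch = new ArrayList<>();
        String item;
        while ((item = queue.poll()) != null) {
            batch.add(item);
        }
        if (!batch.isEmpty()) {
            expensiveOps++; // one expensive operation for the whole batch
        }
        return batch.size();
    }

    public int expensiveOps() {
        return expensiveOps;
    }
}
```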

SLIDE 75

Applying Backpressure

[Diagram: customers connect through gateway services to a transaction service and storage; backpressure is applied at each stage's threads, network stacks, and IO boundaries]
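
A bounded-queue sketch of applying backpressure between two stages (class and method names are mine): when the downstream stage falls behind, offer() fails fast and the caller can push the signal back upstream by slowing the producer, shedding load, or returning "busy".

```java
import java.util.concurrent.ArrayBlockingQueue;
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.TimeUnit;

// Backpressure via a bounded hand-off queue between pipeline stages.
public class BackpressureStage {
    private final BlockingQueue<Runnable> inbox;

    public BackpressureStage(int capacity) {
        this.inbox = new ArrayBlockingQueue<>(capacity);
    }

    // Returns false when the stage is saturated: apply backpressure upstream.
    public boolean trySubmit(Runnable task) {
        return inbox.offer(task);
    }

    // Consumer side: take the next task, waiting up to the timeout.
    public Runnable takeNext(long timeoutMs) throws InterruptedException {
        return inbox.poll(timeoutMs, TimeUnit.MILLISECONDS);
    }
}
```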

SLIDE 76

Non-Blocking Design

“Get out of your own way!”

  • Don’t hog any resource
  • Always try to make progress
  • Enables Smart Batching
SLIDE 77

Pro Tip:

Beware of hogging resources in synchronous designs

SLIDE 78

Lock-Free Concurrent Algorithms

  • Agree protocols of interaction
  • Don’t get a 3rd party involved, i.e. the OS
  • Keep to user-space
  • Beat the “notify()” problem

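As one example of agreeing a protocol and staying in user-space, here is a single-producer/single-consumer ring buffer sketch (a simplified cousin of a Disruptor-style design; the details are mine). The two threads coordinate purely through sequence counters, with no locks and no notify().

```java
import java.util.concurrent.atomic.AtomicLong;

// SPSC ring buffer: one producer thread, one consumer thread, no OS calls.
public class SpscRing {
    private final Object[] buffer;
    private final int mask;
    private final AtomicLong head = new AtomicLong(); // next slot to read
    private final AtomicLong tail = new AtomicLong(); // next slot to write

    public SpscRing(int capacityPowerOfTwo) {
        buffer = new Object[capacityPowerOfTwo];
        mask = capacityPowerOfTwo - 1;
    }

    public boolean offer(Object e) {                // producer thread only
        long t = tail.get();
        if (t - head.get() == buffer.length) {
            return false;                           // full: caller backs off
        }
        buffer[(int) (t & mask)] = e;
        tail.lazySet(t + 1);                        // publish to the consumer
        return true;
    }

    public Object poll() {                          // consumer thread only
        long h = head.get();
        if (h == tail.get()) {
            return null;                            // empty
        }
        Object e = buffer[(int) (h & mask)];
        buffer[(int) (h & mask)] = null;
        head.lazySet(h + 1);                        // free the slot
        return e;
    }
}
```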
SLIDE 79

Observable State Machines

SLIDE 80

Pro Tip:

Observable state machines make monitoring easy
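
A sketch of what "observable" can mean in practice (names are mine): the current state lives in a single volatile field, so a monitoring or telemetry thread can sample it at any time without locks or any coordination with the worker thread.

```java
// Observable state machine: state and counters readable from any thread.
public class Connection {
    public enum State { IDLE, CONNECTING, ESTABLISHED, CLOSED }

    private volatile State state = State.IDLE;
    private volatile long transitions = 0;

    // Single-writer: only the worker thread transitions the machine,
    // so the non-atomic increment on the volatile counter is safe here.
    public void transitionTo(State next) {
        state = next;
        transitions++;
    }

    // Safe to call from a monitoring thread at any time.
    public State currentState() {
        return state;
    }

    public long transitionCount() {
        return transitions;
    }
}
```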

SLIDE 81

Cluster for Response and Resilience

Service A Service A Sequencer

SLIDE 82

Cluster for Response and Resilience

Service A Service A Sequencer

SLIDE 83

Cluster for Response and Resilience

Service A Service A Service N Sequencer

SLIDE 84

Data Structures and O(?) Models

Is there a world beyond maps and lists?

SLIDE 85

In closing…

SLIDE 86
SLIDE 87
SLIDE 88

The Internet of Things (IoT)

“There will be X connected devices by 2020...” (where X is 20 to 75 billion)

SLIDE 89

If you cannot control arrival rates...

SLIDE 90

...you have to think hard about improving service times!

SLIDE 91

...and/or you have to think hard about removing all contention!

SLIDE 92

Questions?

Blog: http://mechanical-sympathy.blogspot.com/
Twitter: @mjpt777

“It does not matter how intelligent you are, if you guess and that guess cannot be backed up by experimental evidence – then it is still a guess.”

  • Richard Feynman