Data Acquisition for Real-time Decision-making under Freshness - - PowerPoint PPT Presentation

data acquisition for real time decision making under
SMART_READER_LITE
LIVE PREVIEW

Data Acquisition for Real-time Decision-making under Freshness - - PowerPoint PPT Presentation

Data Acquisition for Real-time Decision-making under Freshness Constraints Shaohan Hu , Shuochao Yao, Haiming Jin, Yiran Zhao, Yitao Hu, Xiaochen Liu, Nooreddin Naghibolhosseini, Shen Li, Akash Kapoor, William Dron, Lu Su, Amotz Bar-Noy, Pedro


slide-1
SLIDE 1

Data Acquisition for Real-time Decision-making under Freshness Constraints

Shaohan Hu, Shuochao Yao, Haiming Jin, Yiran Zhao, Yitao Hu, Xiaochen Liu, Nooreddin Naghibolhosseini, Shen Li, Akash Kapoor, William Dron, Lu Su, Amotz Bar-Noy, Pedro Szekely, Ramesh Govindan, Reginald Hobbs, Tarek Abdelzaher

slide-2
SLIDE 2

2

slide-3
SLIDE 3

3

Chaotic, dynamic environments In response, need to decide… what course of action to take how to carry it out

slide-4
SLIDE 4

Road not blocked Shelter has vacancy Disease controlled Medic team staffed Comm. center up Food store not flooded

DECISION

4

slide-5
SLIDE 5

DECISION

OR AND AND

5

Road not blocked Shelter has vacancy Disease controlled Medic team staffed Comm. center up Food store not flooded

slide-6
SLIDE 6

DECISION

Decision Maker

6

slide-7
SLIDE 7

7

slide-8
SLIDE 8

8

Resource Limitation Environment Dynamics

slide-9
SLIDE 9

1

9

slide-10
SLIDE 10

10

slide-11
SLIDE 11

OR AND AND

11

slide-12
SLIDE 12

12

slide-13
SLIDE 13

Order?

13

slide-14
SLIDE 14

OR AND A B C AND D E

14

slide-15
SLIDE 15

OR AND A B C AND D E

15

Intuition

Maximize probability of short-circuiting per unit cost First examine the course of action that’s most likely to succeed Within a course of action, first examine the condition that’s most likely to fail

slide-16
SLIDE 16

OR AND A B C

AND D E

C P(True) 4 3 0.3 0.6

16

R (1-0.3)/4 =0.175 (1-0.6)/3 =0.133

slide-17
SLIDE 17

C(D->E) = 4+0.3(3) = 4.9 C(E->D) = 3+0.6(4) = 5.4

OR AND B C

AND D E

C P(True) 4 3 0.3 0.6

17

R (1-0.3)/4 =0.175 (1-0.6)/3 =0.133

slide-18
SLIDE 18

A B

OR AND

A B C

AND

D E

C P(True) 4.9 0.18 7 0.1

18

R 0.18/4.9=0.037 0.1/7=0.014

slide-19
SLIDE 19

A B

OR AND

A B C

AND

D E

C P(True) 4.9 0.18 7 0.1 C(L->R) = 7+(1-0.1)x4.9 = 11.41 C(R->L) = 4.9+(1-0.18)x7 = 10.64

19

R 0.18/4.9=0.037 0.1/7=0.014

slide-20
SLIDE 20

key: Short-circuit probability cost

20

slide-21
SLIDE 21

2

21

slide-22
SLIDE 22
slide-23
SLIDE 23

OR AND AND

23

slide-24
SLIDE 24

24

slide-25
SLIDE 25

25

retrieval latency freshness interval

{

{

slide-26
SLIDE 26

26

slide-27
SLIDE 27

27

Some random ordering

slide-28
SLIDE 28

28

expired

Some random ordering

slide-29
SLIDE 29

29

expired bad resolution

Some random ordering

slide-30
SLIDE 30

30

slide-31
SLIDE 31

31

Latest Deadline First (LDF)

slide-32
SLIDE 32

32

good resolution

Latest Deadline First (LDF)

slide-33
SLIDE 33

LDF - Latest Deadline First

Inspired by EDF: data objects with later freshness deadlines are retrieved sooner Optimal: if LDF cannot avoid freshness deadline violation, no sequential order can

33

slide-34
SLIDE 34

LDF , what about Cost?

34

slide-35
SLIDE 35

LDF, what about Cost?

35

slide-36
SLIDE 36

LDF, what about Cost?

LDF satisfies data freshness constraints

36

slide-37
SLIDE 37

LDF, what about Cost?

This might have lower expected cost

37

slide-38
SLIDE 38

LDF, what about Cost?

This might have lower expected cost

38

Greedily rearrange LDF order to reduce the expected data retrieval cost

slide-39
SLIDE 39

LDF , still Freshness Violations?

39

slide-40
SLIDE 40

LDF, still Freshness Violations?

40

slide-41
SLIDE 41

LDF, still Freshness Violations?

41

slide-42
SLIDE 42

LDF, still Freshness Violations?

42

slide-43
SLIDE 43

LDF, still Freshness Violations?

expired

43

slide-44
SLIDE 44

LDF, still Freshness Violations?

parallel retrieval

44

slide-45
SLIDE 45

LDF, still Freshness Violations?

good resolution

45

slide-46
SLIDE 46

LDF, still Freshness Violations?

good resolution

46

Increment parallel retrieval level until freshness constraints are met

slide-47
SLIDE 47

vLDF Data Retrieval

Compute LDF order Greedily rearrange LDF order to reduce the expected data retrieval cost Gradually increment parallel retrieval level until freshness constraints are met

47

slide-48
SLIDE 48

Evaluation

Simulation experiments An application scenario

48

slide-49
SLIDE 49

Simulation Experiments

Baselines

LCF - Lowest Cost Source First SCB - Shortcircuit Benefit only PbP - Probability based Prediction

Settings

% fast changing data: 40~100%, default 70% # Action size: 4~10, default 6 Data object size: 3~5 MB, default 3.45 MB Network bandwidth: 3.5~6.5 KBps, default 5 KBps Transmission latency fluctuation: -3~3 min, default 0

49

slide-50
SLIDE 50

Simulation Results

Varying % of fast changing data

50

40 50 60 70 80 90 100 50 100 150 Ratio of Fast−Changing Data Items (%) Retrieval Cost Ratio (%) vLDF SCB PbP LCF 40 50 60 70 80 90 100 20 40 60 80 100 Ratio of Fast−Changing Data Items (%) Request Resolution Rate (%) vLDF SCB PbP LCF

slide-51
SLIDE 51

Simulation Results

Varying action size

51

4 5 6 7 8 9 10 50 100 150 200 Number of Conditions per Action Retrieval Cost Ratio (%) vLDF SCB PbP LCF 4 5 6 7 8 9 10 20 40 60 80 100 Number of Conditions per Action Request Resolution Rate (%) vLDF SCB PbP LCF

slide-52
SLIDE 52

Simulation Results

Varying data object size

52

2000 2500 3000 3500 4000 4500 5000 50 100 150 Average Data Item Size (KB) Retrieval Cost Ratio (%) vLDF SCB PbP LCF 2000 2500 3000 3500 4000 4500 5000 20 40 60 80 100 Average Data Item Size (KB) Request Resolution Rate (%) vLDF SCB PbP LCF

slide-53
SLIDE 53

Simulation Results

Varying network bandwidth

53

3.5 4 4.5 5 5.5 6 6.5 50 100 150 Network Bandwidth (KBps) Retrieval Cost Ratio (%) vLDF SCB PbP LCF 3.5 4 4.5 5 5.5 6 6.5 20 40 60 80 100 Network Bandwidth (KBps) Request Resolution Rate (%) vLDF SCB PbP LCF

slide-54
SLIDE 54

Simulation Results

Varying network transmission fluctuation

54

−3 −2 −1 1 2 3 50 100 150 Network Transmission Delay Fluctuation − Mean Retrieval Cost Ratio (%) vLDF SCB PbP LCF −3 −2 −1 1 2 3 20 40 60 80 100 Network Transmission Delay Fluctuation − Mean Request Resolution Rate (%) vLDF SCB PbP LCF

slide-55
SLIDE 55

Application: Route Finding

Find routes for <src, dst> pairs Each candidate route: AND of its segments Routing result: OR of all candidate routes

Visual verification for route segment conditions

55

slide-56
SLIDE 56
slide-57
SLIDE 57
slide-58
SLIDE 58

Results of 5 Runs

vLDF Cost (KB) PbP Cost (KB) vLDF Time (s) PbP Time (s) 516 685 164 255 343 598 150 206 319 485 160 248 506 1093 165 372 524 1042 175 206

slide-59
SLIDE 59

Conclusion

Environment dynamics & resource limitations affect real-time decision-making Efficient data acquisition algorithm Promising results through simulations and concrete route finding application scenario

59

slide-60
SLIDE 60

Thanks