Near-Optimal Adaptive Control
- f a Large Grid Application
Det Buaklee Greg Tracy Mary Vernon Steve Wright
Computer Science Department University of Wisconsin - Madison
Near-Optimal Adaptive Control of a Large Grid Application Det - - PowerPoint PPT Presentation
Near-Optimal Adaptive Control of a Large Grid Application Det Buaklee Greg Tracy Mary Vernon Steve Wright Computer Science Department University of Wisconsin - Madison Talk Outline Condor Stochastic Optimization, ATR ATR
Computer Science Department University of Wisconsin - Madison
ICS’02 New York City June 26, 2002 [2]
ICS’02 New York City June 26, 2002 [3]
– Submitting node is the “master” node – Condor dynamically allocates “worker” nodes – Worker nodes can drop out during computation (min,max)
Communication Link
ICS’02 New York City June 26, 2002 [4]
– Large number of possible scenarios for the data – Arises in planning-under-uncertainty applications
– aim to find the x that optimizes expected model performance over all the scenarios
ICS’02 New York City June 26, 2002 [5]
Q(x)
i=1 N
–Maybe sampled from the full set of scenarios –Increase N to improve the accuracy
ICS’02 New York City June 26, 2002 [6]
N = 16 = number of scenarios evaluated G = 4 = number of task groups T = 8 = number of tasks per iteration
For each Iteration
workers
ICS’02 New York City June 26, 2002 [7]
– Automated process – Fast/simple runtime computation
ICS’02 New York City June 26, 2002 [8]
Master Workers
ICS’02 New York City June 26, 2002 [9]
– L (network latency) – o (message processing overhead) – G (gap per byte - Bandwidth) – P (number of Processors
ICS’02 New York City June 26, 2002 [10]
1.35 2.69 5.19 10.36 10.56 20.54 avg Worker Execution Time (sec) 13.27 0.05 3.33 21 2411 3.30 6.74 400 400 6.25 0.03 2.25 25 2092 3.84 6.12 200 200 3.41 0.05 1.57 31 1162 3.40 5.94 100 100 7.60 0.05 2.42 32 1936 3.64 6.83 100 50 3.05 0.01 1.32 47 1405 3.56 6.04 50 50 2.06 0.01 0.38 82 915 3.36 6.51 25 25 max min avg num it. max min avg Master Time to Compute a New Iterate, x (sec) Master Time to Update Model Function m(x) (msec) T G
ICS’02 New York City June 26, 2002 [11]
– Number of scenarios evaluated – Processor speed
10 20 30 40 50 200 400 600 800 1000 1200
Number of Scenarios Evaluated (N/G) Worker Execution Time (sec)
MIPS 600 MIPS 780 MIPS 1100 MIPS 1700
ICS’02 New York City June 26, 2002 [12]
Updating m(x) after each task group (G) returns
– Excessive default debug I/O – Interference from Condor administrative tasks
i.e., negligible
0.01 0.1 1 10 100 1000 1000 2000 3000 4000
Worker Completion Event Count Time to Update m (x ) (msec)
lightly loaded master, default debug level lightly loaded master, reduced debug level isolated master, reduced debug level
ICS’02 New York City June 26, 2002 [13]
1 2 3 4 5 6 7 10 20 30 40 Iteration Number Time to Compute New x (sec) T = 200 T = 100 5 10 15 20 25 30 200 400 600 800 Iteration Number Time to Compute New x (sec)
SSN network design problem 20term problem
ICS’02 New York City June 26, 2002 [14]
Generating new x at the end of each iteration:
25 50 75 100 200 400 600 800 1000
Number of Tasks (T) Total Master Processing Time (sec)
each iteration depends on N, T
0.5 1 1.5 2 2.5 3 3.5
200 400 600 800 1000
Number of Task (T)
Compute New x (sec)
20 40 60 80
Number of Iteration (n)
ICS’02 New York City June 26, 2002 [15]
2 4 6 8 2 4 6 8 10 12 14 16
Size of Data Sent (KB) Time (usec) 0.00 0.14 0.28 0.42 0.56 0.70 0.84 0.98 2 4 6 8 10 12 14 16
Size of Data Sent (KB)
Time (sec)
Experiment 1 Experiment 2
for message sizes used in ATR (250–1200 bytes)
Between local nodes Between Wisconsin and Bologna, Italy
ICS’02 New York City June 26, 2002 [16]
20 40 60 80 100 120 140 160 1 2 3 4 5 6 Basket Size (B) Number of Iterations (n) maximum average minimum
ICS’02 New York City June 26, 2002 [17]
ICS’02 New York City June 26, 2002 [18]
– Variable for N, T, B – Include only time to generate new x
– Very low variation – Consistent from one iteration to another
– Communication time – Master updating Q(x) – (if T not too large)
ICS’02 New York City June 26, 2002 [19]
WI-Argonne Flock 24.9 22.9 20.96 441 44 400 20,000 ssn WI-pool 12.1 10.3 6.32 64 44 100 10000 ssn WI-Argonne Flock 29.3 26.4 20.88 295 61 200 20,000 ssn WI-Argonne Flock 36.3 33.5 20.89 244 84 100 20,000 ssn WI-Argonne Flock 44.7 40.8 20.91 180 108 50 20,000 ssn WI-NM Flock 52.2 48.8 30.97 297 84 100 40,000 ssn WI pool 70.5 69.4 2.35 2762 597 200 5,000 20-terms
Measured Model Total (tM) num it. (n) Note Total Execution Time (min) Benchmark Average (tW) (sec) Compute New x (sec) T N Planning Problem
ICS’02 New York City June 26, 2002 [20]
200 9.67 7.15 10.21 1.68 2.11 61.3 36 150 9.63 6.65 9.78 1.37 2.86 46.77 36 150 9.07 6.85 9.42 1.38 2.76 53.18 38 150 13.75 10.73 13.88 1.36 2.86 60.71 42 50 14.35 13.96 13.82 4.18 6.62 35.8 58 50 35.07 34.22 28.62 4.19 7.03 50.02 70 50 34.65 34.23 28.62 4.21 7.04 50.37 70 Number of Workers Request Measur ed Model tw
max
tw
min
avg. tM n Non Adaptive Execution Time (min) Worker Time ( sec) Computing new X (sec)
ICS’02 New York City June 26, 2002 [21]
18 min 149 min 68 min 92 min 61 min B=6 B=3 B=6 B=3 Default Debug Reduced Debug Near-Optimize ATR Execution Time Original ATR Execution Time (T = 100, G = 25)
ICS’02 New York City June 26, 2002 [22]
9 9 10 13 15 20 20 20 master node’s worker queue master node’s job queue per iteration benchmark: Ew:
ICS’02 New York City June 26, 2002 [23]
Original task assignment Execution Time (min) 52% 91 35.59 39.56 100 82.94 98.6 8.06 20.33 29% 67 3.24 6.78 100 9.52 9.6 0.83 2.84 45% 86 3.41 4.78 100 8.67 9.5 1.20 2.25 49% 26 9.86 11.43 50 22.32 19.4 2.58 7.76 17% 45 8.03 10.50 50 12.66 8.9 1.69 4.02 Number
Workers Used Model Measured Number of Worker Model max min avg Estimated Speedup (%) Adaptive task assignment Execution time (min) Worker Time (tW) (sec)
ICS’02 New York City June 26, 2002 [24]
ICS’02 New York City June 26, 2002 [25]
ICS’02 New York City June 26, 2002 [26]
ICS’02 New York City June 26, 2002 [27]
ICS’02 New York City June 26, 2002 [28]
1.80 27.0 0.05 Very cold 1.50 14.0 0.15 Cold 0.85 08.0 0.30 Warm 1.00 10.0 0.50 Normal Price Demand Prob. Scenario
ICS’02 New York City June 26, 2002 [29]
ICS’02 New York City June 26, 2002 [30]
Master Workers
ICS’02 New York City June 26, 2002 [31]
e.g., N = 5,000 or N = 40,000
e.g., G = 50 or G = 100
e.g., T = 200 or T = 1,000
e.g., B = 5
ICS’02 New York City June 26, 2002 [32]
– Benchmark = execution time of a sample task group
– Indicates the expected time needed for this worker to complete one task group