Arash Deshmeh, Jacob Machina, and Angela C. Sodan
University of Windsor, Canada
ADEPT Scalability Predictor in Support of Adaptive Resource Allocation
IPDPS 2010
ADEPT Scalability Predictor in Support of Adaptive Resource - - PowerPoint PPT Presentation
Arash Deshmeh, Jacob Machina, and Angela C. Sodan University of Windsor, Canada ADEPT Scalability Predictor in Support of Adaptive Resource Allocation IPDPS 2010 Outline Background: Adaptive Resource Allocation Related Work Downey
IPDPS 2010
Adaptive resource allocation:
Reducing fragmentation Adapting to current load (low/high)
Run at higher efficiency with smaller
size Ideal Real Speedup
min
N
N
max
N
Help in choosing job sizes tactically Determine maximum meaningful job sizes
Clusters (MPI jobs) SMPs (OpenMP or MPI jobs) Virtual-machine resource provisioning
Most approaches are white-box (detailed model)
Require tools: code instrumentation, compiler/OS support,
Achieve high prediction accuracy Provide computationally efficient approach Detect and automatically correct individual anomalies Detect and model non-uniform patterns (multi-phase) Perform reliability judgment with potential advice for
Simple (only A and to be learned) Needs few observation points
Speedup Curves, A varies
50 100 150 200 250 300 350 100 200 300 400
Speedup Curves, varies
50 100 150 200 250 300 100 200 300 400
Speedup curves for Downey m
typical application
20 40 60 80 100 120 140 160 100 200 300 400
Typical application Downey model
Flat Linear Transitional Declining
Core of ADEPT
Derives constraints from observations Calculates closed-form solutions (within certain
Use lowest and highest bounds as overall envelope
Forming the Envelope
50 100 150 200 250 300 100 200 300 400
N S Range Pair 1 Range Pair 2 Range Pair 3
Prediction per target point, biased to closest observations Weighted least-squared relative errors Two-step
Constraints from envelope and two-step curve fitting make
Speedup Prediction Using 4 Methods
50 100 150 200 100 200 300 400 500
N S
Levm ar ADEPT / Exhaus tive / Genetic
Experiments with MPI and OpenMP NAS benchmarks BT, CG, FT, LU, SP 7 real anonymous applications
Both interpolation and extrapolation 3 to 4 input observation points Prediction of T(n) and S(n) T(1) not always available
NAS_FT
10 20 30 40 50 60 50 100 150 Standard Biased Weighting Predictions Uniform Weighting Predictions
App_A
2 4 6 8 10 12 5 10 15 20 25
NAS_OMP_BT
1 2 3 4 5 6 10 20 30 40
NAS_OMP_CG
1 2 3 4 5 6 7 8 10 20 30 40
App_E
20 40 60 80 100 120 50 100 150 200 250 300
App_F
10 20 30 40 50 60 70 80 50 100 150
NAS_BT
1 10 100 1000 50 100 150 200 250 300
NAS_CG
1 10 100 1000 50 100 150
NAS_FT
1 10 100 1000 50 100 150
App_B
1 10 100 1000 10000 500 1000 1500 2000
App_D
1 10 100 1000 10000 100000 50 100 150 200 250 300
App_E
1 10 100 1000 10000 100000 50 100 150 200 250 300
Both interpolation and extrapolation work well
Core of ADEPT
Serious deviations from model can be detected
Approach: fluctuation metric R
Speedup curve, with anomalous point
20 40 60 80 100 120 50 100 150 200 250
R Metric Curve
0.80 1.00 1.20 1.40 1.60 1.80 2.00 2.20 50 100 150 200
R Metric Curves
0.80 1.00 1.20 1.40 1.60 1.80 2.00 2.20 50 100 150 200
Anomaly, NAS_SP
20 40 60 80 100 120 140 50 100 150 200 250 300
Anomaly, NAS_OMP_SP
1 2 3 4 5 6 7 8 9 10 20 30 40
Anomaly, Synthetic
20 40 60 80 100 120 140 160 50 100 150 200 250
Stepwise NAS_OMP_FT
1 2 3 4 5 6 7 10 20 30 40
Stepwise NAS_OMP_FT, Fitted
1 2 3 4 5 6 7 8 9 10 20 30 40
Stepwise Synthetic, Fitted
50 100 150 200 250 300 50 100 150 200 250 300 350
Specially Optimized for 2^n Nodes, Fitted
10 20 30 40 50 60 50 100 150 200
Core of ADEPT
All input points in linear section
High fitting error, not explainable as anomaly
Runner-up problem (two or more model instances
High Fitting Error, NAS_LU
50 100 150 200 250 50 100 150 200 250 300
All Linear Speedup, App_C
5 10 15 20 25 30 35 10 20 30 40
All 3 cases (linear, high-fitting error, runner-up) successfully detected
Runner-Up Model Instance, NAS_SP
1 10 100 1000 50 100 150 200 250
ADEPT is accurate and efficient
Employs envelope derivation technique to constrain search
Biased model fitting with efficient two-level approach Anomaly detection based on fluctuation metric and automatic
Warnings by reliability judgment if prediction uncertain Suitable for production environments