
SLIDE 1

An asynchronous parallel derivative-free algorithm for handling general constraints

Josh Griffin

Computational Sciences and Mathematics Research
Sandia National Laboratories
Livermore, California, USA

Second International Congress on Mathematical Software
Castro Urdiales, Spain
September 1–3, 2006

Joint work with Tammy Kolda, Robert Michael Lewis, and Virginia Torczon

Computational Sciences and Mathematics Research Slide 1 September 2, 2006

SLIDE 2

Talk outline

1. Problems of interest
2. Generating set search background
3. Linear constraints
4. Nonlinear equality constraints
5. Numerical results


SLIDE 3

Why use derivative-free?

Punchline: Derivative-free methods are more reliable and less restrictive.

Should I take the [image] or the [image]?

Derivative-based if ...

Function evaluations quick
All points finite/defined
Continuous and smooth
Little to no noise

Derivative-free if ...

Function evaluations slow
Points may be undefined
Discontinuous, nonsmooth okay
Noise okay


SLIDE 4

Problems we are interested in

Function evaluations are CPU-intensive
Simulation-based objective function can periodically crash
Noise limits ability to estimate derivatives
No analytic formula for objective function


SLIDE 5

Optimization in nuclear safety studies

1. Question: Could accidental drop jeopardize integrity of internal components?

2. Plan: Drop model from different angles
3. Goal: Find angle that maximizes damage

Resulting Problem:

$$\max_{x \in \mathbb{R}^2} D(x) \quad \text{subject to} \quad 0 \le x_i \le \pi$$

D(x) is measure of damage
Time per evaluation: 1-15 hrs
Software entities: 3


SLIDE 6

Generating Set Search and APPSPACK


SLIDE 7

Problem types for APPSPACK

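$$\min_{x \in \mathbb{R}^n} f(x) \quad \text{subject to} \quad c(x) = 0, \quad Ax \le b$$

Here $f : \mathbb{R}^n \to \mathbb{R}$, $c : \mathbb{R}^n \to \mathbb{R}^p$, and $A$ is an $m \times n$ matrix.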

linear equalities permitted
derivatives for f(x) and c(x) unavailable
number of variables relatively small (≤ 100)


SLIDE 8

Generating set search algorithms

General idea: Use a set of positively spanning search directions

Two examples: [figures]

Guaranteed: A search direction within 90° of the steepest descent direction
Punchline: Always have a descent direction if one exists
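The guarantee can be checked numerically. The sketch below assumes the coordinate directions +e_i and -e_i as the positively spanning set (an illustrative choice, not taken from the slides) and a made-up gradient vector; it reports the smallest angle between any search direction and the steepest descent direction.

```python
import math

def coordinate_directions(n):
    """Return the 2n directions +e_i and -e_i, which positively span R^n."""
    dirs = []
    for i in range(n):
        for sign in (1.0, -1.0):
            d = [0.0] * n
            d[i] = sign
            dirs.append(d)
    return dirs

def best_angle_to_descent(grad, dirs):
    """Smallest angle in degrees between any direction and -grad (grad nonzero)."""
    neg = [-g for g in grad]
    norm_neg = math.sqrt(sum(v * v for v in neg))
    best = 180.0
    for d in dirs:
        norm_d = math.sqrt(sum(v * v for v in d))
        cos = sum(a * b for a, b in zip(neg, d)) / (norm_neg * norm_d)
        cos = max(-1.0, min(1.0, cos))  # guard against rounding outside [-1, 1]
        best = min(best, math.degrees(math.acos(cos)))
    return best

grad = [3.0, -1.0, 0.5]  # hypothetical gradient, for illustration only
angle = best_angle_to_descent(grad, coordinate_directions(3))
print(angle < 90.0)
```

The algorithm itself never computes the gradient; the check only illustrates why some direction in the set is a descent direction whenever the gradient is nonzero.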


SLIDE 11

Synchronous framework (unconstrained)

while (∆ > ∆tol)
   1. Generate trial points and send to evaluation queue:
      X = {x + ∆d(i) : d(i) ∈ search pattern}
   2. Collect evaluated trial points: Y = X
   3. Update: Is there a point y ∈ Y better than x?
      Yes: x ← y (successful)
      No: ∆ ← .5∆ (unsuccessful)
end
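A minimal serial sketch of this loop follows. It assumes the coordinate directions as the search pattern, simple decrease as the meaning of "better", and a toy quadratic objective; all names and parameter values are illustrative, and in a parallel run the evaluations in step 2 would be sent to workers rather than computed in a list comprehension.

```python
def gss_synchronous(f, x, delta=1.0, delta_tol=1e-6):
    """Synchronous generating set search with coordinate directions."""
    n = len(x)
    dirs = []
    for i in range(n):
        for sign in (1.0, -1.0):
            d = [0.0] * n
            d[i] = sign
            dirs.append(d)
    fx = f(x)
    while delta > delta_tol:
        # 1. Generate trial points X = {x + delta * d}.
        X = [[xj + delta * dj for xj, dj in zip(x, d)] for d in dirs]
        # 2. Collect every evaluated trial point (Y = X).
        Y = [(f(p), p) for p in X]
        # 3. Update: accept the best point if it improves on x, else contract.
        fy, y = min(Y, key=lambda t: t[0])
        if fy < fx:
            x, fx = y, fy          # successful
        else:
            delta *= 0.5           # unsuccessful
    return x, fx

def quadratic(x):
    """Toy objective with minimizer (1, -2)."""
    return (x[0] - 1.0) ** 2 + (x[1] + 2.0) ** 2

x_opt, f_opt = gss_synchronous(quadratic, [0.0, 0.0])
print(x_opt, f_opt)
```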


SLIDE 12

Synchronous framework (unconstrained)

We enforce a sufficient decrease condition based on the step size ∆:

f(y) ≤ f(x) − α∆²
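As a small illustration, the test can be written as a predicate. The default value of alpha below is a hypothetical choice; the slides do not give one.

```python
def sufficient_decrease(f_y, f_x, delta, alpha=1e-4):
    """Return True when f(y) <= f(x) - alpha * delta**2."""
    return f_y <= f_x - alpha * delta ** 2

# The same small improvement is rejected at a large step and accepted at a small one.
print(sufficient_decrease(0.9999, 1.0, delta=1.0, alpha=0.01))
print(sufficient_decrease(0.9999, 1.0, delta=0.01, alpha=0.01))
```

Because the required decrease scales with the square of the step size, the threshold relaxes as the step size contracts.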


SLIDE 13

Synchronous framework (unconstrained)

Collecting all evaluated trial points (Y = X) is the step where asynchronous algorithms win in parallel.


SLIDE 14

Asynchronous framework (unconstrained)

while (∆(i) > ∆tol)
   1. Generate trial points and send to evaluation queue:
      X = {x + ∆(i)d(i) : d(i) ∈ search pattern and inactive}
   2. Collect nonempty set of evaluated trial points: Y ⊆ X
   3. Update: Is there a point y ∈ Y better than x?
      Yes: x ← y, ∆(i) = max(∆y, ∆min)
           Successful: may prune queue
      No: ∆(i) ← .5∆(i) for "evaluated" indices
          Unsuccessful: may not prune queue
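The bookkeeping can be sketched with a serial simulation. The assumptions here are illustrative and not from the slides: coordinate directions as the search pattern, simple decrease as the acceptance test, and "the first two points in the queue" standing in for whichever evaluations happen to finish first. The sketch always prunes on success, which the framework permits but does not require.

```python
from collections import deque

def gss_asynchronous(f, x, delta0=1.0, delta_min=1e-3, delta_tol=1e-6, batch=2):
    """Simulated asynchronous generating set search (unconstrained)."""
    n = len(x)
    dirs = []
    for i in range(n):
        for sign in (1.0, -1.0):
            d = [0.0] * n
            d[i] = sign
            dirs.append(d)
    delta = [delta0] * len(dirs)   # one step size per direction
    active = [False] * len(dirs)   # True while a direction has a pending point
    queue = deque()                # pending trial points: (index, step, point)
    fx = f(x)
    while max(delta) > delta_tol:
        # 1. Generate trial points for inactive, unconverged directions.
        for i, d in enumerate(dirs):
            if not active[i] and delta[i] > delta_tol:
                y = [xj + delta[i] * dj for xj, dj in zip(x, d)]
                queue.append((i, delta[i], y))
                active[i] = True
        # 2. Collect a nonempty subset Y of the pending points.
        Y = []
        for _ in range(min(batch, len(queue))):
            i, step, y = queue.popleft()
            Y.append((f(y), i, step, y))
        # 3. Update.
        fy, _, step_y, y = min(Y, key=lambda t: t[0])
        if fy < fx:
            x, fx = y, fy
            delta = [max(step_y, delta_min)] * len(dirs)
            queue.clear()                      # successful: prune the queue
            active = [False] * len(dirs)
        else:
            for _, i, _, _ in Y:               # unsuccessful: contract only
                delta[i] *= 0.5                # the evaluated directions
                active[i] = False
    return x, fx

def quadratic(x):
    """Toy objective with minimizer (1, -2)."""
    return (x[0] - 1.0) ** 2 + (x[1] + 2.0) ** 2

x_async, f_async = gss_asynchronous(quadratic, [0.0, 0.0])
print(x_async, f_async)
```

Directions whose points are still pending keep their step size, which is why each direction carries its own ∆(i).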


SLIDE 17

Unconstrained optimization demo

best: a
pending: b c d e
evaluated: (none)
pruned: (none)

[Figure: trial points around the current best point, with step size ∆]


SLIDE 19

Unconstrained optimization demo

best: a
pending: c d
evaluated: b e
pruned: (none)

SLIDE 20

Unconstrained optimization demo

best: a
pending: f g c d
evaluated: (none)
pruned: (none)

SLIDE 21

Unconstrained optimization demo

best: a
pending: c d
evaluated: f g
pruned: (none)

SLIDE 22

Unconstrained optimization demo

best: f
pending: h i j k c d
evaluated: (none)
pruned: (none)

SLIDE 23

Unconstrained optimization demo

best: f
pending: i k
evaluated: c j h
pruned: d

SLIDE 24

Unconstrained optimization demo

best: c
pending: l m n o i k
evaluated: (none)
pruned: (none)

SLIDE 25

Unconstrained optimization demo

best: c
pending: n k
evaluated: l m o i
pruned: (none)

SLIDE 26

Unconstrained optimization demo

best: l
pending: p q r s n k
evaluated: (none)
pruned: (none)

SLIDE 27

Unconstrained optimization demo

best: l
pending: p q r s
evaluated: n k
pruned: (none)

SLIDE 28

Unconstrained optimization demo

best: l
pending: p q r s
evaluated: (none)
pruned: (none)

SLIDE 29

Handling linear constraints:

Same algorithm, different directions


SLIDE 30

Computing conforming search directions

[Figure: search directions shown against the steepest descent direction −∇f(x)]


SLIDE 32

Locally conforming directions

We want the ability to move parallel to active constraints
We also want the ability to move parallel to “nearby” constraints


SLIDE 34

ε-active constraints

We place a ball of radius ε about the current best point. Constraints passing through this ε-ball are considered ε-active constraints.

[Figure: ε-ball about the current best point, with the ε-active constraints marked]
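For linear constraints Ax ≤ b, the ε-ball test reduces to a distance computation per row. The sketch below assumes x is feasible and every row of A is nonzero; the function name and the unit-box example are illustrative and not from the slides.

```python
import math

def eps_active(A, b, x, eps):
    """Indices i whose boundary {z : a_i . z = b_i} lies within eps of x."""
    active = []
    for i, (row, bi) in enumerate(zip(A, b)):
        norm = math.sqrt(sum(a * a for a in row))
        slack = bi - sum(a * xj for a, xj in zip(row, x))
        # For feasible x, slack / norm is the distance from x to the boundary.
        if slack / norm <= eps:
            active.append(i)
    return active

# Unit box 0 <= x <= 1 in R^2 written as A x <= b.
A = [[1.0, 0.0], [-1.0, 0.0], [0.0, 1.0], [0.0, -1.0]]
b = [1.0, 0.0, 1.0, 0.0]
print(eps_active(A, b, [0.9, 0.5], 0.2))
print(eps_active(A, b, [0.9, 0.5], 0.6))
```

Enlarging ε picks up more constraints, which shrinks the set of directions considered safe to follow.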


SLIDE 35

Conforming directions

We then compute corresponding conforming search directions


SLIDE 36

ε-tangent cone, T(x, ε)

The positive span of the conforming directions forms the ε-tangent cone.
We will denote the ε-tangent cone by T(x, ε).

SLIDE 37

Summarizing

Punchline: We can always travel a distance of at least ε along each conforming search direction and remain feasible.

Thus it makes sense to set ε equal to the current step size: ε = ∆.

In asynchronous mode we have multiple step sizes: ∆(i), i = 1, ..., p. This implies we must work with multiple tangent cones.


SLIDE 38

Computing conforming search directions

Two-step process:

1. Determine ε-active constraint normals
   Their positive span forms the ε-normal cone N(x, ε)
   Always a subset of rows from the constraint matrix
2. Find a positive-spanning set for T(x, ε) = N(x, ε)°
   nondegenerate case: formed using LAPACK
   degenerate case: formed using the C library cddlib
   – Double description method of Motzkin et al.; cddlib written by Komei Fukuda

Punchline: Conforming search directions are given by the generators of T(x, ε)
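For the nondegenerate case, a standard construction can be sketched as follows. It assumes the k ε-active normals are linearly independent and stored as the rows of a k × n matrix, here called A_ε (a name introduced only for this illustration). The slides say only that LAPACK is used, so this is not necessarily the exact computation in APPSPACK.

```latex
\[
T(x,\epsilon) = N(x,\epsilon)^{\circ}
  = \{\, d \in \mathbb{R}^n : A_\epsilon d \le 0 \,\}
\]
\[
A_\epsilon^{+} = A_\epsilon^{T} (A_\epsilon A_\epsilon^{T})^{-1}, \qquad
Z \in \mathbb{R}^{n \times (n-k)} \text{ with columns spanning } \operatorname{null}(A_\epsilon)
\]
\[
d = -A_\epsilon^{+} u + Z w, \quad u \ge 0,\; w \in \mathbb{R}^{n-k}
\;\Longrightarrow\;
A_\epsilon d = -u \le 0
\]
```

Conversely, any d with A_ε d = −u ≤ 0 can be written this way, because d + A_ε⁺u lies in the null space of A_ε. The columns of −A_ε⁺, Z, and −Z therefore positively span T(x, ε). When the normals are linearly dependent this construction breaks down, which is the degenerate case handled by the double description method.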


SLIDE 39

Synchronous framework for linear constraints

while (∆ > ∆tol)
   1. Use conforming search directions for ε = min(∆, εmax).
   2. Generate trial points and send to evaluation queue:
      X = {x + ∆̃d(i) : d(i) ∈ search pattern}, ∆̃ ∈ [0, ∆]
   3. Collect evaluated trial points: Y = X
   4. Update: Is there a point y ∈ Y better than x?
      Yes: x ← y (successful)
      No: ∆ ← .5∆ (unsuccessful)

Note: Asymptotically we need ε = ∆. Hence we require εmax > ∆tol.
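The slides do not say how the shortened step in [0, ∆] is chosen. One natural choice, shown here as an assumption, is the longest step along d that keeps the trial point feasible for Ax ≤ b. The function name and the unit-box example are illustrative.

```python
def truncated_step(A, b, x, d, delta):
    """Largest t in [0, delta] with A (x + t d) <= b, assuming A x <= b."""
    t = delta
    for row, bi in zip(A, b):
        rate = sum(a * dj for a, dj in zip(row, d))        # a_i . d
        slack = bi - sum(a * xj for a, xj in zip(row, x))  # b_i - a_i . x
        if rate > 0.0:
            # Moving along d eats into this constraint; stop at its boundary.
            t = min(t, slack / rate)
    return max(t, 0.0)

# Unit box 0 <= x <= 1 in R^2 written as A x <= b.
A = [[1.0, 0.0], [-1.0, 0.0], [0.0, 1.0], [0.0, -1.0]]
b = [1.0, 0.0, 1.0, 0.0]
step_toward_wall = truncated_step(A, b, [0.9, 0.5], [1.0, 0.0], 0.5)
step_away = truncated_step(A, b, [0.9, 0.5], [-1.0, 0.0], 0.5)
print(step_toward_wall, step_away)
```

With conforming directions and ε = ∆, truncation should rarely be needed, since a step of length at least ε is feasible along each conforming direction.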


SLIDE 40

Asynchronous tricky

Synchronous case:
– one tangent cone per iteration
– swap if tangent cone changes

Asynchronous case:
– multiple tangent cones per iteration
– swap vs. append tangent cone

Punchline: Must include conforming search directions for

$$\bigcup_{\{i \,:\, \Delta^{(i)} \le \epsilon_{\max}\}} T(x, \Delta^{(i)}) \;\cup\; T(x, \epsilon_{\max})$$


SLIDE 41

Asynchronous framework for linear constraints

For simplicity assume εmax = ∞.

while (∆(i) > ∆tol)
   1. Generate trial points and send to evaluation queue:
      X = {x + ∆̃(i)d(i) : d(i) ∈ search pattern and inactive}
   2. Collect nonempty set of evaluated trial points: Y ⊆ X
   3. Update: Is there a point y ∈ Y better than x?
      Yes: x ← y, ∆(i) = max(∆y, ∆min)
           Use conforming directions for ε = current step size
      No: ∆(i) ← .5∆(i) for "evaluated" indices
          Append conforming directions for ε = min_i(∆(i))


SLIDE 42