Covariance Steering as a Tool for Planning and Control in the - - PowerPoint PPT Presentation

covariance steering as a tool for planning and control in
SMART_READER_LITE
LIVE PREVIEW

Covariance Steering as a Tool for Planning and Control in the - - PowerPoint PPT Presentation

Covariance Steering as a Tool for Planning and Control in the Presence of Uncertainty Panagiotis Tsiotras School of Aerospace Engineering Institute for Robotics and Intelligent Machines Georgia Institute of Technology Uncertainty Synthesis


slide-1
SLIDE 1

Covariance Steering as a Tool for Planning and Control in the Presence of Uncertainty

Panagiotis Tsiotras

School of Aerospace Engineering Institute for Robotics and Intelligent Machines Georgia Institute of Technology

Uncertainty Synthesis Workshop, CDC 2019 December 10, 2019

slide-2
SLIDE 2

Moving Densities

slide-3
SLIDE 3

Gaussian Case: Steering the Covariance

slide-4
SLIDE 4

Example: Powered Descend Guidance

slide-5
SLIDE 5

Problem Formulation

Given the discrete-time stochastic linear system xk+1 = Akxk + Bkuk + Dkwk Initial and final states to be distributed according to x0 ∼ N(µ0, Σ0), xN ∼ N(µf, Σf) with µ0, Σ0, µf, Σf given. Minimize the cost function J(u0, . . . , uN−1) = E N−1

  • k=0

u⊤

kuk

  • → min
slide-6
SLIDE 6

The system state at step k + 1 is given by xk+1 = Akx0 + BkUk + DkWk. where Uk =      u0 u1 . . . uk      , Wk =      w0 w1 . . . wk      and where Ak Ak,0, Bk Bk,0, Dk Dk,0 Bk1,k0

  • Bk1,k0

Bk1,k0+1 · · · Bk1,k1

  • ,

Dk1,k0

  • Dk1,k0

Dk1,k0+1 · · · Dk1,k1

  • ,

Ak1,k0 = Ak1Ak1−1 · · · Ak0, Bk1,k0 = Ak1,k0+1Bk0, Dk1,k0 = Ak1,k0+1Dk0

slide-7
SLIDE 7

Let A = AN−1, B = BN−1, D = DN−1 and U = UN−1 =      u0 u1 . . . uN−1      , W = WN−1 =      w0 w1 . . . wN−1      then xN = Ax0 + BU + DW

slide-8
SLIDE 8

The mean of the state µk = E [xk] obeys the expression µk+1 = Akµ0 + BkU k where U k = E [Uk]. Let

  • Uk Uk − U k,
  • xk xk − µk,

It follows that

  • xk+1 = Ak

x0 + Bk Uk + DkWk. J(U) = E

  • U⊤U
  • = U⊤U

+ tr

  • E
  • U

U⊤

.

slide-9
SLIDE 9

Steering the Mean

Main Result The optimal control U ⋆ that minimizes the cost Jµ = U

⊤U = N−1

  • k=0

E [uk]⊤E [uk] subject to the constraint Aµ0 + BU = µf is given by U ⋆ = B⊤(BB⊤)

−1(µf − Aµ0)

slide-10
SLIDE 10

Diffusionless Case (Dk = 0)

Theorem (Goldshtein and Tsiotras, 2017) Let V0S0V ⊤

0 = Σ0,

VF SF V ⊤

F = Σf,

and UΩSΩV ⊤

Ω SF 1 2 V ⊤ F(BB⊤) −1AV0S0 1 2 .

Then the optimal control gain L that minimizes JΣ subject to a constraint ΣN = Σf, is given by L⋆ = B⊤(BB⊤)

−1(VF SF 1 2 UΩV ⊤ ΩS0 − 1 2 V ⊤ 0 − A)

Control is of the form

  • U = L

x0,

slide-11
SLIDE 11

General Case (Dk = 0)

Key Observation The system xk+1 = Akxk + Bkuk + Dkwk at time step N can be viewed as a sum of N uncorrelated E

  • x(i)

k x(j) m ⊤

= 0, k, m, i, j ∈ {0, . . . , N}, i = j, diffusion-less sub-systems xN =

N−1

  • i=0

x(i)

N + DwN−1,

x(i)

k+1 = Akx(i) k + Bku(i) k ,

x(i)

i

= x0, for i = 0, Di−1wi−1,

  • therwise.
slide-12
SLIDE 12

Optimal Controller

U (i)

i,N−1 =

  • L(i)x(i)

i ,

i = 1, . . . , N − 1, L(0)x0 + E [U] , i = 0. where U (i)

k1,k2

      u(i)

k1

u(i)

k1+1

. . . u(i)

k2

      , 0 ≤ k1 ≤ k2 ≤ N − 1.

slide-13
SLIDE 13

Optimal Controller Assume Σ0 0 and Σf 0, let y0 = x0 − µ0, and define yk = Dk−1wk−1 = xk − (Ak−1xk−1 + Bk−1uk−1) Let Φk = (I + BN,kB⊤

N,kΛ) −1AN,k, with Λ = Λ⊤ be the solution of the matrix equation N−1

  • k=1

ΦkDk−1D⊤

k−1Φ⊤ k + Φ0Σ0Φ⊤ 0 = Σf − DN−1D⊤ N−1 0

The optimal linear control law is given by u⋆

k = B⊤ N,k(BB⊤)−1(µf − Aµ0) + k

  • i=0

L(i)

k yi

where, L(i)

k = −B⊤ N,kΛΦi.

slide-14
SLIDE 14

Relation with LQG

Theorem (Goldshtein and Tsiotras, 2017; Chen et al, 2016) Let initial and final state covariance matrices Σ0 and Σf and symmetric matrix Qf. Assume that the LQG controller that minimizes the cost function J(u0, . . . , uN−1) = E N−1

  • k=0

u⊤

kuk + x⊤ NQfxN

  • ,

results in the final state covariance being equal to Σf. Then, this controller coincides is the same as the covariance steering controller with boundary constraints x0 ∼ N(0, Σ0), xN ∼ N(0, Σf), with Λ = Qf.

slide-15
SLIDE 15

50 100 2000 4000 6000

σ2

x

11

50 100

  • 2000

2000 4000 6000

σ2

x

12

50 100

  • 2000

2000 4000 6000

σ2

x

21

50 100

Steps

2000 4000 6000

σ2

x

22

CC LQG

slide-16
SLIDE 16

General Cost

Consider discrete-time stochastic linear system xk+1 = Akxk + Bkuk + Dkwk We wish the initial and final states to be distributed according to x0 ∼ N(µ0, Σ0), xN ∼ N(µN, ΣN) where µ0, Σ0, µN, ΣN given, while minimizing the cost function J(x, u) = E N−1

  • k=0

x⊤

k Qkxk + u⊤ k Rkuk

  • where Qk 0 and Rk ≻ 0 for all k = 0, 1, . . . , N − 1.

Assume that Σ0 0 and ΣN ≻ 0,

slide-17
SLIDE 17

Introduce augmented state X = [x⊤

0 , x⊤ 1 , . . . , x⊤ N]⊤ to write

X = Ax0 + BU + DW cost J(X, U) = E

  • X⊤ ¯

QX + U ⊤ ¯ RU

  • boundary conditions

µ0 = E0E[X], Σ0 = E0

  • E[XX⊤] − E[X]E[X]⊤

E⊤ µN = ENE[X], ΣN = EN

  • E[XX⊤] − E[X]E[X]⊤

E⊤

N

where Ek is a matrix such that xk = EkX, k = 0, 1, . . . , N Note: A ← ENA, B ← ENB, D ← END

slide-18
SLIDE 18

Let the control sequence uk = vk + Kkyk where yk is given by yk+1 = Akyk + Dkwk y0 = x0 − µ0 and let the control law U = V + KY

slide-19
SLIDE 19

Theorem (Okamoto & PT, 2018) The cost function takes the form J(V , K) = tr

  • ((I + BK)⊤ ¯

Q(I + BK) + K⊤ ¯ RK)(AΣ0A⊤ + DD⊤)

  • + (Aµ0 + BV )⊤ ¯

Q(Aµ0 + BV ) + V ⊤ ¯ RV In addition, the terminal state constraints can be written as µN = EN (Aµ0 + BV ) , ΣN = EN(I + BK)(AΣ0A⊤ + DD⊤)(I + BK)⊤E⊤

N

Note that V steers the mean and K steers the covariance, respectively. Letting ΣN EN(I + BK)(AΣ0A⊤ + DD⊤)(I + BK)⊤E⊤

N

yields a convex problem.

slide-20
SLIDE 20

Can handle convex chance constraints of the form Pr(xk / ∈ χ) ≤ Pfail, k = 0, . . . , N − 1 where χ =

M

  • j=1

{x : α⊤

j x ≤ βj}

using the standard trick Pr(α⊤

j x ≤ βj) = Φ

  βj − a⊤

j x

  • α⊤

j Σxαj

  ≥ 1 − pj,

M

  • j=1

pj ≤ Pfail

  • r

α⊤

j x − βj +

  • α⊤

j Σxαj Φ−1(1 − pj) ≤ 0

where Φ is the cumulative distribution function of the standard normal distribution.

slide-21
SLIDE 21

Assuming Pr(α⊤

j X > βj) ≤ pj,fail M

  • j=1

pj,fail ≤ Pfail the chance constraint can be formulated as α⊤

j (Aµ0 + BV ) + (AΣ0A⊤ + DD⊤)1/2(I + BK)⊤αjΦ−1(1 − pj,fail) − βj ≤ 0

Second order cone (convex) constraint in K and V .

slide-22
SLIDE 22

Example

09

slide-23
SLIDE 23
slide-24
SLIDE 24

Non-Convex Constraints

For non-convex polytopic constraints, write χ =

NR−1

  • r=0

Mr−1

  • q=0

{x : α⊤

r,qx ≤ βr,q}

  • Rr

and enforce Pr (xk / ∈ Rr) < ǫ and Pr (xk+1 / ∈ Rr) < ǫ Lemma Given Rr, the condition Pr (xk / ∈ Rr) < ǫ and Pr (xk+1 / ∈ Rr) < ǫ, is a second-order cone constraint in V and K.

slide-25
SLIDE 25

Example

slide-26
SLIDE 26

Example

slide-27
SLIDE 27

MPC

(Model Predictive Control: Classical, Robust and Stochastic, B. Kouvaritakis and M. Cannon)

slide-28
SLIDE 28

Stochastic MPC

slide-29
SLIDE 29
slide-30
SLIDE 30

Stochastic MPC

min

uk|k,...,uk+N−1|kJN(µk, Σk; uk|k, . . . , uk+N−1|k) = Ek

k+N−1

  • t=k

x⊤

t|kQxt|k + u⊤ t|kRut|k

  • + Jf(xk+N|k)

subject to xt+1|k = Axt|k + But|k + Dwt, xk|k = xk ∼ N(µk, Σk) Prk

  • α⊤

x,ixt|k ≤ βx,i

  • ≥ 1 − px,i,

i = 0, . . . , Ns − 1 Prk

  • α⊤

u,jut|k ≤ βu,j

  • ≥ 1 − pu,j,

j = 0, . . . , Nc − 1

slide-31
SLIDE 31

Stochastic MPC

min

uk|k,uk+1|k,...,uk+N−1|kJN(xk; uk|k, uk+1|k, . . . , uk+N−1|k) =

Ek k+N−1

  • t=k

x⊤

t|kQxt|k + u⊤ t|kRut|k

  • + Ek[xk+N|k]⊤PmeanEk[xk+N|k]

subject to xt+1|k = Axt|k + But|k + Dwt, xk|k = xk ∼ N(µk, Σk) Prk

  • α⊤

x,ixt|k ≤ βx,i

  • ≥ 1 − px,i,

i = 0, . . . , Ns − 1 Prk

  • α⊤

u,jut|k ≤ βu,j

  • ≥ 1 − pu,j,

j = 0, . . . , Nc − 1 Ek

  • xk+N|k
  • ∈ X µ

f

Ek

  • (xk+N|k − E[xk+N|k])(xk+N|k − E[xk+N|k])⊤

Σf

slide-32
SLIDE 32
slide-33
SLIDE 33

Stochastic MPC

min

uk|k,uk+1|k,...,uk+N−1|kJN(xk; uk|k, uk+1|k, . . . , uk+N−1|k) =

Ek k+N−1

  • t=k

x⊤

t|kQxt|k + u⊤ t|kRut|k

  • + Ek[xk+N|k]⊤PmeanEk[xk+N|k]

subject to xt+1|k = Axt|k + But|k + Dwt, xk|k = xk ∼ N(µk, Σk) Prk

  • α⊤

x,ixt|k ≤ βx,i

  • ≥ 1 − px,i,

i = 0, . . . , Ns − 1 Prk

  • α⊤

u,jut|k ≤ βu,j

  • ≥ 1 − pu,j,

j = 0, . . . , Nc − 1 Ek

  • xk+N|k
  • ∈ X µ

f

Ek

  • (xk+N|k − E[xk+N|k])(xk+N|k − E[xk+N|k])⊤

Σf

slide-34
SLIDE 34

Theorem Given µk, Σk, X µ

f , Σf ≻ 0, and Pmean ≻ 0, and using the following control law

ut|k = vt|k + Kt|kyt|k yt+1|k = Ayt|k + Dwt yk|k = xk|k − µk|k the problem can be cast as a convex programming problem

slide-35
SLIDE 35

min

V ,K JN(µk, Σk; V , K) = tr

  • (I + BK)⊤ ¯

QP,cov(I + BK) + K⊤ ¯ RK

  • Σy
  • + (Aµk|k + BV )⊤ ¯

QP,mean(Aµk|k + BV ) + V ⊤ ¯ RV subject to α⊤

x,iEt−k

  • Aµk|k + BV
  • + Σ1/2

y

(I + BK)⊤E⊤

t−kαx,iΦ−1(1 − px,i) − βx,i ≤ 0

α⊤

u,jFt−kV + Σ1/2 y

K⊤F ⊤

t−kαu,jΦ−1 (1 − pu,j) − βu,j ≤ 0

EN

  • Aµk|k + BV
  • ∈ X µ

f

Σf EN(I + BK)Σy(I + BK)⊤E⊤

N

where µk|k = µk, Σk|k = Σk, Σy = AΣk|kA⊤ + DD⊤

slide-36
SLIDE 36

V =    vk|k . . . vk+N−1|k    , K =      Kk|k Kk+1|k ... Kk+N−1|k      ¯ QP,mean =      Q ... Q Pmean      , ¯ QP,cov =      Q ... Q      , ¯ R =    R ... R    .

slide-37
SLIDE 37

Note that the terminal covariance constraint Σf EN(I + BK)Σy(I + BK)⊤E⊤

N

can be converted to a linear matrix inequality (LMI)

  • Σf

EN(I + BK)Σ1/2

y

Σ1/2

y

(I + BK)⊤E⊤

N

I

slide-38
SLIDE 38

Stochastic MPC

min

uk|k,uk+1|k,...,uk+N−1|kJN(xk; uk|k, uk+1|k, . . . , uk+N−1|k) =

Ek k+N−1

  • t=k

x⊤

t|kQxt|k + u⊤ t|kRut|k

  • + Ek[xk+N|k]⊤PmeanEk[xk+N|k]

subject to xt+1|k = Axt|k + But|k + Dwt, xk|k = xk ∼ N(µk, Σk) Prk

  • α⊤

x,ixt|k ≤ βx,i

  • ≥ 1 − px,i,

i = 0, . . . , Ns − 1 Prk

  • α⊤

u,jut|k ≤ βu,j

  • ≥ 1 − pu,j,

j = 0, . . . , Nc − 1 Ek

  • xk+N|k
  • ∈ X µ

f

Ek

  • (xk+N|k − E[xk+N|k])(xk+N|k − E[xk+N|k])⊤

Σf

slide-39
SLIDE 39

Covariance Assignment

Definition The state covariance Σ ≻ 0 is assignable to the closed-loop system xk+1 = (A + B ˜ K)xk + Dwk if Σ satisfies Σ = (A + B ˜ K)Σ(A + B ˜ K)⊤ + DD⊤ where ˜ K is a state-feedback gain. The set of assignable state covariances Σ can be parameterized by the following set of LMIs (I − BB+)(Σ − AΣA⊤ − DD⊤)(I − BB+) = 0 Σ ≻ 0, Σ DD⊤

slide-40
SLIDE 40

Proposition (Collins and Skelton, 1987) Let Σ ≻ 0 be an assignable covariance matrix. Then all (stabilizing) assignability state-feedback gains ˜ K are parametrized by ˜ K = B+

  • (Σ − DD⊤)1/2G1

Ir T

  • G⊤

2 S−1 − A

  • + (Inu − B+B)Z

where T is an arbitrary orthogonal matrix, SS⊤ = Σ, Z is an arbitrary matrix, and G1 and G2 are defined from the singular-value decompositions (I − BB+)(Σ − DD⊤)1/2 = LΛG⊤

1

(I − BB+)AS = LΛG⊤

2

where Λ = diag(σ1, . . . , σr, 0, . . . , 0) with σ1 ≥ σ2 ≥ . . . ≥ σr > 0.

slide-41
SLIDE 41

Theorem Suppose that Σf is assignable, µf ∈ X µ

f , where X µ f is a positively invariant, such that for all

µ ∈ X µ

f

(A + B ˜ K)µ ∈ X µ

f

α⊤

x,iµ + Σf 1/2αx,iΦ−1(1 − px,i) − βx,i ≤ 0,

i = 0, . . . , Ns − 1 α⊤

u,j ˜

Kµ + Σf 1/2 ˜ K⊤αu,jΦ−1 (1 − pu,j) − βu,j ≤ 0, j = 0, . . . , Nc − 1 where ˜ K is from corresponding assignability gain matrix, and Pmean is the solution of the discrete-time Lyapunov equation (A + B ˜ K)⊤Pmean(A + B ˜ K) − Pmean + Q + ˜ K⊤R ˜ K = 0 Then, the solution ensures recursive feasibility and stability.

slide-42
SLIDE 42

Uncontrolled

  • 2
  • 1.5
  • 1
  • 0.5

0.5 1 1.5 2

x1

  • 2
  • 1.5
  • 1
  • 0.5

0.5 1 1.5 2

x2

slide-43
SLIDE 43

Infinite Horizon LQR

slide-44
SLIDE 44

CS-SMPC

slide-45
SLIDE 45

For More Details...

Goldshtein, M., and Tsiotras, P., “Finite-Horizon Covariance Control of Linear Time-Varying Systems,” 56th IEEE Conference on Decision and Control, Melbourne, Australia, Dec. 12–15, 2017, pp. 3606–3611. Ridderhof, J., and Tsiotras, P., “Uncertainty Quantification and Control During Mars Powered Descent and Landing using Covariance Steering,” AIAA Guidance, Navigation, and Control Conference, (AIAA 2018-1576), Kissimmee, FL, Jan. 8–12, 2018. Okamoto, K., Goldshtein, M., and Tsiotras, P., “Optimal Covariance Control for Stochastic Systems Under Chance Constraints,” IEEE Control Systems Letters, Vol. 2, No. 2, pp. 266–271, 2018. Okamoto, K. and Tsiotras, P., “Optimal Stochastic Vehicle Path Planning Using Covariance Steering,” IEEE Robotics and Automation Letters, Vol. 4, No. 3, pp. 2276–2281, 2019, Riderhoff, J., and Tsiotras, P., “Minimum-fuel Powered Descent in the Presence of Random Disturbances,” AIAA Guidance, Navigation, and Control Conference, San Diego, CA, Jan. 7–11, 2019 (best student paper award).

slide-46
SLIDE 46

For More Details...

Okamoto, K., and Tsiotras, P., “Stochastic Model Predictive Control for Constrained Linear Systems Using Optimal Covariance Steering,” http://arxiv.org/abs/1905.13296 Okamoto, K. and Tsiotras, P., “Input Hard Constrained Optimal Covariance Steering,” 58th IEEE Conference on Decision and Control, Nice, France, Dec. 11–13, 2019. Session ThA19.3 Stochastic Systems II Ridderhof, J., Okamoto, K. and Tsiotras, P., “Nonlinear Uncertainty Control with Iterative Covariance Steering,” 58th IEEE Conference on Decision and Control, Nice, France, Dec. 11–13,

  • 2019. Session ThA19.3 Stochastic Systems II
slide-47
SLIDE 47

Many Extensions

Output feedback covariance steering (Bakolas, 2019; Ridderhof and PT, 2020; Maity and PT, 2020) Input constrains (Okamoto and PT, 2019); see paper in Session ThA19.3 Stochastic Systems II Extension to nonlinear systems (Caluya and Halder 2019; Ridderhof, Okamoto, and PT, 2019); see paper in Session ThA19.3 Stochastic Systems II Differential games (Makkapatti, Okamoto and PT, 2019)