From Fourier to Koopman Spectral Methods for Long-term Time Series - - PowerPoint PPT Presentation

▶

May 29, 2023 658 likes •1.14k views

From Fourier to Koopman Spectral Methods for Long-term Time Series Prediction arXiv:2004.00574 Henning Lange, Steven L. Brunton, J. Nathan Kutz Objective > Given data snapshots from x t t = 1 t = T to > Predict temporal snapshots x

SLIDE 1

From Fourier to Koopman

Henning Lange, Steven L. Brunton, J. Nathan Kutz

Spectral Methods for Long-term Time Series Prediction

arXiv:2004.00574

SLIDE 2

> Given data snapshots from to   > Predict temporal snapshots > in the order of 10.000  > Assumption: > is produced by quasi-periodic system

xt t = 1 t = T xT+h h xt

Objective

SLIDE 3

Spatio-Temporal Systems

SLIDE 4

> Fourier Forecast > Similar to Fourier Transform > No implicit periodicity assumption  > Koopman Forecast > Based on Koopman theory > Fourier Transform in non-linear basis

Outline

SLIDE 5

> Fourier Forecast > Non-convex objective  > Koopman Forecast > Non-linear and non-convex objective > FFT allows for obtaining global optima

Outline

SLIDE 6

> Both learning objectives contain easy and hard to

ptimize parameters

> For both algorithms, the strategy for obtaining the global optimum of a single value of the hard to

ptimize parameters is introduced

> Apply coordinate descent > Alternately optimize hard and easy quantities

Solution strategy

SLIDE 7

Fourier Forecast

SLIDE 8

> Goal: Fit linear dynamical system to data

yt xt

Objective

E(A, B) =

∑

t=1

(xt − Ayt)2 yt = Byt−1

minimize subject to

Re[eig(B)] = 0

SLIDE 9

> Goal: Fit linear dynamical system to data

yt xt

Objective

E(A, ω) =

∑

t=1

xt − A sin(ω1t) ⋮ sin(ωNt) cos(ω1t) ⋮ cos(ωNt)

SLIDE 10

> Goal: Fit linear dynamical system to data

yt xt

Objective

E(A, ω) =

∑

t=1

(xt − AΩ(ωt))

SLIDE 11

> Goal: Fit linear dynamical system to data > Because of linearity of and > Analytic solution for > Symmetry relationship to Fourier Transform

yt xt A Ω ωi

Objective

E(A, ω) =

∑

t=1

(xt − AΩ(ωt))

SLIDE 12

Symmetry

E(A, ω) =

∑

t=1

(xt − AΩ(ωt))

Jaynes, E. T . "Bayesian spectrum and chirp analysis." Maximum-Entropy and Bayesian Spectral Analysis and Estimation Problems. Springer, Dordrecht, 1987. 1-37.

SLIDE 13

> For quasi-periodic systems, FT/error surface is superposition of sinc-functions

Spectral leakage

SLIDE 14

> Fast Fourier Transform

> evaluates the Fourier Transform at frequencies with period > harmful for forecasting > Gradient Descent > because of non-convexity, will get stuck in bad local minimum

Combining FFT and GD

SLIDE 15

> Use Fast Fourier Transform > to locate global valley of error surface

> Use Gradient Descent > to improve initial guess of FFT to break implicit periodicity assumptions

Combining FFT and GD

SLIDE 16

Combining FFT and GD

SLIDE 17

Koopman Forecast

SLIDE 18

Spatio-Temporal Systems

SLIDE 19

> Koopman showed in 1931: > any non-linear dynamical system can be lifted by non-linear but time-invariant function into space where time evolution is linear > Analogous to Cover’s theorem (1965) > Theoretical underpinning of Kernel methods and Deep Learning

Koopman Theory

Cover, T .M. (1965). "Geometrical and Statistical properties of systems of linear inequalities with applications in pattern recognition" (PDF). IEEE Transactions on Electronic Computers. EC-14 (3): 326–334 Koopman, Bernard O. "Hamiltonian systems and transformation in Hilbert space." Proceedings of the National Academy of Sciences of the United States of America 17.5 (1931): 315

SLIDE 20

Koopman Theory

Koopman: Cover:

f

SLIDE 21

Objective: Koopman

Ω(ωt) = sin(ω1t) ⋮ sin(ωNt) cos(ω1t) ⋮ cos(ωNt)

> Recap: Stable Linear Dynamical System

SLIDE 22

Objectives

E(Θ, ω) =

∑

t=1

(xt − fΘ(Ω(ωt)))

E(A, ω) =

∑

t=1

(xt − AΩ(ωt))

Koopman: Fourier:

SLIDE 23

Objectives

E(Θ, ω) =

∑

t=1

(xt − fΘ(Ω(ωt)))

Koopman:

SLIDE 24

Objective: Koopman

E(Θ, ω) =

∑

t=1

(xt − fΘ(Ω(ωt)))

Koopman:

Neural Network parameterized by Θ

SLIDE 25

Objective: Koopman

E(Θ, ω) =

∑

t=1

(xt − fΘ(Ω(ωt)))

Koopman:

Because of non-linearity, no analytical solution for

ωi

SLIDE 26

Objective: Koopman

E(Θ, ω) =

∑

t=1

(xt − fΘ(Ω(ωt)))

Koopman:

However, in spite of non-linearity and non-convexity, computing global optima in direction of possible!

ωi

SLIDE 27

Objective: Koopman

E(Θ, ω) =

∑

t=1

(xt − fΘ(Ω(ωt)))

Koopman: =

∑

t=1

L(Θ, ω, t) L(Θ, ω, t) = (xt − fΘ(Ω(ωt)))

SLIDE 28

Periodicity in loss

L(Θ, ω + 2π t , t) = (xt − fΘ(Ω((ω + 2π t )t)))

= (xt − fΘ(Ω(ωt)))

= L(Θ, ω, t)

SLIDE 29

Periodicity in loss

L(Θ, ω, t) = L(Θ, ω + 2π t , t) sin((ω + 2π t )t) = sin(ωt + 2π) = sin(ωt)

SLIDE 30

Periodicity in loss

L(Θ, ω, t) = L(Θ, ω + 2π t , t)

SLIDE 31

Computing the loss

For all , compute loss within

t 2π t

SLIDE 32

Computing the loss

For all , repeat computed loss times

t t

SLIDE 33

Computing the loss

For all , resample loss

SLIDE 34

Computing the loss

+ +

Sum all ‘temporally local’ losses

SLIDE 35

Computing the loss

+ + =

SLIDE 36

Easy and efficient to implement in freq. domain!

Computing the loss

for t in range(T): E_ft[range(K)*t] += fft(L[t]) E = ifft(E_ft)

SLIDE 37

Results

SLIDE 38

> Fourier algorithm has universal approximation properties on finite datasets > Sines and cosine form an orthogonal basis > which is periodic in > Analogous to Cover’s theorem, requires dimensional space

T N

Results: Theoretical

SLIDE 39

> For infinite data, Koopman algorithm is more expressive than Fourier counterpart

Results: Theoretical

SLIDE 40

> Close relationship to Bayesian Spectral analysis > Error grows linear in time and with noise variance > But shrinks superlinearly with amount of data

Results: Theoretical

Jaynes, E. T . "Bayesian spectrum and chirp analysis." Maximum-Entropy and Bayesian Spectral Analysis and Estimation Problems. Springer, Dordrecht, 1987. 1-37. Bretthorst, G. Larry. Bayesian spectrum analysis and parameter estimation. Vol. 48. Springer Science & Business Media, 2013.

| ̂ xt(ω) − ̂ xt(ω*)| ∈ 𝒫 ( t T3 ∑

σ2 Ai )

SLIDE 41

Results: Practical

xt = sin ( 2π 24 t)

+ ϵt

SLIDE 42

Results: Practical

SLIDE 43

Results: Practical

SLIDE 44

Results: Practical

SLIDE 45

Results: Practical

SLIDE 46

Spatio-Temporal Systems

SLIDE 47

> Fit linear and non-linear oscillators to data > non-convex and non-linear objective > Many real world phenomena are quasi-periodic > gait, (space) weather, fluid flows, epidemiological data, power systems, sales, room occupancy, …  > Code is available:

> https://github.com/helange23/from_fourier_to_koopman

From Fourier to Koopman

Objective

Spatio-Temporal Systems

Outline

Outline

Solution strategy

Fourier Forecast

Objective

Objective

Objective

Objective

Symmetry

Spectral leakage

Combining FFT and GD

Combining FFT and GD

Combining FFT and GD

Koopman Forecast

Spatio-Temporal Systems

Koopman Theory

Koopman Theory

f

Objective: Koopman

Objectives

Objectives

Objective: Koopman

Objective: Koopman

Objective: Koopman

Objective: Koopman

Periodicity in loss

Periodicity in loss

Periodicity in loss

Computing the loss

Computing the loss

Computing the loss

Computing the loss

+ +

Computing the loss

+ + =

Computing the loss

Results

Results: Theoretical

Results: Theoretical

Results: Theoretical

Results: Practical

Results: Practical

Results: Practical

Results: Practical

Results: Practical

Spatio-Temporal Systems

Summary