Modelling discontinuities in simulator output using Voronoi - - PowerPoint PPT Presentation

β–Ά
modelling discontinuities in simulator output using
SMART_READER_LITE
LIVE PREVIEW

Modelling discontinuities in simulator output using Voronoi - - PowerPoint PPT Presentation

School of something School of Mathematics FACULTY OF OTHER FACULTY OF MATHEMATICS AND PHYSICAL SCIENCES Modelling discontinuities in simulator output using Voronoi tessellations John Paul Gosling (University of Leeds) and Chris Pope , Jill


slide-1
SLIDE 1

School of something

FACULTY OF OTHER

School of Mathematics

FACULTY OF MATHEMATICS AND PHYSICAL SCIENCES

Modelling discontinuities in simulator

  • utput using Voronoi tessellations

John Paul Gosling (University of Leeds) and Chris Pope, Jill Johnson and Stuart Barber (University of Leeds) Paul Blackwell (University of Sheffield)

slide-2
SLIDE 2

Overview

  • 1. Why bother with discontinuities?
  • 2. Attempts to split the space of interest.
  • 3. Sampling to find discontinuities.
  • 4. Application in climate science.

DISCLAIMER: This is (still) work-in-progress: there are many aspects that we need to sort out and improve.

slide-3
SLIDE 3

Motivation

Heterogeneity can

  • ccur in spatial

processes. Discontinuities can create challenges for modelling. Transformations can only get us so far.

slide-4
SLIDE 4

Motivation

Heterogeneity can

  • ccur in spatial

processes. Discontinuities can create challenges for modelling. Transformations can only get us so far.

slide-5
SLIDE 5

Motivation

Heterogeneity can

  • ccur in spatial

processes. Discontinuities can create challenges for modelling. Transformations can only get us so far.

slide-6
SLIDE 6

GP emulation (or kriging)

Our regression building block for this talk is a Gaussian process regression model: 𝑔 . ~ 𝐻𝑄 𝑛 . , 𝜏*𝑑 . , . . We observe 𝑔 . at a limited number of points, and we can update this prior. We have used both Gaussian and MatΓ©rn correlation functions.

slide-7
SLIDE 7

1d example

slide-8
SLIDE 8

1d example

slide-9
SLIDE 9

1d example

slide-10
SLIDE 10

1d example

slide-11
SLIDE 11

1d example

slide-12
SLIDE 12

1d example

slide-13
SLIDE 13

1d example

slide-14
SLIDE 14

1d example

slide-15
SLIDE 15

1d example

slide-16
SLIDE 16

1d example

slide-17
SLIDE 17

Classification trees

Classification trees are learning analogues of decision trees.

slide-18
SLIDE 18

Classification trees

Treed Gaussian processes were designed to split space into heterogeneous areas.

Nice R implementation: tgp package.

slide-19
SLIDE 19

Classification trees

Treed Gaussian processes were designed to split the space into heterogeneous areas.

Nice R implementation: tgp package.

slide-20
SLIDE 20

Voronoi tessellations

Tiles are defined completely by a set of centres. A point lies on a tile if it is closest to that tile’s centre. A point lies on a boundary if it is equally close to more than one centre.

slide-21
SLIDE 21

Voronoi tessellations

Note that our β€œregions” do not need to be made up of neighbouring tiles.

slide-22
SLIDE 22

Our model

Input space is divided into disjoint regions: each contain a number of Voronoi tiles. Each region has an independent GP model: where 𝜌(. |. ) denotes a multivariate normal pdf derived from the GP model. We have extended the model of Kim et al. (2005) in several ways. π‘š 𝒛 π’š,𝒄, 𝜸, 𝝉*, 𝒖 ∝ 8 𝜌 𝒛9|π’š9,𝒄9,𝜏9

*,𝜸9, 𝒖 , : 9;<

slide-23
SLIDE 23

Priors

The prior for the region specification is 𝜌 𝒖 = 𝜌 𝑛, 𝒅 𝜌 𝑨 𝑛 𝜌 𝑹 𝑛, 𝑨 . And an additional prior constraint that says we can only have a region if there are enough training points to fit a GP.

Poisson point process with some sensible intensity Discrete uniform over [1,m] Discrete uniform over

  • rdered partitions:

1, 𝑛BC Bell number

slide-24
SLIDE 24

Implementation

Reversible-jump-MCMC: GP MAP estimates within birth/death/move and relationship-change MH;

slide-25
SLIDE 25

Implementation

Reversible-jump-MCMC: GP MAP estimates within birth/death/move and relationship-change MH; Start off 100 MCMC chains from random points in model space; Run each chain for 10,000 iterations and checking for autocorrelation and convergence;

slide-26
SLIDE 26

Implementation

Reversible-jump-MCMC: GP MAP estimates within birth/death/move and relationship-change MH; Start off 100 MCMC chains from random points in model space; Run each chain for 10,000 iterations and checking for autocorrelation and convergence; Hope that you have something that has converged… We need to be wary of identifiability issues and local maxima.

slide-27
SLIDE 27

Toy example

Another step function.

slide-28
SLIDE 28

Toy example

Predictive mean Voronoi GP model Standard GP model

slide-29
SLIDE 29

Toy example

Predictive mean Voronoi GP model Treed GP model

slide-30
SLIDE 30

Toy example

Predictive mean Treed GP model MAP division

slide-31
SLIDE 31

Toy example

From our posterior, we can get various plausible tessellations.

slide-32
SLIDE 32

Toy example

Predictive mean Voronoi GP model Predictive std dev.

slide-33
SLIDE 33

Toy example

We can also look at our MAP tessellation and the probabilities

  • f getting different numbers of

regions.

slide-34
SLIDE 34
slide-35
SLIDE 35
slide-36
SLIDE 36
slide-37
SLIDE 37
slide-38
SLIDE 38

MSE: 1.98 MSE: 1.84

slide-39
SLIDE 39

Spatial statistics example

In the context of traditional kriging, we considered ammonia concentration at ground level across 250 US sites (2007).

slide-40
SLIDE 40

Spatial statistics example

slide-41
SLIDE 41

Spatial statistics example

NH4

slide-42
SLIDE 42

Proposing new points

To improve estimation, we could 1) Target areas with high uncertainty; 2) Just continue with a space filling theme; 3) Try to improve our estimation of the region boundaries. Finding points that lie on a boundary in 2d is relatively simple.

slide-43
SLIDE 43

Finding the boundary

slide-44
SLIDE 44

Finding the boundary

slide-45
SLIDE 45

Finding the boundary

slide-46
SLIDE 46

Finding the boundary

slide-47
SLIDE 47

Finding the boundary

slide-48
SLIDE 48

Finding the boundary

slide-49
SLIDE 49

Finding the boundary

slide-50
SLIDE 50

Finding the boundary

slide-51
SLIDE 51

Proposing new points

Algorithm that lacks subtlety:

1) Randomly choose a centre within region of interest. 2) Randomly choose a point on the boundary from that centre’s Voronoi tile. 3) Check if the point is on edge of region, and keep if it is. 4) Repeat 1-3 many times to get candidate set.

slide-52
SLIDE 52

Proposing new points

Algorithm that lacks subtlety:

1) Randomly choose a centre within region of interest. 2) Randomly choose a point on the boundary from that centre’s Voronoi tile. 3) Check if the point is on edge of region, and keep if it is. 4) Repeat 1-3 many times to get candidate set.

Then attempt to maintain space-filling property:

1) Find point in candidate set that is furthest from the training points. 2) Add that point to set of training points. 3) Find point in remaining candidate set that is furthest from the training points and the added point. 4) Add that point to set of training points. 5) Continue repeating process until enough points are found.

slide-53
SLIDE 53

Toy example revisited

We decide that we can afford to sample at five extra points.

slide-54
SLIDE 54

Toy example revisited

First data set Second data set

slide-55
SLIDE 55

Toy example revisited

We decide that we can afford to sample again at five extra points.

slide-56
SLIDE 56

Toy example revisited

Second data set Third data set

slide-57
SLIDE 57

Modelling a Cloud Field

Cloud fields are a prime example of non-stationary behaviour in the natural world.

Coverage fraction stratocumulus stratiformis

slide-58
SLIDE 58

Modelling a Cloud Field

Exploring the sensitivity of cloud fraction to uncertainty in aerosol concentration. Eddy/cloud resolving model (System for Atmospheric Modelling) Grid mesh: Dx = Dy = 200 m, Dz = 10 m, Dt = 2 s; Domain size: 40 km x 40 km x 1.5 km. The model is reasonably expensive: approx. 3 hours per run, with 240 cores on a 760 Tflop Cray computer cluster.

slide-59
SLIDE 59

Modelling a Cloud Field

We have run an ensemble of simulations according to a Latin hypercube design of size 105 over a 6d parameter space.

slide-60
SLIDE 60
slide-61
SLIDE 61

Emulation results

The MAP model has two regions: one with 87 centres and the

  • ther with 18.

We have posterior probabilities of: Pr(1 region) = 0.14, Pr(2 regions) = 0.66, Pr(3 regions) = 0.20. We have 30 β€œtest” runs of the cloud model. We can also perform other standard emulator diagnostics.

Method MSE of prediction Standard GP 0.028 Treed GP 0.032 Voronoi GP 0.016

slide-62
SLIDE 62

Visualisation difficulties

Here are points that lie on the boundary between regions (based on the MAP estimate).

slide-63
SLIDE 63

Visualisation difficulties

Each square in the picture gives the proportion of points that fall in region 1 when we consider the 4d grid of points for

j

x

  • i

that particular x combination. Darker -> higher proportion.

slide-64
SLIDE 64

Cloud modelling next steps

  • We have rerun the analysis including the validation points

and found a new MAP boundary.

  • We have now passed 30 candidate points to the model
  • wners to help us refine the region boundaries.

We need to think of a sensible way to describe this (potentially) 4d region to them…

slide-65
SLIDE 65

Updated results

Each square in the picture gives the proportion of points that fall in region 1 when we consider the 4d grid of points for

j

x

  • i

that particular x combination. Darker -> higher proportion.

slide-66
SLIDE 66

Possible extensions

  • Why stop at straight lines and convex regions?
  • Why stick to Gaussian processes?
  • Our approach is related to k-nearest-neighbour classification

and regression – ML methods for computations and visualisations?

slide-67
SLIDE 67

Possible extensions

  • Why stop at straight lines and convex regions?
  • Why stick to Gaussian processes?
  • Our approach is related to k-nearest-neighbour classification

and regression – ML methods for computations and visualisations?

Multiplicatively weighted Voronoi

slide-68
SLIDE 68

Possible extensions

  • Why stop at straight lines and convex regions?
  • Why stick to Gaussian processes?
  • Our approach is related to k-nearest-neighbour classification

and regression – ML methods for computations and visualisations?

Standard Voronoi with city-block distance.

slide-69
SLIDE 69

References

Gramacy, R. B., & Lee, H. K. H. (2008). Bayesian treed Gaussian process models with an application to computer modeling. Journal of the American Statistical Association, 103(483), 1119-1130. Kim, H. M., Mallick, B. K., & Holmes, C. C. (2005). Analyzing nonstationary spatial data using piecewise Gaussian processes. Journal of the American Statistical Association, 100(470), 653-668. Pope, C. et al. (2017). Modelling spatial heterogeneity and discontinuities with Voronoi

  • tessellations. In preparation as it has been since 2007…