Applications of bivariate generalized Pareto distribution and the - - PowerPoint PPT Presentation

applications of bivariate generalized pareto distribution
SMART_READER_LITE
LIVE PREVIEW

Applications of bivariate generalized Pareto distribution and the - - PowerPoint PPT Presentation

Applications of bivariate generalized Pareto distribution and the threshold choice Toshikazu Kitano Nagoya Institute of Technology One of the biggest disasters in Japan, 2018 Mori, N. & T. Yasuda, T. Arikawa, T. Kataoka, S. Nakajo, K.


slide-1
SLIDE 1

Applications of bivariate generalized Pareto distribution and the threshold choice Toshikazu Kitano Nagoya Institute of Technology

slide-2
SLIDE 2

One of the biggest disasters in Japan, 2018

slide-3
SLIDE 3

Field Survey of 2018 Typhoon Jebi in Japan: Lessons for Disaster Risk Management, by Takabatake,T. et al., Geoscience, 2018. Mori, N. & T. Yasuda, T. Arikawa, T. Kataoka, S. Nakajo, K. Suzuki,

  • Y. Yamanaka: 2018 Typhoon Jebi Post-Event Survey of Coastal

Damage in the Kansai Region, Japan. Coastal Engineering Journal

slide-4
SLIDE 4
slide-5
SLIDE 5

台風21号による高潮は,第二室戸台風(昭和36年)を越える規模(ほぼ同程度) 淀川での高潮の河川遡上,高潮による水位が堤防高を超過.大阪府の3大水門の閉鎖による浸水回避. 淀川本川の3橋の防潮鉄扉(陸閘)の閉鎖(1979年以来)

slide-6
SLIDE 6

1959, September 26th Isewan Typhoon

* Large area innandated * Yodo River also damaged * Ship landed (like 2011)

slide-7
SLIDE 7
slide-8
SLIDE 8

This year 2019, September 9th Tyhoon no.15 Faxai behaved violently, ...

slide-9
SLIDE 9

Return Period Daily max Wind Speed [m/s] .5 1 2 5 10 20 50 100 200 15 20 25 30

Hamburg Hanover Bremerhaven Fehmarn Schleswig

Hanover Hamburg

Dependency of Extremes

When spatially near communities are attacked at the same time, disaster will expand more. Support, Back-up & Recovery will become more diffjcult than ...

slide-10
SLIDE 10

Hamburg Hanover Bremerhaven Fehmarn Schleswig

5 10 15 20 25 5 10 15 20 25 30 35

Wind Speed [m/s] at Hamburg Wind Speed [m/s] at Hanover

5 10 15 20 25 5 10 15 20 25 30 35

Wind Speed [m/s] at Hamburg Wind Speed [m/s] at Hanover

Return Period Daily max Wind Speed [m/s] .5 1 2 5 10 20 50 100 200 15 20 25 30

Rank(Hamburg) Rank(Hanover) 1 10 100 1000 10000 1 10 100 1000 10000

Hanover Hamburg

Corresponding relations

slide-11
SLIDE 11

5 10 15 20 25 5 10 15 20 25 30 35

Wind Speed [m/s] at Hamburg Wind Speed [m/s] at Hanover

5 10 15 20 25 5 10 15 20 25 30 35

Wind Speed [m/s] at Hamburg Wind Speed [m/s] at Hanover

Return Period Daily max Wind Speed [m/s] .5 1 2 5 10 20 50 100 200 15 20 25 30

Rank(Hamburg) Rank(Hanover) 1 10 100 1000 10000 1 10 100 1000 10000

Hanover Hamburg

Return Period [Year] at Hamburg Return Period [Year] at Hanover . . 2 5 20 50 200 . . 2 5 20 50 200

Return Period for Joint Occurrence

slide-12
SLIDE 12

Return Period Daily max Wind Speed [m/s] .5 1 2 5 10 20 50 100 200 15 20 25 30

Rank(Hamburg) Rank(Hanover) 1 10 100 1000 10000 1 10 100 1000 10000

Hanover Hamburg

1 10 100 1000 10000 0.0 0.2 0.4 0.6 0.8 1.0

Rank (decreasing Order), r Correlation Coef.

Hamburg & Hanover

x x x x x x x x x x x By formula By prod.−mom. By EM algo. r = 1281

r H(1) 100 1000 10000 0.5 1.0 1.5 2.0 1 10

  • Thresh. Choice by the conventional method
  • Thresh. Choice by the method of

correlation coeffjcient

  • f occurrence numbers

Extremes by POT requires a threshold. (Peaks Over Threshold)

  • Thresh. Choice should

be determined by the correlations, because the dependence problem can be solved by the properties of dependence. It is a logic.

slide-13
SLIDE 13

http://www.jamstec.go.jp/tougou/program/index.html

slide-14
SLIDE 14

To evaluate the uncertainty requires the extreme value theory and the statistical techniques to the applications.

slide-15
SLIDE 15

http://www.miroc-gcm.jp/~pub/d4PDF/index_en.html

slide-16
SLIDE 16

The importance of bivariate extreme statistics:

Overlap of several hazards: storm surge, high waves, river runoff and

  • od . accumulates risk and its prediction will become troublesome.

Simultaneous occurrences (or joint occurrences) at several sites (at least two important sites) also aggregate the loss by damage, and they will enlarge the loss more than the proportional one. The importance of statistical distribution of bivariate extremes is now increasing in disaster risk reduction plan, but the bivariate GP distribution has been not yet developed enough for those applications.

slide-17
SLIDE 17

One of the reasons is the unclearness of mathematical understanding of the multivariate (bivariate) extremes for the practical engineers.

Fu = λ∗(yA ∧ uA, yB ∧ uB) − λ∗(yA, yB) λ∗(uA, uB)

D1 (uA, uB) (yA, yB) (uA, uB) (yA, yB) D2

¯ Fu(yA, yB) = 1 − Fu(yA, yB) = λ∗(yA, yB) λ∗(uA, uB)

case 1 case 2 Here let us give a glance to the probability distribution:

Fu(yA, yB) = λB(uB) − λB(yB) − {λAB(yA, uB) − λAB(yA, yB)} λ∗(uA, uB)

where we make the function λ∗ endlessly, ... One of the simplest ones is:

λ∗(yA, yB) =

  • λ 1/α

A (yA) + λ 1/α B (yB)

α

slide-18
SLIDE 18

And therefore, though GP distribution requires a suitable threshold to extract the extremes of hazard magnitudes, the methods of threshold choice have not been enough discussed for bivariate extremes. This research focuses on the orro o o orr r,

  • y md ro ord d o

wind velocities at two cities. And the numerous datasets of daily rainfall in d4PDF are also applied to nonparametric analysis of bivariate extremes to demonstrate the spacial change of depemdence of the pairwise points against the distances. dependent <--- ---> independent Kasukabe to Tsukuba, Kamagatani, Sano

slide-19
SLIDE 19
slide-20
SLIDE 20

Hamburg Hanover 5 10 15 20 25 5 10 15 20 25 30 Hamburg Hanover 5 10 15 20 25 5 10 15 20 25 30

Counting the excess numbers is another way of evaluating extremes, ...

kA(uA) =

n

  • i=1

1{YA(i) > uA}, kB(uB) =

n

  • i=1

1{YB(i) > uB}

kA = 4 kB = 3 uB uA

Hamburg Hanover 5 10 15 20 25 5 10 15 20 25 30 Hamburg Hanover 5 10 15 20 25 5 10 15 20 25 30

k∗ = 5 kAB = 2 (uA, uB) uB uA

Inclusive occurrence number Joint occurrence number

k∗(uA, uB) =

n

  • i=1

1{YA(i) > uA} ∨ 1{YB(i) > uB} kAB(uA, uB) = kA(uA) + kB(uB) − k∗(uA, uB)

slide-21
SLIDE 21
slide-22
SLIDE 22

For one-component, it will be easier to understand the relation between the Poisson disitribution and the extreme variable. The uni-variate Poisson distribution is described in terms of the mean rate as follows: The case of no occurrence gives the cumulative distribution (= non-exceedance probability) function: where the rate function is set to then Eq.(*) corresponds to a GEV (Generalized Extreme Value) distribution.

f(k) = λk

1

k! e−λ1 f(k = 0) = e−λ1 → λ1(y) =

  • 1 + ξ y − µ1

σ1 −1/ξ G1(y) = exp

  • 1 + ξ y − µ1

σ1 −1/ξ G1(y) = e− λ1

  • λ1 = λ1(y) = e− λ1(y)
slide-23
SLIDE 23

Key'2’:''A'Poisson'distribu=on'into'an'extreme'value'd.' *'Univariate'case:'

p(kx = 0) = {λ1(x)}0 0! e−λ1(x) = exp ( − ✓ 1 + ξx x − µx,1 σx,1 ◆− 1/ξx)

No'occurrence'prob.''' '''''''''''''''''='cumula=ve'prob.'distribu=on'of'max. *'Bivariate'case:' Therefore'the'inclusive'occurrence'rate'becomes'' Important'in'the'theore=cal'treatment.

p(kx,1 = 0, ky,1 = 0) = e−λ∗,1(x,y) = F1(x, y)

= F1(x)

slide-24
SLIDE 24

λ∗,1(x, y) = E(k∗) = E X

i

1 {Xi > x} ∨ 1 {Yi > y}

= Z 1 Z ∞

x1 t ∧ y1 1−t

drdH(t) r2

= Z 1 ✓ t x1 ∨ 1 − t y1 ◆ dH(t)

= Z 1 {tλ1(x) ∨ (1 − t)λ1(y)} dH(t)

A(ω) = λ∗,1(x, y) λ1(x) + λ1(y) = Z 1 {t(1 − ω) ∨ (1 − t)ω} dH(t)

ω = λ1(y) λ1(x) + λ1(y) = x1 x1 + x2

= Z 1 ⇢ r > x1 t ∧ y1 1 − t

  • dV (r, t)

= Z 1 ⇢ x0

1

x1 ∨ y0

1

y1 > 1

  • dV (x0

1, y0 1)

y0

1 = r(1 − t)

x0

1 = rt

^84"@@C'u 647':9-$0'g i8.A"47%'7$E$47$4#'>54.=94

E%$579'E9@"-'.99-784"#$% X'8%'.94%#-"84$7'94@C'YC&

Z 1 dH(t) = 2

Z 1 tdH(t) = Z 1 (1 − t)dH(t) = 1

slide-25
SLIDE 25

Essential for bivariate extremes: Homogeneity (of order - 1) whose property shows the proportionality & similarity of the occurrence rate. And it will be checked by using the sample data.

Rank for Wind Speed in Hamburg Rank for Wind Speed in Hanover 40 1 10 100 1000 10000 20 1 10 100 1000 10000 Rank for Wind Speed in Hamburg Rank for Wind Speed in Hanover 8 1 100 1000 10000 1 10 100 1000 10000 4

25 15 5 25 + 15 = 40 5 + 3 = 8 5 + 15 = 20 1 + 3 = 4 5 3 1 where we know ranks in decreasing order indicate the occurrence numbers.

slide-26
SLIDE 26

Correlation coef. of occurrence numbers is given by which is based on the bivariate Poisson distribution, and it will be estimated by sample, as for the common number of rank j.

  • cf. c.c. by prod. mom. est.

ρxy = λxy

  • λx λy

˜ ρxy = (kx − ¯ kx)(ky − ¯ ky) (kx − ¯ kx)2 (ky − ¯ ky)2

Rank(Hamburg) Rank(Hanover)

1 10 100 1000 10000 1 10 100 1000 10000 1 10 100 1000 10000 0.0 0.2 0.4 0.6 0.8 1.0 Rank (decreasing Order), j Correlation Coef. Hamburg & Hanover x x x x x x By formula By prod.−mom. r = 1281

ˆ ρxy = kxy j

slide-27
SLIDE 27

Con=ngency'tables'for'excess'&'no'excess Perfect'dependent

↔ kxy = 0

k∗ = kx ∨ ky k∗ = kx + ky

Independent

ux \ uy Excess No Excess Total Excess kxy kx − kxy kx No Excess ky − kxy (n − kx − ky + kxy) (n − kx) Total ky (n − ky) (n)

ux \ uy Excess No Excess Total Excess 20 25 45 No Excess 19 (667) (686) Total 39 (692) (731)

ux \ uy Excess No Excess Total Excess 45 45 No Excess 39 (647) (686) Total 39 (692) (731) ux \ uy Excess No Excess Total Excess 39 6 45 No Excess (686) (686) Total 39 (692) (731)

Daily'max.' 'wind'speeds' for''2'years

slide-28
SLIDE 28

r H(1) 1 10 100 1000 10000 0.5 1.0 1.5 2.0

Conventional method uses the identity equation: what this equation stands for?

  • > The occurrrence numbers are the same in the following two regions.

(The red line is a contourline of 1/rank_A + 1/rank_B.)

H(1) = 2

However this method will be overestimated, as seen in this example where we can take r= 1281, while the C.C. is not stable. Actually the stable data is limited around 100.

Sample Pareto Margin for Hamburg Sample Pareto Margin for Hanover .0002 .001 .01 .1 1 .0002 .001 .01 .1 1

1 10 100 1000 10000 0.0 0.2 0.4 0.6 0.8 1.0 Rank (decreasing Order), j Correlation Coef. Hamburg & Hanover x x x x x x By formula By prod.−mom. r = 1281

slide-29
SLIDE 29

The important thing is that demension reduction is possible by transforming the occurrence rate λ∗(x, y) into the Pickands dependence function A(t). (Bivariate extreme distribution is so simple that there are included the wide range of the mathematical functions for the distributions.)

λ∗(x, y) = E(k∗) = E

  • i

1{Xi > x} ∨ 1{Yi > y} = · · · omitting the details of derivation · · · = 1 {ωλA(x)} ∨ {(1 − ω)λB(y)} dH(ω) A(t) = 1 {ω(1 − t)} ∨ {(1 − ω)t} dH(ω) t = λB(y) λA(x) + λB(y) = 1/λA(y) 1/λA(x) + 1/λB(y)

r d rr r and the radius (lengthwise) variable It is the pseudo polar coordinates.

r = 1/λA(x) + 1/λB(y)

slide-30
SLIDE 30

t A(t) 0.0 0.5 1.0 0.5 1.0

independent p e r f e c t d e p e n d e n t p e r f e c t d e p e n d e n t

A(t) ≡ 1 A(t) = t A(t) = 1 − t

t Density 0.0 0.5 1.0 0.0 0.5 1.0 1.5 0.005 < r < 0.02

Sample Pareto Margin for Hamburg Sample Pareto Margin for Hanover .0002 .001 .01 .1 1 .0002 .001 .01 .1 1 r=0.02 r=0.01 r=0.005

Pickands dependence function shows well the dependency properties.

ti = pi/ri ri = pi + qi pi (= 1/rankA) qi (= 1/rankB)

pseudo-polar coordinate

˜ Ai = k∗ rankA + rankB

slide-31
SLIDE 31

We analyze the spatial dependence of daily precipitations by using d4PDF.

slide-32
SLIDE 32

<- Cor.

Coef.

<- Pickands Dependent Function <- Transverse Density <- Contours

  • f Retrun

Period of joint

  • ccurrences
slide-33
SLIDE 33

Conclusions

What we show here today is actually not new in extreme value theory. But these facts have not been well known for hydrology and water-lerated engineerings (Coastal and Hydraulic Eng.). Parametric approach as well as non-parametric approach should be applied, because the wide range of parametric functions is possible to describe the joint (or inclusive) occurrence rate. It is so important to examine the joint occurrence rates (and the return period) for the accumulative risk. For the future works, the bivariate extreme analysis should be extended to the spatial modeling of the whole river basin and the comprehensive coastal zone.

Return Period [Year] at Hamburg Return Period [Year] at Hanover . . 2 5 20 50 200 . . 2 5 20 50 200

slide-34
SLIDE 34