Evolution of the rate of evolution An analytical solution to the - - PowerPoint PPT Presentation

evolution of the rate of evolution an analytical solution
SMART_READER_LITE
LIVE PREVIEW

Evolution of the rate of evolution An analytical solution to the - - PowerPoint PPT Presentation

Evolution of the rate of evolution An analytical solution to the compound Poisson process


slide-1
SLIDE 1

Evolution of the rate of evolution — An analytical solution to the compound Poisson process

  • Stéphane Guindon

Department of Statistics, University of Auckland, New Zealand. LIRMM, UMR 5506 CNRS Montpellier, France.

Models of evolution of the rate of evolution The compound Poisson process: an analytical solution

slide-2
SLIDE 2

Outline

Models of evolution of the rate of evolution The compound Poisson process: an analytical solution

Models of evolution of the rate of evolution The compound Poisson process: an analytical solution

slide-3
SLIDE 3

A bit of history...

  • Linus Pauling and Emile Zuckerkandl (1962): “molecular

clock hypothesis”.

  • Allan Wilson (1967): molecular dating under the molecular

clock assumption.

  • 30 years passed...
  • Michael Sanderson (1997) and Jeffrey Thorne (1998):

estimation of evolutionary divergence times without the restriction of a uniform rate across lineages.

Models of evolution of the rate of evolution The compound Poisson process: an analytical solution

slide-4
SLIDE 4

Molecular clock rate and time estimation

?

  • l1

l2 l3 l4 l5 l6 l7 l8 t0 t1 t2

l5 = µ × (t1 − t0) ֒ → µ = l5 t1 − t0 t2 = l1 + l2 + l3 µ + t0

  • l1

l2 l3 l4 l5 l6 l7 l8 t0 t1 t2 µ1 µ2

µ1 = l4 + l3 − l1 − l2 t0 − t1 µ2 = l5 + l6 + l7 + l8 − l4 t1 − t2

Models of evolution of the rate of evolution The compound Poisson process: an analytical solution

slide-5
SLIDE 5

Beyond the molecular clock

  • Local clocks
  • Substitution rate is organised into a small number of classes,
  • Assign each branch to one of these classes.
  • Penalized likelihood
  • Ψ(R, T ): penalty term for rate changes,
  • Maximise log(P(D|R, T )) − λΨ(R, T ).
  • Bayesian approaches
  • Explicit stochastic models of the evolution of the

substitution rate.

  • Rate trajectory is continuous or discrete.

Models of evolution of the rate of evolution The compound Poisson process: an analytical solution

slide-6
SLIDE 6

Models of rate evolution (1/2)

  • Log-normal model
  • µ is the mean of the rate at the nodes that begin and end

the branch (r(0) and r(T )).

  • log(r(T )) ∼ N(log(r(0)), νT ).
  • Logarithm of the rate undergoes Brownian motion.
  • Correlation of mean rates on adjacent branches.
  • Exponential model
  • µ ∼ Exp(φ).
  • No correlation of mean rates.
  • Shape of the distribution does not depend on time duration.

Models of evolution of the rate of evolution The compound Poisson process: an analytical solution

slide-7
SLIDE 7

Models of rate evolution (2/2)

t0 t1 t2 T r0 r1 r2 r3

  • Compound Poisson process
  • Rates change in discrete jumps.
  • r(t) ∼ Γ(α, β)
  • Number of jumps: n(T ) ∼ Poisson(λT )
  • Correlation of mean rates across branches: governed by λ.
  • λT large: distribution of mean rate is approximately

Normal.

Models of evolution of the rate of evolution The compound Poisson process: an analytical solution

slide-8
SLIDE 8

Implementation of the compound Poisson process

  • “Jump” event: Poisson(λ∆t)
  • Substitution rates: Γ(α, β)

t0 t0 t0 t1 t1 t1 r0 r0 r1 r2 r2 r2

  • MCMC → posterior distribution of λ and α

1 2 3 4 0.0 0.2 0.4 0.6

λ

Density

0.0 0.5 1.0 1.5 2.0 2.5 3.0 0.0 0.2 0.4 0.6 0.8 1.0

α

Density

Models of evolution of the rate of evolution The compound Poisson process: an analytical solution

slide-9
SLIDE 9

Advantages and drawbacks

  • Log-normal
  • Computationally tractable
  • Crude (deterministic) description of the mean rates.
  • Biologically relevant ?
  • Exponential
  • Computationally tractable.
  • Distribution of mean substitution rate does not depend on

time duration.

  • No correlation of mean rates across branches.
  • Compound Poisson
  • Description of rate changes plausible from a biological

perspective.

  • Elegant way to account for correlation of mean rates across

branches.

  • No analytical solution.

Models of evolution of the rate of evolution The compound Poisson process: an analytical solution

slide-10
SLIDE 10

Outline

Models of evolution of the rate of evolution The compound Poisson process: an analytical solution

Models of evolution of the rate of evolution The compound Poisson process: an analytical solution

slide-11
SLIDE 11

First question

t0 t1 t2 T r0 r1 r2 r3

  • ri ∼ Γ(α, β). Hence, E(ri) = αβ, V (ri) = αβ2.
  • n ∼ Poisson(λT).
  • µ = n

i=0 kiri, where ki = ∆ti T .

What is the distribution of µ ?

  • Work out the distribution of µ for a given value of n.
  • µ = n

i=0 kiri is well approximated by a Gamma

distribution.

Models of evolution of the rate of evolution The compound Poisson process: an analytical solution

slide-12
SLIDE 12

One jump

t0 T r0 r1

  • µ = k0r0 + (1 − k0)r1
  • Distribution of t0 = k0T ?

P(t0 = x|n = 1) = λe−λx × e−λ(T−x) λTe−λT = 1 T .

  • k0 ∼ U[0, 1] → E(k0) = 1

2 and V (k0) = 1 12.

  • E(µ) = E(k0)E(r0) + E(1 − k0)E(r1) = αβ.
  • V (µ) = V (k0r0) + V ((1 − k0)r1) + 2Cov(k0r0, (1 − k0)r1) =

2 3αβ2.

Models of evolution of the rate of evolution The compound Poisson process: an analytical solution

slide-13
SLIDE 13

n ≥ 1 jumps

t0 t1 t2 T r0 r1 r2 r3

  • Distribution of k0 ?

P(t0 = x|n = y) = λe−λx × (λ(T − x))y−1e−λ(T−x) (λT)ye−λT /y! = y T y (T − x)y−1.

  • After little algebra...
  • E(k0) =

1 n+1,

  • E(k2

0) = 2 (n+1)(n+2).

Models of evolution of the rate of evolution The compound Poisson process: an analytical solution

slide-14
SLIDE 14

n ≥ 1 jumps

t0 t1 t2 T r0 r1 r2 r3

  • µ = k0r0 + k1r1 + k2r2 + k3r3.
  • µn = k0r0 + (1 − k0)µn−1.
  • E(µn) = E(k0)E(r0) + E(1 − k0)E(µn−1) → E(µn) = αβ .

Models of evolution of the rate of evolution The compound Poisson process: an analytical solution

slide-15
SLIDE 15

n ≥ 1 jumps

t0 t1 t2 T r0 r1 r2 r3

  • The variance is a bit more challenging but can be done.

V (µn) = 2αβ2 + n(n + 1)V (µn−1) (n + 1)(n + 2)

  • Solve the recursion:

V (µn) = 2 n + 2αβ2

Models of evolution of the rate of evolution The compound Poisson process: an analytical solution

slide-16
SLIDE 16

Likelihood calculation

  • Data:
  • l, an expected number of substitutions.
  • T , elapsed time.
  • µ = l/T
  • Likelihood:

pµ(u|λ, α, β, T) =

  • n=0

P(n|λ, T)pµn(u|α, β, n)

  • P(n|λ, T): Poisson distribution with mean and variance λT.
  • pµn(u|α, β, n): Gamma distribution with mean αβ, and

variance

2 n+2αβ2.

Models of evolution of the rate of evolution The compound Poisson process: an analytical solution

slide-17
SLIDE 17

The approximation seems good

Density 1 2 3 4 5 0.0 0.2 0.4 0.6 0.8

µ λ = 1E − 04 (E(n) = 0.001)

Density 1 2 3 4 5 0.0 0.2 0.4 0.6 0.8

µ λ = 0.1 (E(n) = 1)

Density 0.0 0.5 1.0 1.5 2.0 2.5 3.0 3.5 0.0 0.5 1.0 1.5

µ λ = 0.5 (E(n) = 5)

Density 0.0 0.5 1.0 1.5 2.0 2.5 0.0 0.5 1.0 1.5

µ λ = 1 (E(n) = 10) Models of evolution of the rate of evolution The compound Poisson process: an analytical solution

slide-18
SLIDE 18

Second question t0 t1 t2 S T r0 r1 r2 r3

  • Two adjacent time intervals: [0, S] and [S, T].
  • µ1 and µ2 mean rates in [0, S] and [S, T] respectively.
  • µ1 and µ2 are correlated because of r1.

What is the joint distribution of µ1 and µ2 ?

  • Work out the density pµ2|µ1(u2|u1, λ, α, T − S).

Models of evolution of the rate of evolution The compound Poisson process: an analytical solution

slide-19
SLIDE 19

Second question t0 t1 t2 S T r0 r1 r2 r3

  • I was unable to find an analytical expression...
  • First idea: integrate over t0 in [0, S], t1 in [S, T] and r1 in

[0, ∞]...

  • ...didn’t work.
  • Second idea: use an approximation.
  • ‘Many’ jumps in [0, T ]: µ1 and µ2 are independent.
  • No jump in [0, T ]: pµ2|µ1(u2|u1) = 1 if u2 = u1.

Models of evolution of the rate of evolution The compound Poisson process: an analytical solution

slide-20
SLIDE 20

Second question t0 t1 t2 S T r0 r1 r2 r3

  • Use a mixture model:
  • µ2|µ1 ∼ N(µ1, 0.01) with probability P(n = 0|λ, T ),
  • µ2|µ1 ∼ N(µ1, 0.04) with probability P(n = 1|λ, T ),
  • µ2|µ1 ∼ N(µ1, 0.09) with probability P(n = 2|λ, T ),
  • µ2 independent from µ1 with probability P(n > 2|λ, T ).

Models of evolution of the rate of evolution The compound Poisson process: an analytical solution

slide-21
SLIDE 21

pµ1,µ2(u1, u2|λ, α, T), E(n) = 10

Mixture Independent

Models of evolution of the rate of evolution The compound Poisson process: an analytical solution

slide-22
SLIDE 22

pµ1,µ2(u1, u2|λ, α, T), E(n) = 4

Mixture Independent

Models of evolution of the rate of evolution The compound Poisson process: an analytical solution

slide-23
SLIDE 23

pµ1,µ2(u1, u2|λ, α, T), E(n) = 2

Mixture Independent

Models of evolution of the rate of evolution The compound Poisson process: an analytical solution

slide-24
SLIDE 24

pµ1,µ2(u1, u2|λ, α, T), E(n) = 0.002

Mixture Independent

Models of evolution of the rate of evolution The compound Poisson process: an analytical solution

slide-25
SLIDE 25

Acknowledgements

  • The University of Auckland.
  • Dumont d’Urville programme:
  • Ministry of Research, Science & Technology, New Zealand.
  • EGIDE, France.
  • Allen Rodrigo, Olivier Gascuel and Vincent Lefort.

Models of evolution of the rate of evolution The compound Poisson process: an analytical solution