Tree-like reticulation networks Andrew R Francis Centre for - - PowerPoint PPT Presentation

tree like reticulation networks
SMART_READER_LITE
LIVE PREVIEW

Tree-like reticulation networks Andrew R Francis Centre for - - PowerPoint PPT Presentation

Tree-like reticulation networks Andrew R Francis Centre for Research in Mathematics University of Western Sydney Australia Phylomania 2014. Andrew R Francis (CRM @ UWS) November 2014 1 / 13 Andrew R. Francis, Mike Steel Tree-like


slide-1
SLIDE 1

Tree-like reticulation networks

Andrew R Francis

Centre for Research in Mathematics University of Western Sydney Australia

Phylomania 2014.

Andrew R Francis (CRM @ UWS) November 2014 1 / 13

slide-2
SLIDE 2

Andrew R. Francis, Mike Steel Tree-like reticulation networks - when do tree-like distances also support reticulate evolution? Mathematical Biosciences, in press (arXiv:1405.2965).

Andrew R Francis (CRM @ UWS) November 2014 2 / 13

slide-3
SLIDE 3

Tree metrics

◮ A phylogenetic tree with edge weights defines a metric on the leaves:

◮ add weights on the unique path.

u v w y 2 1 1 6 2 3 u v w y u 3 5 10 v 6 11 w 11 y

◮ A metric that can be placed on a tree is called a tree metric.

Andrew R Francis (CRM @ UWS) November 2014 3 / 13

slide-4
SLIDE 4

Tree metrics

◮ A phylogenetic tree with edge weights defines a metric on the leaves:

◮ add weights on the unique path.

u v w y 2 1 1 6 2 3 u v w y u 3 5 10 v 6 11 w 11 y

◮ A metric that can be placed on a tree is called a tree metric. ◮ A metric d is a tree metric if and only if it satisfies the four point

condition:

◮ for all quartets of leaves {u, v, w, y}, two out of

d(u, v) + d(w, y), d(u, w) + d(v, y), d(u, y) + d(v, w) are equal, and are greater than or equal to the other one.

Andrew R Francis (CRM @ UWS) November 2014 3 / 13

slide-5
SLIDE 5

Tree metrics

◮ A phylogenetic tree with edge weights defines a metric on the leaves:

◮ add weights on the unique path.

u v w y 2 1 1 6 2 3 u v w y u 3 5 10 v 6 11 w 11 y

◮ A metric that can be placed on a tree is called a tree metric. ◮ A metric d is a tree metric if and only if it satisfies the four point

condition:

◮ for all quartets of leaves {u, v, w, y}, two out of

d(u, v) + d(w, y), d(u, w) + d(v, y), d(u, y) + d(v, w) are equal, and are greater than or equal to the other one.

◮ In the above we have 3 + 11 = 14, 5 + 11 = 16 and 10 + 6 = 16. Andrew R Francis (CRM @ UWS) November 2014 3 / 13

slide-6
SLIDE 6

Reticulated networks

Any metric may be able to be represented on a reticulated network: reticulation vertices tree vertices

slide-7
SLIDE 7

Reticulated networks

Any metric may be able to be represented on a reticulated network: reticulation vertices tree vertices hybridization vertex HGT vertex

slide-8
SLIDE 8

Reticulated networks

Any metric may be able to be represented on a reticulated network: reticulation vertices tree vertices hybridization vertex HGT vertex tree arc reticulation arc

Andrew R Francis (CRM @ UWS) November 2014 4 / 13

slide-9
SLIDE 9

So we have

  • 1. 4PC satisfied =

⇒ there is a tree that can represent the metric.

  • 2. 4PC not satisfied =

⇒ there may be a reticulated network that can represent the metric. (Note: the 4PC statement is an if-and-only-if).

◮ What’s missing?

Andrew R Francis (CRM @ UWS) November 2014 5 / 13

slide-10
SLIDE 10

So we have

  • 1. 4PC satisfied =

⇒ there is a tree that can represent the metric.

  • 2. 4PC not satisfied =

⇒ there may be a reticulated network that can represent the metric. (Note: the 4PC statement is an if-and-only-if).

◮ What’s missing? ◮ The 4PC does not rule out a tree metric also being representable on a

reticulated network.

◮ One side of the if-and-only-if is an existence statement. Andrew R Francis (CRM @ UWS) November 2014 5 / 13

slide-11
SLIDE 11

Take home message of this talk:

◮ A tree metric can also be represented on reticulated networks using

average distances.

◮ We are able to characterise this precisely in some cases, but not all yet!

. . . now to clarify what is meant by average distances . . .

Andrew R Francis (CRM @ UWS) November 2014 6 / 13

slide-12
SLIDE 12

Metrics on reticulated networks.

◮ Let T(N) be the set of trees “displayed” by N. E.g. a b c

w1 w3 w2 w4 w5 α 1 − α

a c

w1 w3 w2 + w4 w5

a c

w2 w4 w1 + w3 w5

b b b ◮ For the purposes of distance, we treat a reticulated network N as the

weighted sum of the trees in T(N):

Andrew R Francis (CRM @ UWS) November 2014 7 / 13

slide-13
SLIDE 13

Metrics on reticulated networks.

  • 1. HGT.

a

w1 w2 w3 w4

c

α 1 − α w5

b

d(a, b) = α(w1 + w5) + (1 − α)(w1 + w2 + w3 + w5) = w1 + w5 + (1 − α)(w2 + w3) d(b, c) = w4 + w5 + α(w2 + w3) d(a, c) = w1 + w2 + w3 + w4

  • 2. Hybridization.

a

w1 w2 w3 w4

c

w5 w6

b

α d(a, b) = w1 + w5 + w6 d(b, c) = α(w4 + w6) + (1 − α)(w2 + w3 + w4 + w5 + w6) = w4 + w6 + (1 − α)(w2 + w3 + w5) d(a, c) = α(w1 + w5 + w4)+ (1 − α)(w1 + w2 + w3 + w4) = w1 + w4 + αw5 + (1 − α)(w2 + w3)

Andrew R Francis (CRM @ UWS) November 2014 8 / 13

slide-14
SLIDE 14

Results

Theorem

Suppose that all the trees in T(N) are isomorphic as unrooted phylogenetic X-trees to some tree T. Then dN is a tree metric that is represented by T.

◮ For example, if there is a single reticulation near the root, the network

is treelike.

◮ We can be more precise.

Andrew R Francis (CRM @ UWS) November 2014 9 / 13

slide-15
SLIDE 15

Tree-like hybridization networks

Theorem

Let X be a finite set of taxa, and suppose d is a metric on X.

◮ If d is a binary tree metric, then it is a metric on a primitive

1-hybridization network N.

◮ If N is a hybridization network, and d is a tree metric on N, then N is

either a tree, or is a primitive 1-hybridization network.

Andrew R Francis (CRM @ UWS) November 2014 10 / 13

slide-16
SLIDE 16

Tree-like hybridization networks

Theorem

Let X be a finite set of taxa, and suppose d is a metric on X.

◮ If d is a binary tree metric, then it is a metric on a primitive

1-hybridization network N.

◮ If N is a hybridization network, and d is a tree metric on N, then N is

either a tree, or is a primitive 1-hybridization network.

Theorem

For each tree metric on n leaves, there are 4(n − 3) 1-hybridization networks that realise the metric. Proof:

A B C D A B C D B A C D A B C D A B D C

Andrew R Francis (CRM @ UWS) November 2014 10 / 13

slide-17
SLIDE 17

Tree-like HGT networks

◮ Let N be an HGT network with TN the underlying tree (delete all

reticulation arcs).

Lemma

If each reticulation arc in N is between adjacent tree-arcs of TN, then dN is tree-like on TN.

◮ This means we have huge numbers of reticulated (HGT) networks

whose metrics are tree-like!

Andrew R Francis (CRM @ UWS) November 2014 11 / 13

slide-18
SLIDE 18

Tree-like HGT networks

◮ Let N be an HGT network with TN the underlying tree (delete all

reticulation arcs).

Lemma

If each reticulation arc in N is between adjacent tree-arcs of TN, then dN is tree-like on TN.

◮ This means we have huge numbers of reticulated (HGT) networks

whose metrics are tree-like!

Theorem

Suppose dN is a metric from an HGT network N with a single reticulation arc. Then dN is tree-like if and only if that arc is either

  • 1. from one arc to an adjacent arc, or
  • 2. between a root arc and one of the two children of the other root arc.

The only tree that harbours a representation for dN is TN.

Andrew R Francis (CRM @ UWS) November 2014 11 / 13

slide-19
SLIDE 19

Strange magic

◮ There are 2-reticulated HGT networks N that can be represented on

TN and (for other parameter settings) on a tree that is different from TN, even when the mixing distribution treats the two reticulations independently.

1 2 3 4 c a ∗ ∗ * d e * b α α′ ◮ Setting α ≥ 1 2, b ≥ a and α′a = (1 − α′)c gives 14|23.

Andrew R Francis (CRM @ UWS) November 2014 12 / 13

slide-20
SLIDE 20

Further questions

  • 1. Is the following true:

For any two binary phylogenetic X-trees T1 and T2, is there an HGT network N for which TN = T1 and yet where dN is representable on T2 (mixing distribution given by the independence model)?

Andrew R Francis (CRM @ UWS) November 2014 13 / 13

slide-21
SLIDE 21

Further questions

  • 1. Is the following true:

For any two binary phylogenetic X-trees T1 and T2, is there an HGT network N for which TN = T1 and yet where dN is representable on T2 (mixing distribution given by the independence model)?

  • 2. Let ρ(d) denote the minimum number of hybridizations required to

represent d on a hybridization or an HGT network.

◮ What conditions characterise those metrics d with ρ(d) = 1? ◮ What about ρ(d) = k for any k ≥ 1?

(We know about ρ(d) = 0: the 4PC).

Andrew R Francis (CRM @ UWS) November 2014 13 / 13