1. Autocatalytic chemical reactions in the flow reactor 2. - - PowerPoint PPT Presentation

1 autocatalytic chemical reactions in the flow reactor 2
SMART_READER_LITE
LIVE PREVIEW

1. Autocatalytic chemical reactions in the flow reactor 2. - - PowerPoint PPT Presentation

1. Autocatalytic chemical reactions in the flow reactor 2. Replication, mutation, selection and Shannon information 3. Evolution in silico and optimization of RNA structures 4. Random walks and ensemble learning 5. Sequence-structure


slide-1
SLIDE 1
slide-2
SLIDE 2

1. Autocatalytic chemical reactions in the flow reactor 2. Replication, mutation, selection and Shannon information 3. Evolution in silico and optimization of RNA structures 4. Random walks and ‚ensemble learning‘ 5. Sequence-structure maps, neutral networks, and intersections

slide-3
SLIDE 3

Evolution in silico

  • W. Fontana, P. Schuster,

Science 280 (1998), 1451-1455

slide-4
SLIDE 4

5'-End 5'-End 5'-End 3'-End 3'-End 3'-End

70 60 50 40 30 20 10 GCGGAUUUAGCUCAGDDGGGAGAGCMCCAGACUGAAYAUCUGGAGMUCCUGUGTPCGAUCCACAGAAUUCGCACCA

Sequence Secondary structure Symbolic notation

  • A symbolic notation of RNA secondary structure that is equivalent to the conventional graphs
slide-5
SLIDE 5

G G G G G G G G G G G G G G G G U U U U U U U U U U U A A A A A A A A A A A A U C C C C C C C C C C C C 5’-end 3’-end GGCGCGCCCGGCGCC GUAUCGAAAUACGUAGCGUAUGGGGAUGCUGGACGGUCCCAUCGGUACUCCA UGGUUACGCGUUGGGGUAACGAAGAUUCCGAGAGGAGUUUAGUGACUAGAGG

RNAStudio.lnk

Folding of RNA sequences into secondary structures of minimal free energy, G0

300

slide-6
SLIDE 6

Hamming distance d (S ,S ) =

H 1 2

4 d (S ,S ) = 0

H 1 1

d (S ,S ) = d (S ,S )

H H 1 2 2 1

d (S ,S ) d (S ,S ) + d (S ,S )

H H H 1 3 1 2 2 3

  • (i)

(ii) (iii)

The Hamming distance between structures in parentheses notation forms a metric in structure space

slide-7
SLIDE 7

f0 f f1 f2 f3 f4 f6 f5 f7

Replication rate constant: fk = / [+ dS

(k)]

  • dS

(k) = dH(Sk,S

)

Evaluation of RNA secondary structures yields replication rate constants

slide-8
SLIDE 8

Stock Solution Reaction Mixture

Replication rate constant: fk = / [+ dS

(k)]

  • dS

(k) = dH(Sk,S

) Selection constraint: # RNA molecules is controlled by the flow N N t N ± ≈ ) ( Constant mutation rate: p = 0.001 per site and replication The flowreactor as a device for studies of evolution in vitro and in silico

slide-9
SLIDE 9

5'-End 3'-End

70 60 50 40 30 20 10

Randomly chosen initial structure Phenylalanyl-tRNA as target structure

slide-10
SLIDE 10

s p a c e Sequence Concentration

Master sequence Mutant cloud “Off-the-cloud” mutations

The molecular quasispecies in sequence space

slide-11
SLIDE 11

S{ = ( ) I{ f S

{ {

ƒ = ( )

S{ f{ I{

Mutation Genotype-Phenotype Mapping Evaluation of the Phenotype

Q{

j

I1 I2 I3 I4 I5 In

Q

f1 f2 f3 f4 f5 fn

I1 I2 I3 I4 I5 I{ In+1 f1 f2 f3 f4 f5 f{ fn+1

Q

Evolutionary dynamics including molecular phenotypes

slide-12
SLIDE 12

In silico optimization in the flow reactor: trajectory Time (arbitrary units) A v e r a g e s t r u c t u r e d i s t a n c e t

  • t

a r g e t d

  • S

500 750 1000 1250 250 50 40 30 20 10

Evolutionary trajectory

slide-13
SLIDE 13

44

Average structure distance to target dS

  • Evolutionary trajectory

1250 10

44 42 40 38 36 Relay steps Number of relay step Time

Final structure of the optimization process

slide-14
SLIDE 14

44 43

Average structure distance to target dS

  • Evolutionary trajectory

1250 10

44 42 40 38 36 Relay steps Number of relay step Time

Reconstruction of the last step 43 44

slide-15
SLIDE 15

44 43 42

Average structure distance to target dS

  • Evolutionary trajectory

1250 10

44 42 40 38 36 Relay steps Number of relay step Time

Reconstruction of last-but-one step 42 43 ( 44)

slide-16
SLIDE 16

44 43 42 41

Average structure distance to target dS

  • Evolutionary trajectory

1250 10

44 42 40 38 36 Relay steps Number of relay step Time

Reconstruction of step 41 42 ( 43 44)

slide-17
SLIDE 17

44 43 42 41 40

Average structure distance to target dS

  • Evolutionary trajectory

1250 10

44 42 40 38 36 Relay steps Number of relay step Time

Reconstruction of step 40 41 ( 42 43 44)

slide-18
SLIDE 18

44 43 42 41 40 39 Evolutionary process Reconstruction

Average structure distance to target dS

  • Evolutionary trajectory

1250 10

44 42 40 38 36 Relay steps Number of relay step Time

Reconstruction of the relay series

slide-19
SLIDE 19

Transition inducing point mutations Neutral point mutations

Change in RNA sequences during the final five relay steps 39 44

slide-20
SLIDE 20

In silico optimization in the flow reactor: Trajectory and relay steps Time (arbitrary units) A v e r a g e s t r u c t u r e d i s t a n c e t

  • t

a r g e t d

  • S

500 750 1000 1250 250 50 40 30 20 10

Evolutionary trajectory

Relay steps

slide-21
SLIDE 21

10 08 12 14 Time (arbitrary units) Average structure distance to target dS

  • 500

250 20 10

Uninterrupted presence Evolutionary trajectory Number of relay step

28 neutral point mutations during a long quasi-stationary epoch Transition inducing point mutations Neutral point mutations

Neutral genotype evolution during phenotypic stasis

slide-22
SLIDE 22

In silico optimization in the flow reactor: Main transitions Main transitions Relay steps Time (arbitrary units) Average structure distance to target d S

500 750 1000 1250 250 50 40 30 20 10

Evolutionary trajectory

slide-23
SLIDE 23

00 09 31 44

Three important steps in the formation of the tRNA clover leaf from a randomly chosen initial structure corresponding to three main transitions.

slide-24
SLIDE 24

AUGC GC Movies of optimization trajectories over the AUGC and the GC alphabet

slide-25
SLIDE 25

Runtime of trajectories F r e q u e n c y

1000 2000 3000 4000 5000 0.05 0.1 0.15 0.2

Statistics of the lengths of trajectories from initial structure to target (AUGC-sequences)

slide-26
SLIDE 26

Alphabet Runtime Transitions Main transitions

  • No. of runs

AUGC 385.6 22.5 12.6 1017 GUC 448.9 30.5 16.5 611 GC 2188.3 40.0 20.6 107

Statistics of trajectories and relay series (mean values of log-normal distributions)

slide-27
SLIDE 27

Massif Central Mount Fuji

Examples of smooth landscapes on Earth

slide-28
SLIDE 28

Dolomites

Examples of rugged landscapes on Earth

Bryce Canyon

slide-29
SLIDE 29

Genotype Space Fitness

Start of Walk End of Walk

Evolutionary optimization in absence of neutral paths in sequence space

slide-30
SLIDE 30

Genotype Space F i t n e s s

Start of Walk End of Walk Random Drift Periods Adaptive Periods

Evolutionary optimization including neutral paths in sequence space

slide-31
SLIDE 31

Grand Canyon

Example of a landscape on Earth with ‘neutral’ ridges and plateaus

slide-32
SLIDE 32

Neutral ridges and plateus

slide-33
SLIDE 33

1. Autocatalytic chemical reactions in the flow reactor 2. Replication, mutation, selection and Shannon information 3. Evolution in silico and optimization of RNA structures 4. Random walks and ‚ensemble learning‘ 5. Sequence-structure maps, neutral networks, and intersections

slide-34
SLIDE 34

10 08 12 14 Time (arbitrary units) Average structure distance to target dS

  • 500

250 20 10

Uninterrupted presence Evolutionary trajectory Number of relay step

28 neutral point mutations during a long quasi-stationary epoch Transition inducing point mutations Neutral point mutations

Neutral genotype evolution during phenotypic stasis

slide-35
SLIDE 35

Variation in genotype space during optimization of phenotypes

Mean Hamming distance within the population and drift velocity of the population center in sequence space.

slide-36
SLIDE 36

Spread of population in sequence space during a quasistationary epoch: t = 150

slide-37
SLIDE 37

Spread of population in sequence space during a quasistationary epoch: t = 170

slide-38
SLIDE 38

Spread of population in sequence space during a quasistationary epoch: t = 200

slide-39
SLIDE 39

Spread of population in sequence space during a quasistationary epoch: t = 350

slide-40
SLIDE 40

Spread of population in sequence space during a quasistationary epoch: t = 500

slide-41
SLIDE 41

Spread of population in sequence space during a quasistationary epoch: t = 650

slide-42
SLIDE 42

Spread of population in sequence space during a quasistationary epoch: t = 820

slide-43
SLIDE 43

Spread of population in sequence space during a quasistationary epoch: t = 825

slide-44
SLIDE 44

Spread of population in sequence space during a quasistationary epoch: t = 830

slide-45
SLIDE 45

Spread of population in sequence space during a quasistationary epoch: t = 835

slide-46
SLIDE 46

Spread of population in sequence space during a quasistationary epoch: t = 840

slide-47
SLIDE 47

Spread of population in sequence space during a quasistationary epoch: t = 845

slide-48
SLIDE 48

Spread of population in sequence space during a quasistationary epoch: t = 850

slide-49
SLIDE 49

Spread of population in sequence space during a quasistationary epoch: t = 855

slide-50
SLIDE 50

Element of class 2: The ant worker

slide-51
SLIDE 51

Ant colony Random foraging Food source

Foraging behavior of ant colonies

slide-52
SLIDE 52

Ant colony Food source detected Food source

Foraging behavior of ant colonies

slide-53
SLIDE 53

Ant colony Pheromone trail laid down Food source

Foraging behavior of ant colonies

slide-54
SLIDE 54

Ant colony Pheromone controlled trail Food source

Foraging behavior of ant colonies

slide-55
SLIDE 55

RNA model Foraging behavior of ant colonies Element RNA molecule Individual worker ant Mechanism relating elements Mutation in quasi-species Genetics of kinship Search process Optimization of RNA structure Recruiting of food Search space Sequence space Three-dimensional space Random step Mutation Element of ant walk Self-enhancing process Replication Secretion of pheromone Interaction between elements Mean replication rate Mean pheromone concentration Goal of the search Target structure Food source Temporary memory RNA sequences in population Pheromone trail ‘Learning’ entity Population of molecules Ant colony

Learning at population or colony level by trial and error

Two examples: (i) RNA model and (ii) ant colony

slide-56
SLIDE 56

1. Autocatalytic chemical reactions in the flow reactor 2. Replication, mutation, selection and Shannon information 3. Evolution in silico and optimization of RNA structures 4. Random walks and ‚ensemble learning‘ 5. Sequence-structure maps, neutral networks, and intersections

slide-57
SLIDE 57

Minimum free energy criterion

Inverse folding of RNA secondary structures

1st 2nd 3rd trial 4th 5th

The inverse folding algorithm searches for sequences that form a given RNA secondary structure under the minimum free energy criterion.

slide-58
SLIDE 58
slide-59
SLIDE 59

Sk I. = ( ) ψ

fk f Sk = ( )

Sequence space Structure space Real numbers Mapping from sequence space into structure space and into function

slide-60
SLIDE 60

Sk I. = ( ) ψ

fk f Sk = ( )

Sequence space Structure space Real numbers

slide-61
SLIDE 61

Sk I. = ( ) ψ

fk f Sk = ( )

Sequence space Structure space Real numbers

The pre-image of the structure Sk in sequence space is the neutral network Gk

slide-62
SLIDE 62

Neutral networks are sets of sequences forming the same structure. Gk is the pre-image of the structure Sk in sequence space: Gk =

  • 1(Sk) π{

j |

(Ij) = Sk} The set is converted into a graph by connecting all sequences of Hamming distance one. Neutral networks of small RNA molecules can be computed by exhaustive folding of complete sequence spaces, i.e. all RNA sequences of a given chain length. This number, N=4n , becomes very large with increasing length, and is prohibitive for numerical computations. Neutral networks can be modelled by random graphs in sequence

  • space. In this approach, nodes are inserted randomly into sequence

space until the size of the pre-image, i.e. the number of neutral sequences, matches the neutral network to be studied.

slide-63
SLIDE 63

λj = 27 = 0.444 ,

/

12 λk = (k)

j

| | Gk

λ κ

cr = 1 -

  • 1 (

1)

/ κ- λ λ

k cr . . . .

> λ λ

k cr . . . .

< network is connected Gk network is connected not Gk Connectivity threshold: Alphabet size : = 4

  • AUGC

G S S

k k k

= ( ) | ( ) =

  • 1

U

  • I

I

j j

  • cr

2 0.5 3 0.423 4 0.370

GC,AU GUC,AUG AUGC

Mean degree of neutrality and connectivity of neutral networks

slide-64
SLIDE 64

A connected neutral network

slide-65
SLIDE 65

Giant Component

A multi-component neutral network

slide-66
SLIDE 66

5'-End 5'-End 5'-End 5'-End 3'-End 3'-End 3'-End 3'-End

70 70 70 70 60 60 60 60 50 50 50 50 40 40 40 40 30 30 30 30 20 20 20 20 10 10 10 10

Alphabet Degree of neutrality

AU AUG AUGC UGC GC

  • -
  • -

0.275 0.064 0.263 0.071 0.052 0.033

  • -

0.217 0.051 0.279 0.063 0.257 0.070

  • 0.057 0.034
  • 0.073 0.032

0.201 0.056 0.313 0.058 0.250 0.064 0.068 0.034

  • Degree of neutrality of cloverleaf RNA secondary structures over different alphabets
slide-67
SLIDE 67

Reference for postulation and in silico verification of neutral networks

slide-68
SLIDE 68

Gk Neutral Network

Structure S

k

Gk C k

Compatible Set Ck

The compatible set Ck of a structure Sk consists of all sequences which form Sk as its minimum free energy structure (the neutral network Gk) or one of its suboptimal structures.

slide-69
SLIDE 69

Structure S Structure S

1

The intersection of two compatible sets is always non empty: C0 C1 π

slide-70
SLIDE 70

Reference for the definition of the intersection and the proof of the intersection theorem

slide-71
SLIDE 71

C U G G G A A A A A U C C C C A G A C C G G G G G U U U C C C C G G

3’-end

M i n i m u m f r e e e n e r g y c

  • n

f

  • r

m a t i

  • n

S S u b

  • p

t i m a l c

  • n

f

  • r

m a t i

  • n

S 1

G G G G G G G G G G G G C C C C U U U U C C C C C C U A A A A A C G G G G G G C C C C U U G G G G G C C C C C C C U U A A A A A U G

A sequence at the intersection of two neutral networks is compatible with both structures

slide-72
SLIDE 72
slide-73
SLIDE 73
slide-74
SLIDE 74

A ribozyme switch

E.A.Schultes, D.B.Bartel, Science 289 (2000), 448-452

slide-75
SLIDE 75

Two ribozymes of chain lengths n = 88 nucleotides: An artificial ligase (A) and a natural cleavage ribozyme of hepatitis-

  • virus (B)
slide-76
SLIDE 76

The sequence at the intersection: An RNA molecules which is 88 nucleotides long and can form both structures

slide-77
SLIDE 77

Two neutral walks through sequence space with conservation of structure and catalytic activity

slide-78
SLIDE 78

Evolution of RNA molecules based on Qβ phage

D.R.Mills, R.L.Peterson, S.Spiegelman, An extracellular Darwinian experiment with a self-duplicating nucleic acid molecule. Proc.Natl.Acad.Sci.USA 58 (1967), 217-224 S.Spiegelman, An approach to the experimental analysis of precellular evolution. Quart.Rev.Biophys. 4 (1971), 213-253 C.K.Biebricher, Darwinian selection of self-replicating RNA molecules. Evolutionary Biology 16 (1983), 1-52 G.Bauer, H.Otten, J.S.McCaskill, Travelling waves of in vitro evolving RNA. Proc.Natl.Acad.Sci.USA 86 (1989), 7937-7941 C.K.Biebricher, W.C.Gardiner, Molecular evolution of RNA in vitro. Biophysical Chemistry 66 (1997), 179-192 G.Strunk, T.Ederhof, Machines for automated evolution experiments in vitro based on the serial transfer concept. Biophysical Chemistry 66 (1997), 193-202

slide-79
SLIDE 79

RNA sample Stock solution: Q RNA-replicase, ATP, CTP, GTP and UTP, buffer

  • Time

1 2 3 4 5 6 69 70 The serial transfer technique applied to RNA evolution in vitro

slide-80
SLIDE 80

Reproduction of the original figure of the serial transfer experiment with Q RNA β D.R.Mills, R,L,Peterson, S.Spiegelman, . Proc.Natl.Acad.Sci.USA (1967), 217-224 An extracellular Darwinian experiment with a self-duplicating nucleic acid molecule 58

slide-81
SLIDE 81

Decrease in mean fitness due to quasispecies formation

The increase in RNA production rate during a serial transfer experiment

slide-82
SLIDE 82

Evolutionary design of RNA molecules

D.B.Bartel, J.W.Szostak, In vitro selection of RNA molecules that bind specific ligands. Nature 346 (1990), 818-822 C.Tuerk, L.Gold, SELEX - Systematic evolution of ligands by exponential enrichment: RNA ligands to bacteriophage T4 DNA polymerase. Science 249 (1990), 505-510 D.P.Bartel, J.W.Szostak, Isolation of new ribozymes from a large pool of random sequences. Science 261 (1993), 1411-1418 R.D.Jenison, S.C.Gill, A.Pardi, B.Poliski, High-resolution molecular discrimination by RNA. Science 263 (1994), 1425-1429

  • Y. Wang, R.R.Rando, Specific binding of aminoglycoside antibiotics to RNA. Chemistry &

Biology 2 (1995), 281-290 Jiang, A. K. Suri, R. Fiala, D. J. Patel, Saccharide-RNA recognition in an aminoglycoside antibiotic-RNA aptamer complex. Chemistry & Biology 4 (1997), 35-50

slide-83
SLIDE 83

yes

Selection Cycle

no

Genetic Diversity

Desired Properties ? ? ? Selection Amplification Diversification

Selection cycle used in applied molecular evolution to design molecules with predefined properties

slide-84
SLIDE 84

Retention of binders Elution of binders C h r

  • m

a t

  • g

r a p h i c c

  • l

u m n

The SELEX technique for the evolutionary design of aptamers

slide-85
SLIDE 85

Sequences of aptamers binding theophyllin, caffeine, and related compounds

R.D.Jenison, S.C.Gill, A.Pardi, B.Poliski, High-resolution molecular discrimination by RNA. Science 263 (1994), 1425-1429

slide-86
SLIDE 86

Secondary structures of aptamers binding theophyllin, caffeine, and related compounds

slide-87
SLIDE 87

additional methyl group

Dissociation constants and specificity of theophylline, caffeine, and related derivatives

  • f uric acid for binding to a discriminating

aptamer TCT8-4

slide-88
SLIDE 88

Schematic drawing of the aptamer binding site for the theophylline molecule

slide-89
SLIDE 89

tobramycin

A A A A A C C C C C C C C G G G G G G G G U U U U U U

5’- 3’-

A A A A A U U U U U U C C C C C C C C G G G G G G G G

5’-

  • 3’

RNA aptamer

Formation of secondary structure of the tobramycin binding RNA aptamer

  • L. Jiang, A. K. Suri, R. Fiala, D. J. Patel, Saccharide-RNA recognition in an aminoglycoside

antibiotic-RNA aptamer complex. Chemistry & Biology 4:35-50 (1997)

slide-90
SLIDE 90

The three-dimensional structure of the tobramycin aptamer complex

  • L. Jiang, A. K. Suri, R. Fiala, D. J. Patel,

Chemistry & Biology 4:35-50 (1997)

slide-91
SLIDE 91

Acknowledgement of support

Fonds zur Förderung der wissenschaftlichen Forschung (FWF) Projects No. 09942, 10578, 11065, 13093 13887, and 14898 Jubiläumsfonds der Österreichischen Nationalbank Project No. Nat-7813 European Commission: Project No. EU-980189 Siemens AG, Austria The Santa Fe Institute and the Universität Wien The software for producing RNA movies was developed by Robert Giegerich and coworkers at the Universität Bielefeld

Universität Wien

slide-92
SLIDE 92

Coworkers

Universität Wien

Walter Fontana, Santa Fe Institute, NM Christian Reidys, Christian Forst, Los Alamos National Laboratory, NM Peter Stadler, Bärbel Stadler, Universität Leipzig, GE Ivo L.Hofacker, Christoph Flamm, Universität Wien, AT Andreas Wernitznig, Michael Kospach, Universität Wien, AT Ulrike Langhammer, Ulrike Mückstein, Stefanie Widder Jan Cupal, Kurt Grünberger, Andreas Svrček-Seiler, Stefan Wuchty, Stefan Bernhardt, Andreas De Stefani Ulrike Göbel, Institut für Molekulare Biotechnologie, Jena, GE Walter Grüner, Stefan Kopp, Jaqueline Weber

slide-93
SLIDE 93

Web-Page for further information: http://www.tbi.univie.ac.at/~pks

slide-94
SLIDE 94