Deploying a Pharmacoinformatics Grid for Integrative Biomedical - - PowerPoint PPT Presentation

deploying a pharmacoinformatics grid for integrative
SMART_READER_LITE
LIVE PREVIEW

Deploying a Pharmacoinformatics Grid for Integrative Biomedical - - PowerPoint PPT Presentation

Deploying a Pharmacoinformatics Grid for Integrative Biomedical Researches Jung-Hsin Lin () School of Pharmacy, National Taiwan University & Institute of Biomedical Sciences, Academia Sinica http://rx.mc.ntu.edu.tw/~jlin/


slide-1
SLIDE 1

Deploying a Pharmacoinformatics Grid for Integrative Biomedical Researches

Jung-Hsin Lin (林榮信) School of Pharmacy, National Taiwan University & Institute of Biomedical Sciences, Academia Sinica http://rx.mc.ntu.edu.tw/~jlin/

slide-2
SLIDE 2

Pharmacoinformatics integrates Bioinformatics and Chemoinformatics for Drug Discovery

slide-3
SLIDE 3

Identifying Drug Targets using Microarray

slide-4
SLIDE 4

http://bioinfo.mc.ntu.edu.tw:8080/GenePathway/

slide-5
SLIDE 5
slide-6
SLIDE 6
slide-7
SLIDE 7
slide-8
SLIDE 8

Constructing Biological Pathways and Networks

slide-9
SLIDE 9

Mathematical Modeling of Signaling Pathways

  • Am. J. Phys. Endo. Metab. 283:E1084 (2002)
slide-10
SLIDE 10

Coupled Differential Equations for Biological Pathways

11 10 8 12 8 11 10 11 8 7 12 8 p 5 4 9 7 10 p 5 4 9 7 10 7 9 8 6 8 4 5 4 8 7 6 7 4 4 4 7 6 4 2 4 8 7 6 6 5 5 6 5 4 8 4 5 3 5 1 2 4 2 3 3 5 4 4 7 4 4 2 5 1 2 4 3 3 3 1 2 1 1 3 2 4 6 4 2 1 1 5 3 3 1 2

) ] PTP [ ( ) IR /( ) ( ) IR /( ) ( ] PTP [ ] PTP [ ] PTP [ ) ]( PTP [ ] PTP [ ] PTP [ x x k x k dt dx x x k k x k x x x k dt dx x x x k x k dt dx x k x k x k dt dx x k x k x k dt dx x k x k x x k x k k dt dx x k x k x k x x k x k x k dt dx x k x k x k x x k dt dx x k x k x x k dt dx x k x k x x k x k x k dt dx − = + − + + = + − = − − = − − = − + + + + = − − + − + = + − + = − − = − + − + =

− − − − − − − − − − − − − − − − − 21 13 20 ' 13 13 21 20 14 14 20 ' 13 13 21 13 20 19 12 18 12 19 18 12 19 12 18 17 11 16 11 17 16 11 17 11 16 15 10 13 10 15 14 9 13 9 14 13 10 9 15 10 14 9 13 12 8 11 10 8 12

) ( ) ( ] SHIP [ ] PTEN [ ]) SHIP [ ] PTEN [ ( x k x k k dt dx x k k x k k x k dt dx x k x k dt dx x k x k dt dx x k x k dt dx x k x k dt dx x k x k dt dx x k x k dt dx x k k x k x k dt dx x k x x k dt dx

− − − − − − − − − − −

− + = − + + − = − = − = − = − = − = − = + − + = − =

slide-11
SLIDE 11

Virtual Screening after Target Identified

1.

  • 1. Start with crystal coordinates of target

receptor and locate the active site 3.

  • 3. Search for the optimal position and location

based on some scoring function 4.

  • 4. Pick up the conformations (or compounds) with best

scores 2.

  • 2. Generate the molecular surface for the receptor
slide-12
SLIDE 12

State Vector in the Flexible Docking Problem

) , , , , , , , , , (

2 1 k CM CM CM

z y x χ χ χ ψ θ φ L

slide-13
SLIDE 13

Characteristics of Biological Complex Problems

  • The potential energy function is extremely rugged.
  • The potential energy surface is usually highly asymmetric.
  • The true global minimum is often surrounded by many

deceptive local minima.

  • The biological complex problems are mostly in the space of

high dimensionality.

slide-14
SLIDE 14

How to explore the phase space?

(Or, how to find a needle in a haystack?)

  • --Importance sampling

We should only explore the important region of the phase space, not the entire phase space. Stochastic methods usually outperform deterministic approaches in higher dimensional space.

slide-15
SLIDE 15

Genetic Algorithm

  • 1. [Start] Generate random population of n chromosomes (suitable solutions

for the problem)

  • 2. [Fitness] Evaluate the fitness f(x) of each chromosome x in the population
  • 3. [New population] Create a new population by repeating following steps

until the new population is complete

  • a. [Selection] Select two parent chromosomes from a population

according to their fitness (the better fitness, the bigger chance to be selected)

  • b. [Crossover] With a crossover probability cross over the parents to

form new offspring (children). If no crossover was performed,

  • ffspring is the exact copy of parents.
  • c. [Mutation] With a mutation probability mutate new offspring at each

locus (position in chromosome).

  • d. [Accepting] Place new offspring in the new population
  • 4. [Replace] Use new generated population for a further run of the algorithm
  • 5. [Test] If the end condition is satisfied, stop, and return the best solution in

current population

  • 6. [Loop] Go to step 2
slide-16
SLIDE 16

Chromosomes for GA Docking

Crossover operation Leach, 2001.

slide-17
SLIDE 17

Lamarckian Genetic Algorithm

LGA is a hybrid of the Genetic Algorithm with the adaptive local search method. As in the GA scheme, energy is regarded as the phenotype, and the compound conformation and location are regarded as the genotype. In the LGA scheme, phenotype is modified by the local searcher, and then the genotype is modified by the locally optimized phenotype. In AutoDock, the so-called Solis-Wet algorithm is used (basically energy-based random move).

slide-18
SLIDE 18

A Maximum Entropy Evolutionary Algorithm for the Docking Problem

  • n individuals, denoted by s1, s2, …, sn, are generated. Each si is a vector

corresponding to a point in the domain of the objective function f . In order to achieve a scale-free representation, each component of si is linearly mapped to the numerical range of [0,1].

  • The individuals in each generation of population are then sorted in the

ascending order based on the values of the energy function on evaluated on these individuals. Let t1, t2, … tn denote the ordered individuals and we have f(t1)<f(t2)<f(tn).

  • n Gaussian distributions, denoted by G1, G2, … Gn, are generated before the

new generation of population is created. The center of each Gaussian distribution is selected randomly and independently from t1, t2, … tn, where the probability is not uniform but instead follows a discrete diminishing distribution, n : n-1 : … : 1.

( )

⎟ ⎟ ⎠ ⎞ ⎜ ⎜ ⎝ ⎛ − − ⎟ ⎟ ⎠ ⎞ ⎜ ⎜ ⎝ ⎛ ⋅ =

2 2

2 exp 2 1 ) (

i i i i i

' ' p σ σ π μ s s

1 ) (

2

− − + = n i

i

α β α σ

Nucleic Acids Research 33: W233-W238 (2005)

slide-19
SLIDE 19

LGA versus ME

  • The ME algorithm avoids the “purification” effect inherent in the genetic

algorithm and its derivatives, and therefore reduce the over-compression of information in the searching process.

slide-20
SLIDE 20
slide-21
SLIDE 21
slide-22
SLIDE 22

http://bioinfo.mc.ntu.edu.tw/medock/, Nucleic Acids Research 33: W233-W238 (2005)

slide-23
SLIDE 23

Overexpression of P-glycoprotein is the major cause for multidrug resistance problems in cancer chemotherapies

  • Multidrug resistance (MDR) has posed a serious clinical

problem in cancer chemotherapy.

  • MDR will cause the reduction of bioavailability of

drugs.

  • P-gp, product of the mdr1 gene in humans, localization
  • n chromosome 7q21, is a member of the large ATP

binding cassette (ABC) family of proteins.

  • P-gp is 1280 amino acids long and is very dynamic

inside membranes.

slide-24
SLIDE 24

Structure Prediction the MDR Protein Pgp

  • 1. Predicting the structure of

Pgp using homology modeling

  • 2. Molecular dynamics

simulation in a lipid bilayer

slide-25
SLIDE 25

Paclitaxel (Taxol)

  • THR-199 [ TM3 ]
  • PHE-303,TYR-307,PHE-314 [ TM5 ]
  • SER-344, VAL-345, GLN-347 [ TM6 ]
slide-26
SLIDE 26

Summary

  • Grid is the ideal computing architecture that enables

integrative biomedical and pharmaceutical researches, which

  • ften require access to heterogeneous computing resources.
  • The GenePathway Viewer allows conversion of microarray

data into colored pathway information, which uses the Web Service technology that can update the pathway database from KEGG in a real-time and automatic fashion.

  • Molecular dynamics simulations of biomacromolecules

usually generate huge amount of data in a very high speed, and therefore good archiving facilities like DataGrid is important to ensure data security and integrity.

  • The ME algorithm is intrinsically parallel, and therefore

straightforward to be implemented on the Grid architecture.

slide-27
SLIDE 27

Acknowledgement

  • Prof. Yen-Jen Oyang
  • Tien-Hao Chang

Computer Science and Information Engineering, NTU

National Science Council of Taiwan

School of Pharmacy, NTU

  • Pei-Hua Lo
  • Hui-Hsuan Tu