Predicting the outcome of in vitro fertilization G. Corani 1 C. Magli - - PowerPoint PPT Presentation

predicting the outcome of in vitro fertilization
SMART_READER_LITE
LIVE PREVIEW

Predicting the outcome of in vitro fertilization G. Corani 1 C. Magli - - PowerPoint PPT Presentation

Predicting the outcome of in vitro fertilization G. Corani 1 C. Magli 2 A. Giusti 1 L. Gambardella 1 L. Gianaroli 2 1: IDSIA (Ist. Dalle Molle Artificial Intelligence, Switzerland) 2: IIRM (Int. Ist. Reproductive Medicine, Switzerland)


slide-1
SLIDE 1

Predicting the outcome of in vitro fertilization

  • G. Corani1
  • C. Magli2
  • A. Giusti1
  • L. Gambardella1
  • L. Gianaroli2

1: IDSIA (Ist. Dalle Molle Artificial Intelligence, Switzerland) 2: IIRM (Int. Ist. Reproductive Medicine, Switzerland)

❣✐♦r❣✐♦❅✐❞s✐❛✳❝❤

Predicting the IVF outcome

  • G. Corani1, C. Magli2, A. Giusti1, L. Gambardella1, L. Gianaroli2
slide-2
SLIDE 2

In vitro fertilization (IVF)

Infertility affects more than 80 million people worldwide. During IVF 1-3 embryos are cultured in vitro; they are then transferred to the woman. A pregnancy occurs when at least one of the transferred embryos implants. Embryos are scored by biologists according to their morphology: {non-top, top, top+}.

Predicting the IVF outcome

  • G. Corani1, C. Magli2, A. Giusti1, L. Gambardella1, L. Gianaroli2
slide-3
SLIDE 3

The EU assumption (Speirs, 1983)

Pregnancy requires a receptive uterus and a viable embryo. The probability of the uterus being receptive and of the embryo being viable are respectively θu and θe. It is assumed the independence of viability and receptivity. In case of a single embryo transferred: P(pregnancy) = θu ·θe. Usually, θu depends on the age of the woman and θe on the score of the embryo.

Predicting the IVF outcome

  • G. Corani1, C. Magli2, A. Giusti1, L. Gambardella1, L. Gianaroli2
slide-4
SLIDE 4

The BN-EU model

Models the IVF outcome under the EU assumption The goal: estimating the probability of pregnancy, given the score of the transferred embryos and the age of the woman.

Age (A) Ut recep (U) Score 1 (S1) Viab E1 (E1) Score 2 (S2) Viab E2 (E2) Score 3 (S3) Viab E3 (E3) Pregnancy (P)

Nodes with a gray background are affected by a missingness process.

Predicting the IVF outcome

  • G. Corani1, C. Magli2, A. Giusti1, L. Gambardella1, L. Gianaroli2
slide-5
SLIDE 5

How to estimate θu and θe?

There is a partial observability problem (Zhou and Weinberg, 1998; Robert 2007, 2009). If pregnancy does not occur, we cannot ascertain whether:

the uterus is non-receptive; each embryo is non-viable; the uterus is non-receptive and each embryo is non-viable.

Training instance, in case of no pregnancy.

A U S1 S2 S3 E1 E2 E3 P 40+ ? top ntop toph ? ? ?

Predicting the IVF outcome

  • G. Corani1, C. Magli2, A. Giusti1, L. Gambardella1, L. Gianaroli2
slide-6
SLIDE 6

Training instances

If pregnancy does occur: the embryo is known to be receptive but ... it is unknown which embryo is viable, unless the number of babies matches the number of embryos. Training instance (single pregnancy).

A U S1 S2 S3 E1 E2 E3 P 40+ u top ntop toph ?

?

? 1

Training instance (triple pregnancy).

A U S1 S2 S3 E1 E2 E3 P 40+ u top ntop toph e e e 1

Predicting the IVF outcome

  • G. Corani1, C. Magli2, A. Giusti1, L. Gambardella1, L. Gianaroli2
slide-7
SLIDE 7

Estimation procedure

The missingness process is MAR (missing at random); parameters can be estimated via EM (Expectation Maximization). MAP estimation: among m EM runs, the estimate with the highest posterior probability P(θ θ θ|D) (θ θ θ denotes the parameters of the model) is selected. MAP estimation is a good approximation of Bayesian estimation if the posterior is peaked around the maximum; this is not the case when learning from incomplete samples. Different EM runs achieve close values of P(θ θ θ|D), returning however very different parameter estimates.

Predicting the IVF outcome

  • G. Corani1, C. Magli2, A. Giusti1, L. Gambardella1, L. Gianaroli2
slide-8
SLIDE 8

The averaging approach

Given a parameter θ x

X, we weighted-average its estimates across the m EM

runs: ˆ θ x

X = i=m

  • i=1

ˆ θ x−i

X

P( ˆ θ θ θ i|D)

i=m

  • i=1

P( ˆ θ θ θ i|D) where ˆ θ x−i

X

and P(ˆ θ θ θ

i|D) denote the estimate of θ x X and the MAP score

  • btained in the i-th EM run.

Predicting the IVF outcome

  • G. Corani1, C. Magli2, A. Giusti1, L. Gambardella1, L. Gianaroli2
slide-9
SLIDE 9

Rationale

Consider the query P(|y,D), where is the set of variables being queried, and y is the available evidence. Fully Bayesian inference P(|y,D) =

  • P(|y,D,θ

θ θ)P(θ θ θ|D)dθ θ θ MAP inference P(|y,D) ≈ P(|y,D, ˆ θ θ θ) Pseudo-Bayesian inference P(|y,D) ≃

i=m

  • i=1

P(|y,D, ˆ θ θ θ i)P( ˆ θ θ θ i|D)

Predicting the IVF outcome

  • G. Corani1, C. Magli2, A. Giusti1, L. Gambardella1, L. Gianaroli2
slide-10
SLIDE 10

Rationale

The pseudo-Bayesian approach partially reconstructs the shape of the posterior but keeps a collection of m networks, preventing model interpretability. The goal of the averaging approach is to retain the advantages of the pseudo-Bayesian approach, but instantiating only a single model.

Predicting the IVF outcome

  • G. Corani1, C. Magli2, A. Giusti1, L. Gambardella1, L. Gianaroli2
slide-11
SLIDE 11

Experiments with generated data

For each sample size, 100 repetitions of the following: random drawing of the parameters; generation of incomplete instances; learning of the parameters by the MAP and the averaging approach; classification of the test instances.

100 200 300 400 500 600 0.00 0.20 0.40 n KL-divergence averaging MAP

Predicting the IVF outcome

  • G. Corani1, C. Magli2, A. Giusti1, L. Gambardella1, L. Gianaroli2
slide-12
SLIDE 12

Averaging decreases the estimation error of both receptivities and viabilities.

200 400 600 0.05 0.10 MAE of ˆ θ u|40+

U

averaging MAP 200 400 600 0.05 0.10 0.15 n MAE of ˆ θ e|top

E

averaging MAP

Predicting the IVF outcome

  • G. Corani1, C. Magli2, A. Giusti1, L. Gambardella1, L. Gianaroli2
slide-13
SLIDE 13

Averaging increases AUC.

4 AUCs: one for each of no-pregnancy (AUC0), single, double and triple pregnancy.

200 400 600 0.77 0.78 0.79 0.80 0.81 AUC0 averaging MAP true 200 400 600 0.74 0.76 0.78 n AUC1 averaging MAP true

Predicting the IVF outcome

  • G. Corani1, C. Magli2, A. Giusti1, L. Gambardella1, L. Gianaroli2
slide-14
SLIDE 14

Comments

Averaging significantly increases AUC and decrease KL-divergences. The AUC of the true model is only slightly better than that of the estimated models. At test stage receptivity and viabilities are always unknown: this is where lies a major difficulty of predicting IVF! Test instance

A U S1 S2 S3 E1 E2 E3 P 40+ ? top ntop toph ?

?

? ?

Predicting the IVF outcome

  • G. Corani1, C. Magli2, A. Giusti1, L. Gambardella1, L. Gianaroli2
slide-15
SLIDE 15

The IIRM data set (Lugano, 388 cycles)

We test BN-EU vs. the high-performance AODE classifier (Webb et al., 2005). To learn AODE we build a complete data set, with features: the age of the woman and the number of embryos of each type transferred to the woman. Despite being learned on a complete data set, AODE does not

  • utperform BN-EU.

BN-EU AODE AUC0 74.1 74.8 AUC1 67.0 68.0 AUC2 83.4 81.6

Predicting the IVF outcome

  • G. Corani1, C. Magli2, A. Giusti1, L. Gambardella1, L. Gianaroli2
slide-16
SLIDE 16

Conclusions

BN-EU is more interpretable than AODE: uterine receptivity drops from 78% to 58% and 26% for woman aged respectively {<34, 34-40, 40+}; embryo viability increases from 7% to 21% to 39% for embryos scored respectively as non-top, top and top+. The BN-EU model can be used to cross-check the effectiveness of the embryo scoring system. Future direction of research: studying more covariates on which letting depend both receptivity and viability. The averaging approach can be easily added to any EM implementation.

Predicting the IVF outcome

  • G. Corani1, C. Magli2, A. Giusti1, L. Gambardella1, L. Gianaroli2