Fast Robustness Quantification with Variational Bayes
ITT Career Development Assistant Professor, MIT
Tamara Broderick
With: Ryan Giordano, Rachael Meager, Jonathan Huggins, Michael I. Jordan
Fast Robustness Quantification with Variational Bayes Tamara - - PowerPoint PPT Presentation
Fast Robustness Quantification with Variational Bayes Tamara Broderick ITT Career Development Assistant Professor, MIT With: Ryan Giordano, Rachael Meager, Jonathan Huggins, Michael I. Jordan Bayesian inference Complex, modular
ITT Career Development Assistant Professor, MIT
Tamara Broderick
With: Ryan Giordano, Rachael Meager, Jonathan Huggins, Michael I. Jordan
1
1
1
1
p(θ|x) ∝θ p(x|θ)p(θ)
1
p(θ|x) ∝θ p(x|θ)p(θ)
1
p(θ|x) ∝θ p(x|θ)p(θ)
1
p(θ|x) ∝θ p(x|θ)p(θ)
1
p(θ|x) ∝θ p(x|θ)p(θ)
1
p(θ|x) ∝θ p(x|θ)p(θ)
1
p(θ|x) ∝θ p(x|θ)p(θ)
Bayes Theorem
1
p(θ|x) ∝θ p(x|θ)p(θ)
Bayes Theorem
1
p(θ|x) ∝θ p(x|θ)p(θ)
Bayes Theorem
1
p(θ|x) ∝θ p(x|θ)p(θ)
Bayes Theorem
1
p(θ|x) ∝θ p(x|θ)p(θ)
Bayes Theorem
MCMC
response variational Bayes
1
p(θ|x) ∝θ p(x|θ)p(θ)
Bayes Theorem
MCMC
response variational Bayes
1
p(θ|x) ∝θ p(x|θ)p(θ)
Bayes Theorem
MCMC
response variational Bayes
1
p(θ|x) ∝θ p(x|θ)p(θ)
Bayes Theorem
1
p(θ|x) ∝θ p(x|θ)p(θ)
MCMC
response variational Bayes
Bayes Theorem
1
p(θ|x) ∝θ p(x|θ)p(θ)
MCMC
response variational Bayes
Bayes Theorem
1
p(θ|x) ∝θ p(x|θ)p(θ)
MCMC
response variational Bayes
Bayes Theorem
1
p(θ|x) ∝θ p(x|θ)p(θ)
MCMC
response variational Bayes
Bayes Theorem
2
2
2
2
2
2
posterior
(KL) divergence: p(θ|x) KL(qkp(·|x))
p(θ|x) q(θ) q∗(θ)
3
posterior
(KL) divergence: p(θ|x) KL(qkp(·|x))
p(θ|x) q(θ) q∗(θ)
3
posterior
(KL) divergence: p(θ|x) KL(qkp(·|x))
p(θ|x) q(θ) q∗(θ)
3
posterior
(KL) divergence: p(θ|x) KL(qkp(·|x))
q∗(θ)
3
posterior
(KL) divergence: p(θ|x) KL(qkp(·|x))
q∗(θ)
q(θ)
3
q(θ)
posterior
(KL) divergence: p(θ|x) KL(qkp(·|x))
q∗(θ)
p(θ|x)
3
q(θ)
posterior
(KL) divergence: p(θ|x) KL(qkp(·|x))
q∗(θ)
p(θ|x)
3
posterior
(KL) divergence: p(θ|x) KL(qkp(·|x))
q∗(θ) p(θ|x) q∗(θ)
3
posterior
(KL) divergence: p(θ|x) KL(qkp(·|x))
q∗(θ)
p(θ|x) q∗(θ)
3
posterior
(KL) divergence: p(θ|x) KL(qkp(·|x))
q∗(θ)
p(θ|x) q∗(θ)
3
posterior
(KL) divergence: p(θ|x) KL(qkp(·|x))
q∗(θ)
p(θ|x) q∗(θ)
3
posterior
(KL) divergence: p(θ|x) KL(qkp(·|x))
q∗(θ)
p(θ|x) q∗(θ)
3
posterior
(KL) divergence: p(θ|x) KL(qkp(·|x))
q∗(θ)
[Broderick, Boyd, Wibisono, Wilson, Jordan 2013]
p(θ|x) q∗(θ)
3
posterior
(KL) divergence: p(θ|x) KL(qkp(·|x))
q∗(θ)
[Broderick, Boyd, Wibisono, Wilson, Jordan 2013]
p(θ|x) q∗(θ)
3
!
!
severely)
[Bishop 2006]
q(θ) =
J
Y
j=1
q(θj) KL(q||p(·|x)) = Z
θ
q(θ) log q(θ) p(θ|x)dθ θ1 θ2
4
!
!
severely)
q(θ) =
J
Y
j=1
q(θj)
4
!
!
severely)
[Bishop 2006]
q(θ) =
J
Y
j=1
q(θj) θ1 θ2 p(θ|x)
4
!
!
severely)
[Bishop 2006]
q(θ) =
J
Y
j=1
q(θj) KL(q||p(·|x)) = Z
θ
q(θ) log q(θ) p(θ|x)dθ θ1 θ2 p(θ|x)
4
!
!
severely)
[Bishop 2006]
q(θ) =
J
Y
j=1
q(θj) KL(q||p(·|x)) = Z
θ
q(θ) log q(θ) p(θ|x)dθ θ1 θ2 p(θ|x)
4
!
!
severely)
[Bishop 2006]
q(θ) =
J
Y
j=1
q(θj) KL(q||p(·|x)) = Z
θ
q(θ) log q(θ) p(θ|x)dθ θ1 θ2 p(θ|x) q∗(θ)
4
!
!
severely)
[Bishop 2006]
q(θ) =
J
Y
j=1
q(θj) KL(q||p(·|x)) = Z
θ
q(θ) log q(θ) p(θ|x)dθ θ1 θ2 p(θ|x) q∗(θ)
4
!
!
severely)
[Bishop 2006]
q(θ) =
J
Y
j=1
q(θj) KL(q||p(·|x)) = Z
θ
q(θ) log q(θ) p(θ|x)dθ θ1 θ2 p(θ|x) q∗(θ)
4
!
!
severely)
q(θ) =
J
Y
j=1
q(θj) KL(q||p(·|x)) = Z
θ
q(θ) log q(θ) p(θ|x)dθ θ1 θ2
[MacKay 2003; Bishop 2006; Wang, Titterington 2004; Turner, Sahani 2011]
p(θ|x) q∗(θ)
4
!
!
severely)
q(θ) =
J
Y
j=1
q(θj) KL(q||p(·|x)) = Z
θ
q(θ) log q(θ) p(θ|x)dθ θ1 θ2
[MacKay 2003; Bishop 2006; Wang, Titterington 2004; Turner, Sahani 2011]
p(θ|x) q∗(θ)
[Fosdick 2013; Dunson 2014; Bardenet, Doucet, Holmes 2015]
4
!
!
!
V := d2 dtT dtCq∗(t)
mean = d dtC(t)
Σ := d2 dtT dtCp(·|x)(t)
log pt(θ) := log p(θ|x) + tT θ − C(t), MFVB q∗
t
Σ = d dtT Eptθ
≈ d dtT Eq∗
t θ
=: ˆ Σ
[Bishop 2006]
C(t) := log EetT θ p(θ|x) q∗(θ)
5
!
!
!
V := d2 dtT dtCq∗(t)
mean = d dtC(t)
Σ := d2 dtT dtCp(·|x)(t)
log pt(θ) := log p(θ|x) + tT θ − C(t), MFVB q∗
t
Σ = d dtT Eptθ
≈ d dtT Eq∗
t θ
=: ˆ Σ
[Bishop 2006]
C(t) := log EetT θ p(θ|x) q∗(θ)
5
!
!
!
V := d2 dtT dtCq∗(t)
mean = d dtC(t)
Σ := d2 dtT dtCp(·|x)(t)
log pt(θ) := log p(θ|x) + tT θ − C(t), MFVB q∗
t
Σ = d dtT Eptθ
≈ d dtT Eq∗
t θ
=: ˆ Σ
[Bishop 2006]
C(t) := log EetT θ p(θ|x) q∗(θ)
5
!
!
!
V := d2 dtT dtCq∗(t)
mean = d dtC(t)
Σ := d2 dtT dtCp(·|x)(t)
log pt(θ) := log p(θ|x) + tT θ − C(t), MFVB q∗
t
Σ = d dtT Eptθ
≈ d dtT Eq∗
t θ
=: ˆ Σ
[Bishop 2006]
C(t) := log EetT θ p(θ|x) q∗(θ)
5
!
!
!
V := d2 dtT dtCq∗(t)
mean = d dtC(t)
Σ := d2 dtT dtCp(·|x)(t)
log pt(θ) := log p(θ|x) + tT θ − C(t), MFVB q∗
t
Σ = d dtT Eptθ
≈ d dtT Eq∗
t θ
=: ˆ Σ
[Bishop 2006]
C(t) := log EetT θ p(θ|x) q∗(θ) p(θ|x)
[Bishop 2006]
5
!
!
!
V := d2 dtT dtCq∗(t)
mean = d dtC(t)
Σ := d2 dtT dtCp(·|x)(t)
log pt(θ) := log p(θ|x) + tT θ − C(t), MFVB q∗
t
Σ = d dtT Eptθ
≈ d dtT Eq∗
t θ
=: ˆ Σ
[Bishop 2006]
C(t) := log EetT θ p(θ|x)
5
!
!
!
V := d2 dtT dtCq∗(t)
mean = d dtC(t)
Σ := d2 dtT dtCp(·|x)(t)
log pt(θ) := log p(θ|x) + tT θ − C(t), MFVB q∗
t
Σ = d dtT Eptθ
≈ d dtT Eq∗
t θ
=: ˆ Σ
[Bishop 2006]
C(t) := log EetT θ p(θ|x) q∗(θ)
[Bishop 2006]
p(θ|x) q∗(θ)
5
!
!
!
V := d2 dtT dtCq∗(t)
mean = d dtC(t)
Σ := d2 dtT dtCp(·|x)(t)
log pt(θ) := log p(θ|x) + tT θ − C(t), MFVB q∗
t
Σ = d dtT Eptθ
≈ d dtT Eq∗
t θ
=: ˆ Σ
[Bishop 2006]
C(t) := log EetT θ p(θ|x) q∗(θ)
[Bishop 2006]
p(θ|x) q∗(θ)
5
!
!
!
V := d2 dtT dtCq∗(t)
mean = d dtC(t)
Σ := d2 dtT dtCp(·|x)(t)
log pt(θ) := log p(θ|x) + tT θ − C(t), MFVB q∗
t
Σ = d dtT Eptθ
≈ d dtT Eq∗
t θ
=: ˆ Σ
[Bishop 2006]
C(t) := log EetT θ p(θ|x) q∗(θ)
[Bishop 2006]
p(θ|x) q∗(θ)
5
!
!
!
V := d2 dtT dtCq∗(t)
mean = d dtC(t)
Σ := d2 dtT dtCp(·|x)(t)
log pt(θ) := log p(θ|x) + tT θ − C(t), MFVB q∗
t
Σ = d dtT Eptθ
≈ d dtT Eq∗
t θ
=: ˆ Σ
[Bishop 2006]
C(t) := log EetT θ p(θ|x) q∗(θ)
[Bishop 2006]
p(θ|x) q∗(θ)
5
!
!
!
V := d2 dtT dtCq∗(t)
mean = d dtC(t)
Σ := d2 dtT dtCp(·|x)(t)
log pt(θ) := log p(θ|x) + tT θ − C(t), MFVB q∗
t
Σ = d dtT Eptθ
≈ d dtT Eq∗
t θ
=: ˆ Σ
[Bishop 2006]
C(t) := log EetT θ p(θ|x) q∗(θ)
[Bishop 2006]
p(θ|x) q∗(θ)
5
!
!
!
V := d2 dtT dtCq∗(t)
mean = d dtC(t)
Σ := d2 dtT dtCp(·|x)(t)
log pt(θ) := log p(θ|x) + tT θ − C(t), MFVB q∗
t
Σ = d dtT Eptθ
≈ d dtT Eq∗
t θ
=: ˆ Σ
[Bishop 2006]
C(t) := log EetT θ p(θ|x) q∗(θ)
[Bishop 2006]
p(θ|x) q∗(θ)
5
!
!
!
V := d2 dtT dtCq∗(t)
mean = d dtC(t)
Σ := d2 dtT dtCp(·|x)(t)
log pt(θ) := log p(θ|x) + tT θ − C(t), MFVB q∗
t
Σ = d dtT Eptθ
≈ d dtT Eq∗
t θ
=: ˆ Σ
[Bishop 2006]
C(t) := log EetT θ p(θ|x) q∗(θ)
[Bishop 2006]
p(θ|x) q∗(θ)
5
!
!
!
V := d2 dtT dtCq∗(t)
mean = d dtC(t)
Σ := d2 dtT dtCp(·|x)(t)
log pt(θ) := log p(θ|x) + tT θ − C(t), MFVB q∗
t
Σ = d dtT Eptθ
≈ d dtT Eq∗
t θ
=: ˆ Σ
[Bishop 2006]
C(t) := log EetT θ p(θ|x) q∗(θ)
[Bishop 2006]
p(θ|x) q∗(θ)
5
!
!
!
V := d2 dtT dtCq∗(t)
mean = d dtC(t)
Σ := d2 dtT dtCp(·|x)(t)
log pt(θ) := log p(θ|x) + tT θ − C(t), MFVB q∗
t
Σ = d dtT Eptθ
≈ d dtT Eq∗
t θ
=: ˆ Σ
[Bishop 2006]
C(t) := log EetT θ p(θ|x) q∗(θ)
[Bishop 2006]
p(θ|x) q∗(θ)
5
!
!
!
V := d2 dtT dtCq∗(t)
mean = d dtC(t)
Σ := d2 dtT dtCp(·|x)(t)
log pt(θ) := log p(θ|x) + tT θ − C(t), MFVB q∗
t
[Bishop 2006]
C(t) := log EetT θ p(θ|x) q∗(θ)
[Bishop 2006]
p(θ|x) q∗(θ) Σ = d dtT d dtCp(·|x)(t)
5
Σ = d dtT Eptθ
≈ d dtT Eq∗
t θ
=: ˆ Σ
!
!
!
V := d2 dtT dtCq∗(t)
mean = d dtC(t)
Σ := d2 dtT dtCp(·|x)(t)
log pt(θ) := log p(θ|x) + tT θ − C(t), MFVB q∗
t
[Bishop 2006]
C(t) := log EetT θ p(θ|x) q∗(θ)
[Bishop 2006]
p(θ|x) q∗(θ)
5
Σ = d dtT Eptθ
≈ d dtT Eq∗
t θ
=: ˆ Σ
!
!
!
V := d2 dtT dtCq∗(t)
mean = d dtC(t)
Σ := d2 dtT dtCp(·|x)(t)
log pt(θ) := log p(θ|x) + tT θ − C(t), MFVB q∗
t
[Bishop 2006]
C(t) := log EetT θ p(θ|x) q∗(θ)
[Bishop 2006]
p(θ|x) q∗(θ)
5
Σ = d dtT Eptθ
≈ d dtT Eq∗
t θ
=: ˆ Σ
!
!
!
V := d2 dtT dtCq∗(t)
mean = d dtC(t)
Σ := d2 dtT dtCp(·|x)(t)
log pt(θ) := log p(θ|x) + tT θ − C(t), MFVB q∗
t
[Bishop 2006]
C(t) := log EetT θ p(θ|x) q∗(θ)
[Bishop 2006]
p(θ|x) q∗(θ)
5
Σ = d dtT Eptθ
≈ d dtT Eq∗
t θ
=: ˆ Σ
!
!
!
V := d2 dtT dtCq∗(t)
mean = d dtC(t)
Σ := d2 dtT dtCp(·|x)(t)
log pt(θ) := log p(θ|x) + tT θ − C(t), MFVB q∗
t
[Bishop 2006]
C(t) := log EetT θ p(θ|x) q∗(θ)
[Bishop 2006]
p(θ|x) q∗(θ)
5
!
!
!
V := d2 dtT dtCq∗(t)
mean = d dtC(t)
Σ := d2 dtT dtCp(·|x)(t)
log pt(θ) := log p(θ|x) + tT θ − C(t), MFVB q∗
t
Σ = d dtT Eptθ
≈ d dtT Eq∗
t θ
=: ˆ Σ
[Bishop 2006]
C(t) := log EetT θ p(θ|x) q∗(θ)
5
ˆ Σ := d dtT Eq∗
t θ
qt mt
6
ˆ Σ := d dtT Eq∗
t θ
qt mt
6
ˆ Σ := d dtT Eq∗
t θ
qt mt = (I − V H)−1V
6
ˆ Σ = ✓ ∂2KL ∂m∂mT
◆−1 ˆ Σ := d dtT Eq∗
t θ
qt mt = (I − V H)−1V
6
ˆ Σ = ✓ ∂2KL ∂m∂mT
◆−1 ˆ Σ := d dtT Eq∗
t θ
qt mt = (I − V H)−1V
6
ˆ Σ = ✓ ∂2KL ∂m∂mT
◆−1 ˆ Σ := d dtT Eq∗
t θ
qt mt = (I − V H)−1V
6
ˆ Σ = ✓ ∂2KL ∂m∂mT
◆−1 ˆ Σ := d dtT Eq∗
t θ
qt mt = (I − V H)−1V
6
t θ
ˆ Σ = ✓ ∂2KL ∂m∂mT
◆−1 ˆ Σ := d dtT Eq∗
t θ
qt mt = (I − V H)−1V
6
t θ
p(θ|x) q∗(θ)
[Bishop 2006]
ˆ Σ = ✓ ∂2KL ∂m∂mT
◆−1 ˆ Σ := d dtT Eq∗
t θ
qt mt = (I − V H)−1V
6
t θ
p(θ|x) q∗(θ)
exact mean (e.g. multivariate normal)
[Bishop 2006]
ˆ Σ = ✓ ∂2KL ∂m∂mT
◆−1 ˆ Σ := d dtT Eq∗
t θ
qt mt = (I − V H)−1V
6
Morocco, Philippines, Ethiopia)
! !
7
Morocco, Philippines, Ethiopia)
! !
7
Morocco, Philippines, Ethiopia)
! !
7
Morocco, Philippines, Ethiopia)
! !
7
Morocco, Philippines, Ethiopia)
! !
7
Morocco, Philippines, Ethiopia)
! !
ykn
indep
∼ N(µk + Tknτk, σ2
k)
profit
7
Morocco, Philippines, Ethiopia)
! !
ykn
indep
∼ N(µk + Tknτk, σ2
k)
profit
7
Morocco, Philippines, Ethiopia)
! !
ykn
indep
∼ N(µk + Tknτk, σ2
k)
profit
7
Morocco, Philippines, Ethiopia)
! !
ykn
indep
∼ N(µk + Tknτk, σ2
k)
profit
7
Morocco, Philippines, Ethiopia)
! !
ykn
indep
∼ N(µk + Tknτk, σ2
k)
profit 1 if microcredit
7
Morocco, Philippines, Ethiopia)
! !
ykn
indep
∼ N(µk + Tknτk, σ2
k)
profit 1 if microcredit
7
Morocco, Philippines, Ethiopia)
! !
ykn
indep
∼ N(µk + Tknτk, σ2
k)
profit 1 if microcredit
7
Morocco, Philippines, Ethiopia)
! !
ykn
indep
∼ N(µk + Tknτk, σ2
k)
profit 1 if microcredit
7
Morocco, Philippines, Ethiopia)
! !
ykn
indep
∼ N(µk + Tknτk, σ2
k)
profit 1 if microcredit
7
Morocco, Philippines, Ethiopia)
! !
ykn
indep
∼ N(µk + Tknτk, σ2
k)
✓ µk τk ◆
iid
∼ N ✓✓ µ τ ◆ , C ◆ profit 1 if microcredit
7
Morocco, Philippines, Ethiopia)
! !
ykn
indep
∼ N(µk + Tknτk, σ2
k)
✓ µk τk ◆
iid
∼ N ✓✓ µ τ ◆ , C ◆ σ−2
k iid
∼ Γ(a, b) profit 1 if microcredit
7
Morocco, Philippines, Ethiopia)
! !
ykn
indep
∼ N(µk + Tknτk, σ2
k)
✓ µk τk ◆
iid
∼ N ✓✓ µ τ ◆ , C ◆ ✓ µ τ ◆
iid
∼ N ✓✓ µ0 τ0 ◆ , Λ−1 ◆ σ−2
k iid
∼ Γ(a, b) profit 1 if microcredit
7
C ∼ Sep&LKJ(η, c, d)
8
MFVB
8
MFVB
8
MCMC draws: 45 minutes
uncertainties, all sensitivity measures: 58 seconds!
and data sets: Mixture models, generalized linear mixed models, etc
MFVB
8
MCMC draws: 45 minutes
uncertainties, all sensitivity measures: 58 seconds!
and data sets: Mixture models, generalized linear mixed models, etc
MFVB
8
MCMC draws: 45 minutes
uncertainties, all sensitivity measures: 58 seconds!
and data sets: Mixture models, generalized linear mixed models, etc
MFVB
LRVB,! MFVB
8
MCMC draws: 45 minutes
uncertainties, all sensitivity measures: 58 seconds!
and data sets: Mixture models, generalized linear mixed models, etc
MFVB
LRVB,! MFVB
8
9
9
p(θ|x) ∝θ p(x|θ)p(θ)
10
pα(θ) := p(θ|x, α) ∝θ p(x|θ)p(θ|α)
10
pα(θ) := p(θ|x, α) ∝θ p(x|θ)p(θ|α)
10
pα(θ) := p(θ|x, α) ∝θ p(x|θ)p(θ|α)
10
Bayes Theorem
pα(θ) := p(θ|x, α) ∝θ p(x|θ)p(θ|α)
10
Bayes Theorem
pα(θ) := p(θ|x, α) ∝θ p(x|θ)p(θ|α)
S := dEpα[g(θ)] dα
∆α
10
Bayes Theorem
pα(θ) := p(θ|x, α) ∝θ p(x|θ)p(θ|α)
S := dEpα[g(θ)] dα
∆α
10
Bayes Theorem
pα(θ) := p(θ|x, α) ∝θ p(x|θ)p(θ|α)
S := dEpα[g(θ)] dα
∆α
10
Bayes Theorem
pα(θ) := p(θ|x, α) ∝θ p(x|θ)p(θ|α)
S := dEpα[g(θ)] dα
∆α
10
Bayes Theorem
pα(θ) := p(θ|x, α) ∝θ p(x|θ)p(θ|α)
S := dEpα[g(θ)] dα
∆α
10
Bayes Theorem
pα(θ) := p(θ|x, α) ∝θ p(x|θ)p(θ|α)
S := dEpα[g(θ)] dα
∆α ≈ dEq∗
α[g(θ)]
dα
∆α =: ˆ S
10
Bayes Theorem
pα(θ) := p(θ|x, α) ∝θ p(x|θ)p(θ|α)
S := dEpα[g(θ)] dα
∆α ≈ dEq∗
α[g(θ)]
dα
∆α =: ˆ S LRVB estimator
10
Bayes Theorem
pα(θ) := p(θ|x, α) ∝θ p(x|θ)p(θ|α)
S := dEpα[g(θ)] dα
∆α ≈ dEq∗
α[g(θ)]
dα
∆α =: ˆ S LRVB estimator
q∗
α
10
Bayes Theorem
ˆ S = A ✓ ∂2KL ∂m∂mT
◆−1 B pα(θ) := p(θ|x, α) ∝θ p(x|θ)p(θ|α)
S := dEpα[g(θ)] dα
∆α ≈ dEq∗
α[g(θ)]
dα
∆α =: ˆ S LRVB estimator
q∗
α
10
C ∼ Sep&LKJ(η, c, d)
Morocco, Philippines, Ethiopia)
! !
ykn
indep
∼ N(µk + Tknτk, σ2
k)
✓ µk τk ◆
iid
∼ N ✓✓ µ τ ◆ , C ◆ ✓ µ τ ◆
iid
∼ N ✓✓ µ0 τ0 ◆ , Λ−1 ◆ σ−2
k iid
∼ Γ(a, b) profit 1 if microcredit
11
12
MFVB
12
0.03 ➔ 0.04
MFVB
12
0.03 ➔ 0.04 Sensitivity
MFVB LRVB
12
0.03 ➔ 0.04 Sensitivity
MFVB LRVB
12
13
the expected microcredit effect (τ)
be on scale of standard deviations in τ
13
the expected microcredit effect (τ)
be on scale of standard deviations in τ
13
13
the expected microcredit effect (τ)
be on scale of standard deviations in τ
StdDevqτ = 1.8
13
the expected microcredit effect (τ)
be on scale of standard deviations in τ
StdDevqτ = 1.8 Eqτ = 3.7
13
the expected microcredit effect (τ)
be on scale of standard deviations in τ
StdDevqτ = 1.8 Eqτ = 3.7 = 2.06 ∗ StdDevqτ
13
the expected microcredit effect (τ)
be on scale of standard deviations in τ
StdDevqτ = 1.8 Eqτ = 3.7 = 2.06 ∗ StdDevqτ
13
the expected microcredit effect (τ)
be on scale of standard deviations in τ
StdDevqτ = 1.8 Eqτ = 3.7 = 2.06 ∗ StdDevqτ
13
the expected microcredit effect (τ)
be on scale of standard deviations in τ
StdDevqτ = 1.8 Eqτ = 3.7 = 2.06 ∗ StdDevqτ Λ12 + = 0.03
13
the expected microcredit effect (τ)
be on scale of standard deviations in τ
the expected microcredit effect (τ)
be on scale of standard deviations in τ
StdDevqτ = 1.8 Eqτ = 3.7 = 2.06 ∗ StdDevqτ Λ12 + = 0.03
13
Eqτ < 1.0 ∗ StdDevqτ
supplements MFVB for fast & accurate covariance estimate
quantification
14
T Broderick, N Boyd, A Wibisono, AC Wilson, and MI Jordan. Streaming variational Bayes. NIPS, 2013.
!
R Giordano, T Broderick, and MI Jordan. Linear response methods for accurate covariance estimates from mean field variational Bayes. NIPS, 2015.!
!
R Giordano, T Broderick, R Meager, J Huggins, and MI
Social Good Applications, 2016. ArXiv:1606.07153.!
!
J Huggins, T Campbell, and T Broderick. Core sets for scalable Bayesian logistic regression. Under review. ArXiv:1605.06423.
15
T Broderick, N Boyd, A Wibisono, AC Wilson, and MI Jordan. Streaming variational Bayes. NIPS, 2013.
!
R Giordano, T Broderick, and MI Jordan. Linear response methods for accurate covariance estimates from mean field variational Bayes. NIPS, 2015.!
!
R Giordano, T Broderick, R Meager, J Huggins, and MI
Social Good Applications, 2016. ArXiv:1606.07153.!
!
J Huggins, T Campbell, and T Broderick. Core sets for scalable Bayesian logistic regression. Under review. ArXiv:1605.06423.
15
R Bardenet, A Doucet, and C Holmes. On Markov chain Monte Carlo methods for tall data. arXiv, 2015. CM Bishop. Pattern Recognition and Machine Learning, 2006. D Dunson. Robust and scalable approach to Bayesian inference. Talk at ISBA 2014. B Fosdick. Modeling Heterogeneity within and between Matrices and Arrays, Chapter 4.7. PhD Thesis, University of Washington, 2013. DJC MacKay. Information Theory, Inference, and Learning Algorithms. Cambridge University Press, 2003. R Meager. Understanding the impact of microcredit expansions: A Bayesian hierarchical analysis of 7 randomised experiments. ArXiv:1506.06669, 2015. RE Turner and M Sahani. Two problems with variational expectation maximisation for time- series models. In D Barber, AT Cemgil, and S Chiappa, editors, Bayesian Time Series Models, 2011. B Wang and M Titterington. Inadequacy of interval estimates corresponding to variational Bayesian approximations. In AISTATS, 2004.
16