Approximate Cross-Validation and Dynamic Experiments for Policy Choice
Approximate Cross-Validation and Dynamic Experiments for Policy Choice
Maximilian Kasy
Department of Economics, Harvard University
April 23, 2018
1 / 23
Approximate Cross-Validation and Dynamic Experiments for Policy - - PowerPoint PPT Presentation
Approximate Cross-Validation and Dynamic Experiments for Policy Choice Approximate Cross-Validation and Dynamic Experiments for Policy Choice Maximilian Kasy Department of Economics, Harvard University April 23, 2018 1 / 23 Approximate
Approximate Cross-Validation and Dynamic Experiments for Policy Choice
Department of Economics, Harvard University
1 / 23
Approximate Cross-Validation and Dynamic Experiments for Policy Choice Introduction
◮ First order approximation to leave-one-out estimator. ◮ Relationship to Stein’s unbiased risk estimator. ◮ Accelerated tuning. ◮ Joint with Lester Mackey, MSR.
◮ Experimental design problem for choosing discrete treatment. ◮ Goal: maximize average outcome. ◮ Multiple waves. ◮ Joint with Anja Sautman, J-PAL.
2 / 23
Approximate Cross-Validation and Dynamic Experiments for Policy Choice Approximate cross-validation
◮ Covariance penalties, ◮ Stein’s Unbiased Risk Estimate (SURE), ◮ Cross-validation (CV).
◮ Consider repeated draws of some vector. ◮ Then CV for estimating mean is approximately equal to SURE. ◮ Without normality, unknown variance!
◮ Consider penalized M-estimation problem. ◮ Then CV for prediction loss is approximately equal to
◮ with a simple penalty based on gradient, Hessian.
3 / 23
Approximate Cross-Validation and Dynamic Experiments for Policy Choice Approximate cross-validation
k Eθ
k ∑ j
◮ Choose tuning parameters to minimize estimated MSE. ◮ Choose between estimators to minimize estimated MSE. ◮ Theoretical tool for proving dominance results.
4 / 23
Approximate Cross-Validation and Dynamic Experiments for Policy Choice Approximate cross-validation
k ∑j MSEj, where
5 / 23
Approximate Cross-Validation and Dynamic Experiments for Policy Choice Approximate cross-validation
k Eθ
k
◮ X is normally distributed. ◮ Σ is known. ◮
6 / 23
Approximate Cross-Validation and Dynamic Experiments for Policy Choice Approximate cross-validation
n ∑ i
1 n−1 ∑ i′=i
n ∑ i
7 / 23
Approximate Cross-Validation and Dynamic Experiments for Policy Choice Approximate cross-validation
n ∑i Y n i with (Y n i −θ)/√
n2 ∑i(Y n i − X n)(Y n i − X n)′. Then
◮ normality, ◮ known Σ!
8 / 23
Approximate Cross-Validation and Dynamic Experiments for Policy Choice Approximate cross-validation
s(Yi − X)
s Ui
s
s Ui)
n ∑ i
i .
s
+2
θ,(s+ 1
s
θ′)Ui
1
s2
θ′(X)·Ui2+2∆i,Yi− θ−i
n ∑ i
+0+op( 1 n ).
9 / 23
Approximate Cross-Validation and Dynamic Experiments for Policy Choice Approximate cross-validation
b
i
b
j=i
n ∑ i
10 / 23
Approximate Cross-Validation and Dynamic Experiments for Policy Choice Approximate cross-validation
j
n ∑ i
11 / 23
Approximate Cross-Validation and Dynamic Experiments for Policy Choice Approximate cross-validation
n ∑ i
i
i · H−1 · gi.
12 / 23
Approximate Cross-Validation and Dynamic Experiments for Policy Choice Approximate cross-validation
13 / 23
Approximate Cross-Validation and Dynamic Experiments for Policy Choice Dynamic experiments for policy choice
◮ Optimal treatment assignment (multiple treatments) ◮ in multi-wave experiments. ◮ Goal: After experiment, choose a policy ◮ to maximize welfare (average outcome net of costs).
14 / 23
Approximate Cross-Validation and Dynamic Experiments for Policy Choice Dynamic experiments for policy choice
d
d=1
it .
it ,...,Y ¯ d it ) are i.i.d. across both i and t.
t ]
t = ∑ i
t = ∑ i
it = 1).
15 / 23
Approximate Cross-Validation and Dynamic Experiments for Policy Choice Dynamic experiments for policy choice
t ,...,n¯ d t ).
t ,...,s¯ d t ).
t ,...,m¯ d t ) = ∑ t′≤t
t ,...,s¯ d t ) = ∑ t′≤t
16 / 23
Approximate Cross-Validation and Dynamic Experiments for Policy Choice Dynamic experiments for policy choice
17 / 23
Approximate Cross-Validation and Dynamic Experiments for Policy Choice Dynamic experiments for policy choice
0 ,β d 0 ), independent across d.
t ,β d t )
t = αd 0 + r d t
t = β d 0 + md t − r d t .
0 + r d T
0 +β d 0 + md T
18 / 23
Approximate Cross-Validation and Dynamic Experiments for Policy Choice Dynamic experiments for policy choice
◮ States (mt,rt) ∈ {0,...,Mt−1}2¯
d,
◮ actions nt ∈ {0,...,Nt}¯
d,
◮ transitions
t = s|mt−1,rt−1,nd t ) =
t
t−1 + s,β d t−1 + nd t − s)
t−1,β d t−1)
19 / 23
Approximate Cross-Validation and Dynamic Experiments for Policy Choice Dynamic experiments for policy choice
d
d
0 + r d T
0 +β d 0 + md T
20 / 23
Approximate Cross-Validation and Dynamic Experiments for Policy Choice Dynamic experiments for policy choice
nt: ∑d nd
t ≤Nt
t (mt−1,rt−1) = argmax nt: ∑d nd
t ≤Nt
21 / 23
Approximate Cross-Validation and Dynamic Experiments for Policy Choice Dynamic experiments for policy choice
22 / 23
Approximate Cross-Validation and Dynamic Experiments for Policy Choice Dynamic experiments for policy choice
23 / 23