The Analysis of Placement Values for Evaluating Discriminatory Measures
Margaret Sullivan Pepe & Tianxi Cai
Biometrics (2004)
Allison Meisner · May 27, 2014
1
The Analysis of Placement Values for Evaluating Discriminatory - - PowerPoint PPT Presentation
The Analysis of Placement Values for Evaluating Discriminatory Measures Margaret Sullivan Pepe & Tianxi Cai Biometrics (2004) Allison Meisner May 27, 2014 1 Overview When we have a continuous test Y and a binary outcome D , the ROC
1
2
◮ PSA levels differ by age: older men typically have higher
◮ Age can potentially affect the ability of PSA to
◮ Among PCa cases, PSA measured closer to diagnosis does
3
4
5
6
7
8
9
10
11
12
13
14
15
◮ ROC model (Pepe, 1997): ROCZD(u) = g(βT ZD + Hα(u))
◮ α = underlying shape of ROC curve ◮ β = impact of ZD on shape of ROC curve
◮ Problem: estimation
◮ Pepe (2000) and Alonzo and Pepe (2002) create indicators
D (1 − u)) for some set of FPRs u and then use
◮ Pepe & Cai propose using placement values and what is
16
◮ Definitions
◮ Placement values: UDi = 1 − FD(YDi) for the ith diseased
◮ If ZD affects the distribution of Y in the reference
◮ ROC curve: ROC(u) = P(YD ≥ F −1
D (1 − u)) = (TPR at
◮ Relationship between ROC and placement values
D (1 − u)) = P(1 − u ≤ FD(YD))
17
18
◮ ROC model (Pepe, 1997): ROCZD(u) = g(βT ZD + Hα(u)) ◮ Proposed model: Hα(UD) = −βT ZD + ǫ, where ǫ ∼ g ◮ Proof of equivalence:
◮ In our example, ZD = age and ZD = (age, time).
19
nD
20
◮ Pepe and Cai advise estimating FD,ZD nonparametrically if
◮ For semiparametric estimation, Pepe and Cai recommend
21
22
◮ YD = α−1 1 {α0 + β1Z1 + (β2 + 0.5α1)Z2 + ǫD}
◮ Z1 ∼ Bernoulli(0.5), Z2 ∼ Uniform(0, 1) ◮ ǫD ∼ N(0, 1), ǫD ∼ N(0, 1)
D (1 − u) ≤ α−1 1 {α0 + β1z1 + (β2 + 0.5α1)z2 + ǫD)
1 {α0 + β1z1 + (β2 + 0.5α1)z2 + ǫD})
23
nD
24
◮ Bias ◮ Empirical SE ◮ Mean estimated SE ◮ Empirical coverage probability ◮ Note: α0 = 1, α1 = 1, β1 = 0.5, β2 = 0.7 throughout ◮ Considered [a, b] = [0.01, 0.99] and [a, b] = [0.01, 0.20]
◮ Bias ◮ MSE ◮ Two sets of parameter values considered ◮ α0 = 1, α1 = 1, β1 = 0.5, β2 = 0.7 ◮ α0 = 1.5, α1 = 0.9, β1 = 0.5, β2 = 0.7 ◮ Considered [a, b] = [0.01, 0.99] and [a, b] = [0.01, 0.50] 25
◮ [a, b] = [0.01, 0.99]
26
◮ α0 = 1, α1 = 1, β1 = 0.5, β2 = 0.7 ◮ [a, b] = [0.01, 0.99]
27
◮ 88 PCa cases, 88 age-matched controls ◮ Recall, ZD = age and ZD = (age, time) ◮ Model: ROCZD,ZD(u) = Φ(α0 + α1Φ−1(u) + β1time + β2age) ◮ SE estimates from the bootstrap (500 replications)
28
◮ The proposed method has nice intuition behind it and
◮ Implementation of the proposed method is less
◮ In most scenarios, the proposed method is more
◮ Both methods are susceptible to misspecification in both
29
2 + N(0, (Z2 + 0.5)2)
30
◮ α0 = 1, α1 = 1, β1 = 0.5, β2 = 0.7
31
◮ α0 = 1.5, α1 = 0.9, β1 = 0.5, β2 = 0.7
32
◮ The proposed method has nice intuition behind it and
◮ Implementation of the proposed method is less
◮ In most scenarios, the proposed method is more
◮ Both methods are susceptible to misspecification in both
33
◮ [a, b] = [0.01, 0.20]
34
◮ α0 = 1, α1 = 1, β1 = 0.5, β2 = 0.7 ◮ [a, b] = [0.01, 0.50]
35
◮ α0 = 1.5, α1 = 0.9, β1 = 0.5, β2 = 0.7 ◮ [a, b] = [0.01, 0.99]
36
◮ α0 = 1.5, α1 = 0.9, β1 = 0.5, β2 = 0.7 ◮ [a, b] = [0.01, 0.0.5]
37