Promoting Fairness in Learned Models by Learning to Active Learn under Parity Constraints
Amr Sharaf University of Maryland amr@cs.umd.edu Hal Daumé III University of Maryland Microsoft Research me@hal3.name
1
Promoting Fairness in Learned Models by Learning to Active Learn - - PowerPoint PPT Presentation
Promoting Fairness in Learned Models by Learning to Active Learn under Parity Constraints Amr Sharaf Hal Daum III University of Maryland University of Maryland Microsoft Research amr@cs.umd.edu me@hal3.name 1 Can we learn to active learn
Amr Sharaf University of Maryland amr@cs.umd.edu Hal Daumé III University of Maryland Microsoft Research me@hal3.name
1
2
Pre-existing data D = (U,)
U
3
Pre-existing data D = (U,)
Transformer Selection Policy π
Feed Forward Decoder
U
4
Pre-existing data D = (U,)
Transformer Selection Policy π
Distribution Q Over U, Y
Feed Forward Decoder
U
Gumbel(0) + Q=π(h0,D) x B sampled items
5
Pre-existing data D = (U,)
Transformer Selection Policy π
Train Classifier hB = A(DB)
Distribution Q Over U, Y
Feed Forward Decoder
U
Gumbel(0) + Q=π(h0,D) x B sampled items
6
Pre-existing data D = (U,)
Transformer Selection Policy π
Train Classifier hB = A(DB)
Evaluate Meta-Loss
𝔽Vℓ(hB) / Δv(hB) Distribution Q Over U, Y
Feed Forward Decoder
Gumbel(0) + Q=π(h0,D) x B sampled items
U
7
8
Transformer Selection Policy π
Train Classifier hB = A(DB)
Evaluate Meta-Loss
𝔽Vℓ(hB) / Δv(hB) Distribution Q Over U, Y
Feed Forward Decoder
U
Gumbel(0) + Q=π(h0,D) x B sampled items
9
Pre-existing data D = (U, Y)
Transformer Selection Policy π
Train Classifier hB = A(DB)
Evaluate Meta-Loss
𝔽Vℓ(hB) / Δv(hB) Distribution Q Over U, Y
Feed Forward Decoder
U Y
Gumbel(0) + Q=π(h0,D) x B sampled items
10
Pre-existing data D = (U, Y)
Transformer Selection Policy π
Train Classifier hB = A(DB)
Evaluate Meta-Loss
𝔽Vℓ(hB) / Δv(hB)
Compute Gradients w.r.t parameters of π update π to minimize performance/parity loss
Distribution Q Over U, Y
Feed Forward Decoder
U Y
Gumbel(0) + Q=π(h0,D) x B sampled items
11
Random Sampling Fairlearn PANDA Fair Active Learning Entropy Sampling Group Aware Random Sampling
12
F-Score vs Budget for different Active Learning Algorithms
F-score 0.3 0.4 0.5 0.6 0.7 Budget 100 200 300 400
Demographic Disparity vs Budget for different Active Learning Algorithms
Demographic Disparity 0.035 0.07 0.105 0.14 Budget 100 200 300 400
Random Sampling Fairlearn PANDA Fair Active Learning Entropy Sampling Group Aware Random Sampling
13
Random Sampling Fairlearn PANDA Fair Active Learning Entropy Sampling Group Aware
Demographic Disparity vs F-Score
F-Score
0.45 0.488 0.525 0.563 0.6
Demographic Disparity
0.025 0.044 0.062 0.081 0.1
Error Rate Balance vs F-Score
F-Score
0.45 0.488 0.525 0.563 0.6
Error Rate Balance
0.025 0.119 0.213 0.306 0.4
Random Sampling Fairlearn PANDA Fair Active Learning Entropy Sampling Group Aware
14
15
16