White-box vs Black-box: Bayes Optimal Strategies for Membership - - PowerPoint PPT Presentation

white box vs black box bayes optimal strategies for
SMART_READER_LITE
LIVE PREVIEW

White-box vs Black-box: Bayes Optimal Strategies for Membership - - PowerPoint PPT Presentation

White-box vs Black-box: Bayes Optimal Strategies for Membership Inference Alexandre Sablayrolles, Matthijs Douze, Yann Ollivier, Cordelia Schmid, Herv Jgou Facebook AI Research, Paris June 11 th , 2019 Context: Membership Inference


slide-1
SLIDE 1

White-box vs Black-box: Bayes Optimal Strategies for Membership Inference

Alexandre Sablayrolles, Matthijs Douze, Yann Ollivier, Cordelia Schmid, Hervé Jégou

Facebook AI Research, Paris June 11th, 2019

slide-2
SLIDE 2

Context: Membership Inference

  • Machine learning

Training set Machine Learning Model

slide-3
SLIDE 3

Context: Membership Inference

  • Machine learning
  • Membership Inference

Training set Machine Learning Model

Model Membership Inference Candidate images Image in training set ?

slide-4
SLIDE 4

Membership Inference

  • Black-box
  • White-box

Black-box model Membership Inference Candidate images Image in training set ? White-box model Membership Inference Candidate images Image in training set ?

slide-5
SLIDE 5

Goals

  • Give a formal framework for membership attacks
  • What is the best possible attack (asymptotically) ?
  • Compare white-box vs black-box attacks
  • Derive new membership inference attacks
slide-6
SLIDE 6

Notations

zi mi mi = 0 mi = 1 λ Sample Membership variable Bernoulli( )

: training set : test set

slide-7
SLIDE 7

Notations and assumptions

  • Assumption: posterior distribution
  • Temperature T represents stochasticity
  • T=1: Bayes
  • T->0: Average SGD, MAP inference

P(✓ | m1:n, z1:n) ∝ exp − 1 T

n

X

i=1

mi`(✓, zi) !

loss membership

slide-8
SLIDE 8

Formal results: optimal attack

  • Membership posterior:
  • Result

M(θ, z1) := P(m1 = 1 | θ, z1)

sigmoid

log ✓ λ 1 − λ ◆ 1 T (⌧pT (z1) − `(✓, z1))

=

M(θ, z1) = ET 2 4σ @s(z1, θ, pT ) | {z } +tλ 1 A 3 5

slide-9
SLIDE 9

Formal results: optimal attack

  • Membership posterior:
  • Result

M(θ, z1) := P(m1 = 1 | θ, z1)

sigmoid

log ✓ λ 1 − λ ◆ 1 T (⌧pT (z1) − `(✓, z1))

=

M(θ, z1) = ET 2 4σ @s(z1, θ, pT ) | {z } +tλ 1 A 3 5 Only depends on through evaluation of the loss!

θ

slide-10
SLIDE 10

Approximation strategies

  • MALT: a global threshold for all samples
  • MAST: compute a threshold for each sample
  • MATT: simulate influence of sample using Taylor approximation

sMALT(✓, z1) = −`(✓, z1) + ⌧ sMAST(✓, z1) = −`(✓, z1) + ⌧(z1) sMATT(✓, z1) = (✓ ✓∗

0)T rθ`(✓∗ 0, z1)

slide-11
SLIDE 11

Experiments

Data Training set Held-out set Learn model Membership inference Hide in/out label

slide-12
SLIDE 12

Membership inference on CIFAR

=> MATT outperforms MALT

Attack accuracy n 0 − 1 MALT MATT 400 52.1 54.4 57.0 1000 51.4 52.6 54.5 2000 50.8 51.7 53.0 4000 51.0 51.4 52.1 6000 50.7 51.0 51.8

Naïve Bayes Threshold-based Taylor based

slide-13
SLIDE 13

Comparison with the state of the art

=> State-of-the-art performance => Less computationally expensive

Method Attack accuracy Na¨ ıve Bayes (Yeom et al. [2018]) 69.4 Shadow models (Shokri et al. [2017]) 73.9 Global threshold 77.1 Sample-dependent threshold 77.6

slide-14
SLIDE 14

Large-scale experiments on Imagenet

=> Data augmentation decreases membership attacks accuracy

Model Augmentation 0-1 MALT Resnet101 None 76.3 90.4 Flip, Crop ±5 69.5 77.4 Flip, Crop 65.4 68.0 VGG16 None 77.4 90.8 Flip, Crop ±5 71.3 79.5 Flip, Crop 63.8 64.3

slide-15
SLIDE 15

Conclusion

  • Black-box attacks as good as white-box attacks
  • Our approximations for membership attacks are state-of-the-art
  • n two datasets
slide-16
SLIDE 16

White-box vs Black-box: Bayes Optimal Strategies for Membership Inference

Poster 172

Alexandre Sablayrolles, Matthijs Douze, Yann Ollivier, Cordelia Schmid, Hervé Jégou

Facebook AI Research, Paris June 20th, 2018