- 1
Updates-Leak: Data Set Inference and Reconstruction Attacks in Online Learning
Ahmed Salem, Apratim Bhattacharya, Michael Backes Mario Fritz,Yang Zhang
CISPA Helmholtz Center for Information Security, Max Planck Institute for Informatics
Updates-Leak: Data Set Inference and Reconstruction Attacks in - - PowerPoint PPT Presentation
Updates-Leak: Data Set Inference and Reconstruction Attacks in Online Learning Ahmed Salem , Apratim Bhattacharya, Michael Backes Mario Fritz,Yang Zhang CISPA Helmholtz Center for Information Security, Max Planck Institute for Informatics 1
Ahmed Salem, Apratim Bhattacharya, Michael Backes Mario Fritz,Yang Zhang
CISPA Helmholtz Center for Information Security, Max Planck Institute for Informatics
2
Model
Train U p d a t e
today has been created in the last two years alone
Training set Updating set
35 70 0 1 2 3 4 5 6 7 8 9 35 70 0 1 2 3 4 5 6 7 8 9
3
Target Model 25 50 0 1 2 3 4 5 6 7 8 9
Update
25 50 0 1 2 3 4 5 6 7 8 9
?
Research Question: Can this posterior difference be a new attack surface?
model’s dataset
4
Attack Model
Decoder
Target Model 25 50 0 1 2 3 4 5 6 7 8 9 35 70 0 1 2 3 4 5 6 7 8 9
Posterior difference
Encoder
Single-sample label Inference Single-sample reconstruction Multi-sample label distribution Multi-sample reconstruction
5
Probing set Probing set
Update
?
Target Model Shadow Model
updating set 1 updating set n . . .
Shadow Updated Model 1 Shadow Updated Model n
Posterior difference 1 Posterior difference n . . . . . . X Y
6
Update Update
Probing Set
7
Attack Model
Decoder Target Model 25 50 0 1 2 3 4 5 6 7 8 9 35 70 0 1 2 3 4 5 6 7 8 9
Posterior difference
Encoder Probing set Probing set
Update
?
Single-sample label Inference
It is a 0
better than baseline for MNIST and CIFAR-10
8
Attack Model
Decoder Target Model 25 50 0 1 2 3 4 5 6 7 8 9 35 70 0 1 2 3 4 5 6 7 8 9
Posterior difference
Encoder Probing set Probing set
Update
?
Single-sample reconstruction
label
generator
decoder
9
Encoder Decoder Encoder Decoder Autoencoder Transfer
10
11
0.000 0.005 0.010 0.015 0.020 0.025 0.030 0.035
Mean squared error (MSE)
Autoencoder (Oracle) ASSR Label-random Random 0.00 0.02 0.04 0.06 0.08 0.10
Mean squared error (MSE)
Autoencoder (Oracle) ASSR Label-random Random
CIFAR-10 MNIST
?
0 1 2 3 4 5 6 7 8 9
12
Attack Model
Decoder Target Model 25 50 0 1 2 3 4 5 6 7 8 9 35 70 0 1 2 3 4 5 6 7 8 9
Posterior difference
Encoder Probing set Probing set
Update
Multi-sample label distribution
KL-divergence as the loss
MNIST (10) CIFAR-10 (10) 0.00 0.02 0.04 0.06 0.08 0.10 0.12
KL-divergence
ALDE Baseline Transfer 100-10 MNIST (100) CIFAR-10 (100) 0.00 0.01 0.02 0.03 0.04 0.05
KL-divergence
ALDE Baseline Transfer 10-100
13
14
Attack Model
Decoder Target Model 25 50 0 1 2 3 4 5 6 7 8 9 35 70 0 1 2 3 4 5 6 7 8 9
Posterior difference
Encoder Probing set Probing set
Update
?
Multi-sample reconstruction
attack scenario
Image credit: Thalles Silva
15
Encoder Generator Standard Gaussian Noise Discriminator
Best match loss
16
17
MNIST CIFAR-10 0.00 0.01 0.02 0.03 0.04 0.05 0.06
Mean squared error (MSE)
One-to-one match AMSR Baseline
18
19
It is a 0
0 1 2 3 4 5 6 7 8 9
Multi-sample label distribution
20
Target Model
Attack Model
Decoder
Posterior difference
Encoder
Single-sample label Inference Single-sample reconstruction Multi-sample reconstruction
25 50 0 1 2 3 4 5 6 7 8 9 Probing set 35 70 0 1 2 3 4 5 6 7 8 9 Probing set
Update
?
Thank you for your attention! Questions?
ahmed.salem@cispa.saarland https://ahmedsalem2.github.io/ @AhmedGaSalem