Approximate reconstruction of encrypted databases Paul Grubbs, - - PowerPoint PPT Presentation

approximate reconstruction of encrypted databases
SMART_READER_LITE
LIVE PREVIEW

Approximate reconstruction of encrypted databases Paul Grubbs, - - PowerPoint PPT Presentation

Approximate reconstruction of encrypted databases Paul Grubbs, Marie-Sarah Lacharit, Brice Minaud, Kenny Paterson Information Security Group ESSA2, Bertinoro, 9th July 2018 Situation overview General message from previous talk: Dont use


slide-1
SLIDE 1

ESSA2, Bertinoro, 9th July 2018

Information Security Group

Paul Grubbs, Marie-Sarah Lacharité, Brice Minaud, Kenny Paterson

Approximate reconstruction

  • f encrypted databases
slide-2
SLIDE 2

Situation overview

2

General message from previous talk: Don’t use range queries with access pattern leakage! Closer look:

  • KKNO16: full reconstruction…
  • Assuming i.i.d. uniform queries.
  • O(N4 log N) queries.
  • Kenny’s talk: full reconstruction…
  • Assuming density.
  • O(N log N) queries.
slide-3
SLIDE 3

Approximate reconstruction

3

New goal: δ-approximate reconstruction. Recover the values of records within δN. Two new tools:

  • VC theory (machine learning).
  • PQ-trees.

➞ We would like to get best possible reconstruction with given queries. And handle large N’s. And get rid of the density assumption, and i.i.d. queries. LMP18 approximate attack but: only improvement in log factor, complicated analysis, requires density…

slide-4
SLIDE 4

Plan

4

  • 1. VC theory.
  • 2. PQ trees.
slide-5
SLIDE 5

VC theory

slide-6
SLIDE 6

Warm-up

6

Set X with probability distribution D. Let C ⊆ X. Call it a concept. X C Sample complexity: to measure Pr(C) within δ, you need O(1/δ2) samples.

Pr(C) ≈ #points in C #points total

<latexit sha1_base64="DjnHOxRz4I3ci4soPVaWmu0s1+E=">AC1XicbVFNixNBEO2MX2v82KwevTRmF1bQMCOCHheD4DGC2SxkQqj01CTN9sfQXRM3DnMTr/4Of41X9/Yk01gN2tBw+NVe9V7NCSU9xfNmK7ty9d/B3sP2o8dPnu53Dp6dels6gUNhlXVnM/CopMEhSVJ4VjgEPVM4mp3m/xoic5La7SqsCJhrmRuRAgZp2+unAHfdf8RSKwtkLnuYORJV2U8ILqgorDXkuDT/sH9b1Lk+WQNX1tNONe/E6+G2QbECXbWIwPWh9TzMrSo2GhALvx0lc0KQCR1IorNtp6bEAcQ5zHAdoQKOfVGu1NT8KTMZz68IzxNfs9Y4KtPcrPQuVGmjhd3MN+b/cuKT8w6SpigJjbgalJcqyOSNdTyTDgWpVQAgnAy7crGA4BcFg9vto+tzFqiWSEGJQ4PfhNUaTFalOWipVhnmUCoKfvp8i29s6cFs1+SfDIRjBrNLZ6SZc2t4k37j0cmcN1Vc2wxf8zCAl35dQgt0HM1SOmsak4PO7ZftcK1k9za3wenbXhL3ki/vuicfN3fbYy/YS3bMEvaenbDPbMCGTLDf7A/7y6jUVRHP6KfV6VRa9PznN2I6Nc/0+vmnQ=</latexit><latexit sha1_base64="DjnHOxRz4I3ci4soPVaWmu0s1+E=">AC1XicbVFNixNBEO2MX2v82KwevTRmF1bQMCOCHheD4DGC2SxkQqj01CTN9sfQXRM3DnMTr/4Of41X9/Yk01gN2tBw+NVe9V7NCSU9xfNmK7ty9d/B3sP2o8dPnu53Dp6dels6gUNhlXVnM/CopMEhSVJ4VjgEPVM4mp3m/xoic5La7SqsCJhrmRuRAgZp2+unAHfdf8RSKwtkLnuYORJV2U8ILqgorDXkuDT/sH9b1Lk+WQNX1tNONe/E6+G2QbECXbWIwPWh9TzMrSo2GhALvx0lc0KQCR1IorNtp6bEAcQ5zHAdoQKOfVGu1NT8KTMZz68IzxNfs9Y4KtPcrPQuVGmjhd3MN+b/cuKT8w6SpigJjbgalJcqyOSNdTyTDgWpVQAgnAy7crGA4BcFg9vto+tzFqiWSEGJQ4PfhNUaTFalOWipVhnmUCoKfvp8i29s6cFs1+SfDIRjBrNLZ6SZc2t4k37j0cmcN1Vc2wxf8zCAl35dQgt0HM1SOmsak4PO7ZftcK1k9za3wenbXhL3ki/vuicfN3fbYy/YS3bMEvaenbDPbMCGTLDf7A/7y6jUVRHP6KfV6VRa9PznN2I6Nc/0+vmnQ=</latexit><latexit sha1_base64="DjnHOxRz4I3ci4soPVaWmu0s1+E=">AC1XicbVFNixNBEO2MX2v82KwevTRmF1bQMCOCHheD4DGC2SxkQqj01CTN9sfQXRM3DnMTr/4Of41X9/Yk01gN2tBw+NVe9V7NCSU9xfNmK7ty9d/B3sP2o8dPnu53Dp6dels6gUNhlXVnM/CopMEhSVJ4VjgEPVM4mp3m/xoic5La7SqsCJhrmRuRAgZp2+unAHfdf8RSKwtkLnuYORJV2U8ILqgorDXkuDT/sH9b1Lk+WQNX1tNONe/E6+G2QbECXbWIwPWh9TzMrSo2GhALvx0lc0KQCR1IorNtp6bEAcQ5zHAdoQKOfVGu1NT8KTMZz68IzxNfs9Y4KtPcrPQuVGmjhd3MN+b/cuKT8w6SpigJjbgalJcqyOSNdTyTDgWpVQAgnAy7crGA4BcFg9vto+tzFqiWSEGJQ4PfhNUaTFalOWipVhnmUCoKfvp8i29s6cFs1+SfDIRjBrNLZ6SZc2t4k37j0cmcN1Vc2wxf8zCAl35dQgt0HM1SOmsak4PO7ZftcK1k9za3wenbXhL3ki/vuicfN3fbYy/YS3bMEvaenbDPbMCGTLDf7A/7y6jUVRHP6KfV6VRa9PznN2I6Nc/0+vmnQ=</latexit><latexit sha1_base64="DjnHOxRz4I3ci4soPVaWmu0s1+E=">AC1XicbVFNixNBEO2MX2v82KwevTRmF1bQMCOCHheD4DGC2SxkQqj01CTN9sfQXRM3DnMTr/4Of41X9/Yk01gN2tBw+NVe9V7NCSU9xfNmK7ty9d/B3sP2o8dPnu53Dp6dels6gUNhlXVnM/CopMEhSVJ4VjgEPVM4mp3m/xoic5La7SqsCJhrmRuRAgZp2+unAHfdf8RSKwtkLnuYORJV2U8ILqgorDXkuDT/sH9b1Lk+WQNX1tNONe/E6+G2QbECXbWIwPWh9TzMrSo2GhALvx0lc0KQCR1IorNtp6bEAcQ5zHAdoQKOfVGu1NT8KTMZz68IzxNfs9Y4KtPcrPQuVGmjhd3MN+b/cuKT8w6SpigJjbgalJcqyOSNdTyTDgWpVQAgnAy7crGA4BcFg9vto+tzFqiWSEGJQ4PfhNUaTFalOWipVhnmUCoKfvp8i29s6cFs1+SfDIRjBrNLZ6SZc2t4k37j0cmcN1Vc2wxf8zCAl35dQgt0HM1SOmsak4PO7ZftcK1k9za3wenbXhL3ki/vuicfN3fbYy/YS3bMEvaenbDPbMCGTLDf7A/7y6jUVRHP6KfV6VRa9PznN2I6Nc/0+vmnQ=</latexit>
slide-7
SLIDE 7

VC theory

7

Vapnik and Chervonenkis, 1971. X Now you have a set 𝓓 of concepts. The set of samples drawn from X is an ε-sample iff for all C in 𝓓:

  • Pr(C) − #points in C

#points total

  • ≤ ✏
<latexit sha1_base64="UfpOiKm2RL8/P6WTnBh3SDiIqYU=">AC63icbVFLbhNBEG0Pv2A+SWDJpoQTKUjEmkFIsIywkFgaCSdBHitq9TYrfRn6K5xYoY5BTvElnNwDg7AFq5Aj2NLiUNJLT29q7Pe+NCSU9x/KsV3bh56/adjbvte/cfPNzc2n506G3pBA6EVdYdj7lHJQ0OSJLC48Ih12OFR+PTXpM/mqHz0poPNC9wpPnEyFwKToE62fqYKszpC6R9t9d7BvuQ5o6LKu2khOdUFVYa8iAN7PR26nqdJ0tc1Xq5GTaNFH4CVIsvFRN707cjRcB10GyB2jP7JdutzmlRajQkFPd+mMQFjSruSAqFdTstPRZcnPIJDgM0XKMfVQsNatgNTAa5deEZgV7+UfFtfdzPQ6VmtPUr+ca8n+5YUn561ElTVESGnExKC9VuB0aQSGTDgWpeQBcOBl2BTHlQUQKsrfbu5fnTFHNkMIlDg2eCas1N1mV5lxLNc8w56WiILPV/jKlp6b1Zrw1vBgcXCgdEaCVgDTXrfo5M5NFWgbYbPIQyA0i9KaIoO0Myks6YROdy5atkObiXr3lwHhy+6SdxN3r/sHLxZ+rbBnrCnbI8l7BU7YO9Ynw2YD/Zb/aH/Y109DX6Fn2/KI1ayz+P2ZWIfvwD74bvfQ=</latexit><latexit sha1_base64="UfpOiKm2RL8/P6WTnBh3SDiIqYU=">AC63icbVFLbhNBEG0Pv2A+SWDJpoQTKUjEmkFIsIywkFgaCSdBHitq9TYrfRn6K5xYoY5BTvElnNwDg7AFq5Aj2NLiUNJLT29q7Pe+NCSU9x/KsV3bh56/adjbvte/cfPNzc2n506G3pBA6EVdYdj7lHJQ0OSJLC48Ih12OFR+PTXpM/mqHz0poPNC9wpPnEyFwKToE62fqYKszpC6R9t9d7BvuQ5o6LKu2khOdUFVYa8iAN7PR26nqdJ0tc1Xq5GTaNFH4CVIsvFRN707cjRcB10GyB2jP7JdutzmlRajQkFPd+mMQFjSruSAqFdTstPRZcnPIJDgM0XKMfVQsNatgNTAa5deEZgV7+UfFtfdzPQ6VmtPUr+ca8n+5YUn561ElTVESGnExKC9VuB0aQSGTDgWpeQBcOBl2BTHlQUQKsrfbu5fnTFHNkMIlDg2eCas1N1mV5lxLNc8w56WiILPV/jKlp6b1Zrw1vBgcXCgdEaCVgDTXrfo5M5NFWgbYbPIQyA0i9KaIoO0Myks6YROdy5atkObiXr3lwHhy+6SdxN3r/sHLxZ+rbBnrCnbI8l7BU7YO9Ynw2YD/Zb/aH/Y109DX6Fn2/KI1ayz+P2ZWIfvwD74bvfQ=</latexit><latexit sha1_base64="UfpOiKm2RL8/P6WTnBh3SDiIqYU=">AC63icbVFLbhNBEG0Pv2A+SWDJpoQTKUjEmkFIsIywkFgaCSdBHitq9TYrfRn6K5xYoY5BTvElnNwDg7AFq5Aj2NLiUNJLT29q7Pe+NCSU9x/KsV3bh56/adjbvte/cfPNzc2n506G3pBA6EVdYdj7lHJQ0OSJLC48Ih12OFR+PTXpM/mqHz0poPNC9wpPnEyFwKToE62fqYKszpC6R9t9d7BvuQ5o6LKu2khOdUFVYa8iAN7PR26nqdJ0tc1Xq5GTaNFH4CVIsvFRN707cjRcB10GyB2jP7JdutzmlRajQkFPd+mMQFjSruSAqFdTstPRZcnPIJDgM0XKMfVQsNatgNTAa5deEZgV7+UfFtfdzPQ6VmtPUr+ca8n+5YUn561ElTVESGnExKC9VuB0aQSGTDgWpeQBcOBl2BTHlQUQKsrfbu5fnTFHNkMIlDg2eCas1N1mV5lxLNc8w56WiILPV/jKlp6b1Zrw1vBgcXCgdEaCVgDTXrfo5M5NFWgbYbPIQyA0i9KaIoO0Myks6YROdy5atkObiXr3lwHhy+6SdxN3r/sHLxZ+rbBnrCnbI8l7BU7YO9Ynw2YD/Zb/aH/Y109DX6Fn2/KI1ayz+P2ZWIfvwD74bvfQ=</latexit><latexit sha1_base64="UfpOiKm2RL8/P6WTnBh3SDiIqYU=">AC63icbVFLbhNBEG0Pv2A+SWDJpoQTKUjEmkFIsIywkFgaCSdBHitq9TYrfRn6K5xYoY5BTvElnNwDg7AFq5Aj2NLiUNJLT29q7Pe+NCSU9x/KsV3bh56/adjbvte/cfPNzc2n506G3pBA6EVdYdj7lHJQ0OSJLC48Ih12OFR+PTXpM/mqHz0poPNC9wpPnEyFwKToE62fqYKszpC6R9t9d7BvuQ5o6LKu2khOdUFVYa8iAN7PR26nqdJ0tc1Xq5GTaNFH4CVIsvFRN707cjRcB10GyB2jP7JdutzmlRajQkFPd+mMQFjSruSAqFdTstPRZcnPIJDgM0XKMfVQsNatgNTAa5deEZgV7+UfFtfdzPQ6VmtPUr+ca8n+5YUn561ElTVESGnExKC9VuB0aQSGTDgWpeQBcOBl2BTHlQUQKsrfbu5fnTFHNkMIlDg2eCas1N1mV5lxLNc8w56WiILPV/jKlp6b1Zrw1vBgcXCgdEaCVgDTXrfo5M5NFWgbYbPIQyA0i9KaIoO0Myks6YROdy5atkObiXr3lwHhy+6SdxN3r/sHLxZ+rbBnrCnbI8l7BU7YO9Ynw2YD/Zb/aH/Y109DX6Fn2/KI1ayz+P2ZWIfvwD74bvfQ=</latexit>

V & C 1971: If 𝓓 has VC dimension d, then the number of points to get an ε-sample whp is O(d/ε2 log d/ε).

slide-8
SLIDE 8

VC dimension

8

A set S of points in X is shattered by 𝓓 iff every subset of S can be written in the form C∩S for some C in 𝓓. 1 N X 𝓓 = ranges X shattered X not shattered The VC dimension of 𝓓 is the largest cardinality d such that every subset of X of size d is shattered. e.g. for ranges the VC dimension is 2.

slide-9
SLIDE 9

Two main results: ε-samples and ε-nets

9

The set of samples drawn from X is an ε-sample iff for all C in 𝓓:

  • Pr(C) − #points in C

#points total

  • ≤ ✏
<latexit sha1_base64="UfpOiKm2RL8/P6WTnBh3SDiIqYU=">AC63icbVFLbhNBEG0Pv2A+SWDJpoQTKUjEmkFIsIywkFgaCSdBHitq9TYrfRn6K5xYoY5BTvElnNwDg7AFq5Aj2NLiUNJLT29q7Pe+NCSU9x/KsV3bh56/adjbvte/cfPNzc2n506G3pBA6EVdYdj7lHJQ0OSJLC48Ih12OFR+PTXpM/mqHz0poPNC9wpPnEyFwKToE62fqYKszpC6R9t9d7BvuQ5o6LKu2khOdUFVYa8iAN7PR26nqdJ0tc1Xq5GTaNFH4CVIsvFRN707cjRcB10GyB2jP7JdutzmlRajQkFPd+mMQFjSruSAqFdTstPRZcnPIJDgM0XKMfVQsNatgNTAa5deEZgV7+UfFtfdzPQ6VmtPUr+ca8n+5YUn561ElTVESGnExKC9VuB0aQSGTDgWpeQBcOBl2BTHlQUQKsrfbu5fnTFHNkMIlDg2eCas1N1mV5lxLNc8w56WiILPV/jKlp6b1Zrw1vBgcXCgdEaCVgDTXrfo5M5NFWgbYbPIQyA0i9KaIoO0Myks6YROdy5atkObiXr3lwHhy+6SdxN3r/sHLxZ+rbBnrCnbI8l7BU7YO9Ynw2YD/Zb/aH/Y109DX6Fn2/KI1ayz+P2ZWIfvwD74bvfQ=</latexit><latexit sha1_base64="UfpOiKm2RL8/P6WTnBh3SDiIqYU=">AC63icbVFLbhNBEG0Pv2A+SWDJpoQTKUjEmkFIsIywkFgaCSdBHitq9TYrfRn6K5xYoY5BTvElnNwDg7AFq5Aj2NLiUNJLT29q7Pe+NCSU9x/KsV3bh56/adjbvte/cfPNzc2n506G3pBA6EVdYdj7lHJQ0OSJLC48Ih12OFR+PTXpM/mqHz0poPNC9wpPnEyFwKToE62fqYKszpC6R9t9d7BvuQ5o6LKu2khOdUFVYa8iAN7PR26nqdJ0tc1Xq5GTaNFH4CVIsvFRN707cjRcB10GyB2jP7JdutzmlRajQkFPd+mMQFjSruSAqFdTstPRZcnPIJDgM0XKMfVQsNatgNTAa5deEZgV7+UfFtfdzPQ6VmtPUr+ca8n+5YUn561ElTVESGnExKC9VuB0aQSGTDgWpeQBcOBl2BTHlQUQKsrfbu5fnTFHNkMIlDg2eCas1N1mV5lxLNc8w56WiILPV/jKlp6b1Zrw1vBgcXCgdEaCVgDTXrfo5M5NFWgbYbPIQyA0i9KaIoO0Myks6YROdy5atkObiXr3lwHhy+6SdxN3r/sHLxZ+rbBnrCnbI8l7BU7YO9Ynw2YD/Zb/aH/Y109DX6Fn2/KI1ayz+P2ZWIfvwD74bvfQ=</latexit><latexit sha1_base64="UfpOiKm2RL8/P6WTnBh3SDiIqYU=">AC63icbVFLbhNBEG0Pv2A+SWDJpoQTKUjEmkFIsIywkFgaCSdBHitq9TYrfRn6K5xYoY5BTvElnNwDg7AFq5Aj2NLiUNJLT29q7Pe+NCSU9x/KsV3bh56/adjbvte/cfPNzc2n506G3pBA6EVdYdj7lHJQ0OSJLC48Ih12OFR+PTXpM/mqHz0poPNC9wpPnEyFwKToE62fqYKszpC6R9t9d7BvuQ5o6LKu2khOdUFVYa8iAN7PR26nqdJ0tc1Xq5GTaNFH4CVIsvFRN707cjRcB10GyB2jP7JdutzmlRajQkFPd+mMQFjSruSAqFdTstPRZcnPIJDgM0XKMfVQsNatgNTAa5deEZgV7+UfFtfdzPQ6VmtPUr+ca8n+5YUn561ElTVESGnExKC9VuB0aQSGTDgWpeQBcOBl2BTHlQUQKsrfbu5fnTFHNkMIlDg2eCas1N1mV5lxLNc8w56WiILPV/jKlp6b1Zrw1vBgcXCgdEaCVgDTXrfo5M5NFWgbYbPIQyA0i9KaIoO0Myks6YROdy5atkObiXr3lwHhy+6SdxN3r/sHLxZ+rbBnrCnbI8l7BU7YO9Ynw2YD/Zb/aH/Y109DX6Fn2/KI1ayz+P2ZWIfvwD74bvfQ=</latexit><latexit sha1_base64="UfpOiKm2RL8/P6WTnBh3SDiIqYU=">AC63icbVFLbhNBEG0Pv2A+SWDJpoQTKUjEmkFIsIywkFgaCSdBHitq9TYrfRn6K5xYoY5BTvElnNwDg7AFq5Aj2NLiUNJLT29q7Pe+NCSU9x/KsV3bh56/adjbvte/cfPNzc2n506G3pBA6EVdYdj7lHJQ0OSJLC48Ih12OFR+PTXpM/mqHz0poPNC9wpPnEyFwKToE62fqYKszpC6R9t9d7BvuQ5o6LKu2khOdUFVYa8iAN7PR26nqdJ0tc1Xq5GTaNFH4CVIsvFRN707cjRcB10GyB2jP7JdutzmlRajQkFPd+mMQFjSruSAqFdTstPRZcnPIJDgM0XKMfVQsNatgNTAa5deEZgV7+UfFtfdzPQ6VmtPUr+ca8n+5YUn561ElTVESGnExKC9VuB0aQSGTDgWpeQBcOBl2BTHlQUQKsrfbu5fnTFHNkMIlDg2eCas1N1mV5lxLNc8w56WiILPV/jKlp6b1Zrw1vBgcXCgdEaCVgDTXrfo5M5NFWgbYbPIQyA0i9KaIoO0Myks6YROdy5atkObiXr3lwHhy+6SdxN3r/sHLxZ+rbBnrCnbI8l7BU7YO9Ynw2YD/Zb/aH/Y109DX6Fn2/KI1ayz+P2ZWIfvwD74bvfQ=</latexit>

➞ If d is the VC dim, number of points to get an ε-sample whp is:

O ⇣ d ✏2 log d ✏ ⌘

<latexit sha1_base64="4GXem9WHxiF+N+iWKvOLpbkqg=">ACynicbVFNaxRBEO0dv+L4tdGjl8YlEGXmSCY4gIHgQjuElgZ16empm/TH0F29Mg5z83f4a7zq3X9jz2YXkqwFDY/3qrqXuW1FA6T5O8gunX7zt17O/fjBw8fPX4y3H16oy3HCbcSGPc+ZACg0TFCjhvLbAVC7hL941+tnS7BOGP0FmxpmilValIzDNR8ePgpOxbVPs1Ky3hbdG0GtRPS6K8Hc2kqbaVrq94OR+OknGyCroN0jUYkXWczHcH37PCcK9AI5fMuWma1DhrmUXBJXRx5h3UjF+wCqYBaqbAzdrVih3dC0xBS2PD0hX7NWKlinGpWHTMVw4W5qPfk/beqxPJy1QtceQfPLRqWXFA3t/aKFsMBRNgEwbkWYlfIFC5ZgcDWO9672WYBcAoZNLGj4xo1STBdtVjIlZFNAybzE4KMrN/jalI7pzZj0vWbhgo6it1roihpNe/m1AytK2mdRZQp4RUMD6t0qBRdgKeilsEb3Joc9N1/G4Vrpzdtsg9ODcZqM089vRkfH67vtkOfkBdknKXlLjsgHckImhJOf5Bf5Tf5EHyMbNVF7mRoN1jXPyLWIfvwDBJniXQ=</latexit><latexit sha1_base64="4GXem9WHxiF+N+iWKvOLpbkqg=">ACynicbVFNaxRBEO0dv+L4tdGjl8YlEGXmSCY4gIHgQjuElgZ16empm/TH0F29Mg5z83f4a7zq3X9jz2YXkqwFDY/3qrqXuW1FA6T5O8gunX7zt17O/fjBw8fPX4y3H16oy3HCbcSGPc+ZACg0TFCjhvLbAVC7hL941+tnS7BOGP0FmxpmilValIzDNR8ePgpOxbVPs1Ky3hbdG0GtRPS6K8Hc2kqbaVrq94OR+OknGyCroN0jUYkXWczHcH37PCcK9AI5fMuWma1DhrmUXBJXRx5h3UjF+wCqYBaqbAzdrVih3dC0xBS2PD0hX7NWKlinGpWHTMVw4W5qPfk/beqxPJy1QtceQfPLRqWXFA3t/aKFsMBRNgEwbkWYlfIFC5ZgcDWO9672WYBcAoZNLGj4xo1STBdtVjIlZFNAybzE4KMrN/jalI7pzZj0vWbhgo6it1roihpNe/m1AytK2mdRZQp4RUMD6t0qBRdgKeilsEb3Joc9N1/G4Vrpzdtsg9ODcZqM089vRkfH67vtkOfkBdknKXlLjsgHckImhJOf5Bf5Tf5EHyMbNVF7mRoN1jXPyLWIfvwDBJniXQ=</latexit><latexit sha1_base64="4GXem9WHxiF+N+iWKvOLpbkqg=">ACynicbVFNaxRBEO0dv+L4tdGjl8YlEGXmSCY4gIHgQjuElgZ16empm/TH0F29Mg5z83f4a7zq3X9jz2YXkqwFDY/3qrqXuW1FA6T5O8gunX7zt17O/fjBw8fPX4y3H16oy3HCbcSGPc+ZACg0TFCjhvLbAVC7hL941+tnS7BOGP0FmxpmilValIzDNR8ePgpOxbVPs1Ky3hbdG0GtRPS6K8Hc2kqbaVrq94OR+OknGyCroN0jUYkXWczHcH37PCcK9AI5fMuWma1DhrmUXBJXRx5h3UjF+wCqYBaqbAzdrVih3dC0xBS2PD0hX7NWKlinGpWHTMVw4W5qPfk/beqxPJy1QtceQfPLRqWXFA3t/aKFsMBRNgEwbkWYlfIFC5ZgcDWO9672WYBcAoZNLGj4xo1STBdtVjIlZFNAybzE4KMrN/jalI7pzZj0vWbhgo6it1roihpNe/m1AytK2mdRZQp4RUMD6t0qBRdgKeilsEb3Joc9N1/G4Vrpzdtsg9ODcZqM089vRkfH67vtkOfkBdknKXlLjsgHckImhJOf5Bf5Tf5EHyMbNVF7mRoN1jXPyLWIfvwDBJniXQ=</latexit><latexit sha1_base64="4GXem9WHxiF+N+iWKvOLpbkqg=">ACynicbVFNaxRBEO0dv+L4tdGjl8YlEGXmSCY4gIHgQjuElgZ16empm/TH0F29Mg5z83f4a7zq3X9jz2YXkqwFDY/3qrqXuW1FA6T5O8gunX7zt17O/fjBw8fPX4y3H16oy3HCbcSGPc+ZACg0TFCjhvLbAVC7hL941+tnS7BOGP0FmxpmilValIzDNR8ePgpOxbVPs1Ky3hbdG0GtRPS6K8Hc2kqbaVrq94OR+OknGyCroN0jUYkXWczHcH37PCcK9AI5fMuWma1DhrmUXBJXRx5h3UjF+wCqYBaqbAzdrVih3dC0xBS2PD0hX7NWKlinGpWHTMVw4W5qPfk/beqxPJy1QtceQfPLRqWXFA3t/aKFsMBRNgEwbkWYlfIFC5ZgcDWO9672WYBcAoZNLGj4xo1STBdtVjIlZFNAybzE4KMrN/jalI7pzZj0vWbhgo6it1roihpNe/m1AytK2mdRZQp4RUMD6t0qBRdgKeilsEb3Joc9N1/G4Vrpzdtsg9ODcZqM089vRkfH67vtkOfkBdknKXlLjsgHckImhJOf5Bf5Tf5EHyMbNVF7mRoN1jXPyLWIfvwDBJniXQ=</latexit>

➞ If d is the VC dim, number of points to get an ε-net whp is:

O ⇣d ✏ log d ✏ ⌘

<latexit sha1_base64="bG12hBMKQYLjA5/zK1nlVMJu+Ls=">ACyHicbVHbahRBEO0dL4njbaOPvjQugQi6zIgYH0NEF+M4CaBnWp7amZbdKXobtmwzjsi9/h1/iqP+Df2LPZhdwKGg6nTnVnZpVSnpKkn+96M7de/e3th/EDx89fvK0v/Ps2NvaCRwJq6w7nYFHJQ2OSJLC08oh6JnCk9nZxy5/skDnpTXfqalwoqE0spACKFDT/vuv2aEs93hWOBtvmwzrLxU1ix5pmx5C9/pX037g2SYrILfBOkaDNg6jqY7vR9ZbkWt0ZBQ4P04TSqatOBICoXLOKs9ViDOoMRxgAY0+km7WnDJdwOT8K68AzxFXu5ogXtfaNnQamB5v56riNvy41rKj5MWmqmtCIi0ZFrThZ3rnFc+lQkGoCAOFkmJWLOQRLKHgax7uX+8xRLZDCJg4NngurNZi8zQrQUjU5FlArCj76YoOvTOnBbMbknwyE+3lOtTPSlNwa3qXfeHSy4J2Ka5vjax4a8NqvJDRHx9EspLOmMznsufkyDtdKr9/mJjh+O0yTYfrt3eDgcH23bfaCvWR7LGX7IB9ZkdsxAT7xX6zP+xv9CWqovOouZBGvXNc3Ylop/AVtr4bk=</latexit><latexit sha1_base64="bG12hBMKQYLjA5/zK1nlVMJu+Ls=">ACyHicbVHbahRBEO0dL4njbaOPvjQugQi6zIgYH0NEF+M4CaBnWp7amZbdKXobtmwzjsi9/h1/iqP+Df2LPZhdwKGg6nTnVnZpVSnpKkn+96M7de/e3th/EDx89fvK0v/Ps2NvaCRwJq6w7nYFHJQ2OSJLC08oh6JnCk9nZxy5/skDnpTXfqalwoqE0spACKFDT/vuv2aEs93hWOBtvmwzrLxU1ix5pmx5C9/pX037g2SYrILfBOkaDNg6jqY7vR9ZbkWt0ZBQ4P04TSqatOBICoXLOKs9ViDOoMRxgAY0+km7WnDJdwOT8K68AzxFXu5ogXtfaNnQamB5v56riNvy41rKj5MWmqmtCIi0ZFrThZ3rnFc+lQkGoCAOFkmJWLOQRLKHgax7uX+8xRLZDCJg4NngurNZi8zQrQUjU5FlArCj76YoOvTOnBbMbknwyE+3lOtTPSlNwa3qXfeHSy4J2Ka5vjax4a8NqvJDRHx9EspLOmMznsufkyDtdKr9/mJjh+O0yTYfrt3eDgcH23bfaCvWR7LGX7IB9ZkdsxAT7xX6zP+xv9CWqovOouZBGvXNc3Ylop/AVtr4bk=</latexit><latexit sha1_base64="bG12hBMKQYLjA5/zK1nlVMJu+Ls=">ACyHicbVHbahRBEO0dL4njbaOPvjQugQi6zIgYH0NEF+M4CaBnWp7amZbdKXobtmwzjsi9/h1/iqP+Df2LPZhdwKGg6nTnVnZpVSnpKkn+96M7de/e3th/EDx89fvK0v/Ps2NvaCRwJq6w7nYFHJQ2OSJLC08oh6JnCk9nZxy5/skDnpTXfqalwoqE0spACKFDT/vuv2aEs93hWOBtvmwzrLxU1ix5pmx5C9/pX037g2SYrILfBOkaDNg6jqY7vR9ZbkWt0ZBQ4P04TSqatOBICoXLOKs9ViDOoMRxgAY0+km7WnDJdwOT8K68AzxFXu5ogXtfaNnQamB5v56riNvy41rKj5MWmqmtCIi0ZFrThZ3rnFc+lQkGoCAOFkmJWLOQRLKHgax7uX+8xRLZDCJg4NngurNZi8zQrQUjU5FlArCj76YoOvTOnBbMbknwyE+3lOtTPSlNwa3qXfeHSy4J2Ka5vjax4a8NqvJDRHx9EspLOmMznsufkyDtdKr9/mJjh+O0yTYfrt3eDgcH23bfaCvWR7LGX7IB9ZkdsxAT7xX6zP+xv9CWqovOouZBGvXNc3Ylop/AVtr4bk=</latexit><latexit sha1_base64="bG12hBMKQYLjA5/zK1nlVMJu+Ls=">ACyHicbVHbahRBEO0dL4njbaOPvjQugQi6zIgYH0NEF+M4CaBnWp7amZbdKXobtmwzjsi9/h1/iqP+Df2LPZhdwKGg6nTnVnZpVSnpKkn+96M7de/e3th/EDx89fvK0v/Ps2NvaCRwJq6w7nYFHJQ2OSJLC08oh6JnCk9nZxy5/skDnpTXfqalwoqE0spACKFDT/vuv2aEs93hWOBtvmwzrLxU1ix5pmx5C9/pX037g2SYrILfBOkaDNg6jqY7vR9ZbkWt0ZBQ4P04TSqatOBICoXLOKs9ViDOoMRxgAY0+km7WnDJdwOT8K68AzxFXu5ogXtfaNnQamB5v56riNvy41rKj5MWmqmtCIi0ZFrThZ3rnFc+lQkGoCAOFkmJWLOQRLKHgax7uX+8xRLZDCJg4NngurNZi8zQrQUjU5FlArCj76YoOvTOnBbMbknwyE+3lOtTPSlNwa3qXfeHSy4J2Ka5vjax4a8NqvJDRHx9EspLOmMznsufkyDtdKr9/mJjh+O0yTYfrt3eDgcH23bfaCvWR7LGX7IB9ZkdsxAT7xX6zP+xv9CWqovOouZBGvXNc3Ylop/AVtr4bk=</latexit>

The set of samples drawn from X is an ε-net iff for all C in 𝓓:

Pr(C) ≥ ✏ ⇒ C contains a sample

<latexit sha1_base64="GLJ0wq3GFBIt6qZvzQ86d7jvkM=">AC0HicbVFbaxNBFJ6st7reUn305WBaqKBhVwR9kmIQfIzFtIVsCJPZs8nQuawzZ9PGJYiv/g5/ja/64r9xNk2gFw8MfHzfmXP7JqWSnpLkbyu6cfPW7Ttbd+N79x8fNTefnzobeUEDoRV1h1PuEclDQ5IksLj0iHXE4VHk5Neox/N0XlpzWdalDjSfGpkIQWnQI3b7K+g73ec8im+AUyL1U1kB2IKcz4s7ZU8gIz6je6e2AsIa4NB4eK5Lhctxu5N0k1XAdZCuQYetoz/ebn3NcisqjYaE4t4P06SkUc0dSREKxlnlseTihE9xGKDhGv2oXi26hN3A5FBYF54hWLEXf9Rce7/Qk5CpOc38Va0h/6cNKyrejmpyorQiPNGRaWALDRXg1w6FKQWAXDhZJgVxIw7LijcNo53L/aZoZojhU0cGjwVmtu8joruJZqkWPBK0XLOvPFBl+a0nOzGRM+GB589ECVM9JMIRjTyC89OlAkwXa5vgCQgOo/CqFZugAzVw6a5ojhz03JePgVnrVm+vg8FU3Tbrp9ed/fdr37bYU/aM7bGUvWH7CPrswET7Cf7xX6zP9FBdBZ9i76fp0at9Z8n7FJEP/4Bzr/jVw=</latexit><latexit sha1_base64="GLJ0wq3GFBIt6qZvzQ86d7jvkM=">AC0HicbVFbaxNBFJ6st7reUn305WBaqKBhVwR9kmIQfIzFtIVsCJPZs8nQuawzZ9PGJYiv/g5/ja/64r9xNk2gFw8MfHzfmXP7JqWSnpLkbyu6cfPW7Ttbd+N79x8fNTefnzobeUEDoRV1h1PuEclDQ5IksLj0iHXE4VHk5Neox/N0XlpzWdalDjSfGpkIQWnQI3b7K+g73ec8im+AUyL1U1kB2IKcz4s7ZU8gIz6je6e2AsIa4NB4eK5Lhctxu5N0k1XAdZCuQYetoz/ebn3NcisqjYaE4t4P06SkUc0dSREKxlnlseTihE9xGKDhGv2oXi26hN3A5FBYF54hWLEXf9Rce7/Qk5CpOc38Va0h/6cNKyrejmpyorQiPNGRaWALDRXg1w6FKQWAXDhZJgVxIw7LijcNo53L/aZoZojhU0cGjwVmtu8joruJZqkWPBK0XLOvPFBl+a0nOzGRM+GB589ECVM9JMIRjTyC89OlAkwXa5vgCQgOo/CqFZugAzVw6a5ojhz03JePgVnrVm+vg8FU3Tbrp9ed/fdr37bYU/aM7bGUvWH7CPrswET7Cf7xX6zP9FBdBZ9i76fp0at9Z8n7FJEP/4Bzr/jVw=</latexit><latexit sha1_base64="GLJ0wq3GFBIt6qZvzQ86d7jvkM=">AC0HicbVFbaxNBFJ6st7reUn305WBaqKBhVwR9kmIQfIzFtIVsCJPZs8nQuawzZ9PGJYiv/g5/ja/64r9xNk2gFw8MfHzfmXP7JqWSnpLkbyu6cfPW7Ttbd+N79x8fNTefnzobeUEDoRV1h1PuEclDQ5IksLj0iHXE4VHk5Neox/N0XlpzWdalDjSfGpkIQWnQI3b7K+g73ec8im+AUyL1U1kB2IKcz4s7ZU8gIz6je6e2AsIa4NB4eK5Lhctxu5N0k1XAdZCuQYetoz/ebn3NcisqjYaE4t4P06SkUc0dSREKxlnlseTihE9xGKDhGv2oXi26hN3A5FBYF54hWLEXf9Rce7/Qk5CpOc38Va0h/6cNKyrejmpyorQiPNGRaWALDRXg1w6FKQWAXDhZJgVxIw7LijcNo53L/aZoZojhU0cGjwVmtu8joruJZqkWPBK0XLOvPFBl+a0nOzGRM+GB589ECVM9JMIRjTyC89OlAkwXa5vgCQgOo/CqFZugAzVw6a5ojhz03JePgVnrVm+vg8FU3Tbrp9ed/fdr37bYU/aM7bGUvWH7CPrswET7Cf7xX6zP9FBdBZ9i76fp0at9Z8n7FJEP/4Bzr/jVw=</latexit><latexit sha1_base64="GLJ0wq3GFBIt6qZvzQ86d7jvkM=">AC0HicbVFbaxNBFJ6st7reUn305WBaqKBhVwR9kmIQfIzFtIVsCJPZs8nQuawzZ9PGJYiv/g5/ja/64r9xNk2gFw8MfHzfmXP7JqWSnpLkbyu6cfPW7Ttbd+N79x8fNTefnzobeUEDoRV1h1PuEclDQ5IksLj0iHXE4VHk5Neox/N0XlpzWdalDjSfGpkIQWnQI3b7K+g73ec8im+AUyL1U1kB2IKcz4s7ZU8gIz6je6e2AsIa4NB4eK5Lhctxu5N0k1XAdZCuQYetoz/ebn3NcisqjYaE4t4P06SkUc0dSREKxlnlseTihE9xGKDhGv2oXi26hN3A5FBYF54hWLEXf9Rce7/Qk5CpOc38Va0h/6cNKyrejmpyorQiPNGRaWALDRXg1w6FKQWAXDhZJgVxIw7LijcNo53L/aZoZojhU0cGjwVmtu8joruJZqkWPBK0XLOvPFBl+a0nOzGRM+GB589ECVM9JMIRjTyC89OlAkwXa5vgCQgOo/CqFZugAzVw6a5ojhz03JePgVnrVm+vg8FU3Tbrp9ed/fdr37bYU/aM7bGUvWH7CPrswET7Cf7xX6zP9FBdBZ9i76fp0at9Z8n7FJEP/4Bzr/jVw=</latexit>
slide-10
SLIDE 10

Example: learning range queries

10

1 N Suppose we know the value of some records in the database (with uniformly random values). + we have access pattern leakage. We want to approximately learn queries in the sense: for every query we want to know its endpoints within εN. Q: How many known records do we need? A: This is an ε-net. X = values [1,N] 𝓓 = ranges so we need O(1/ε log 1/ε) known samples.

slide-11
SLIDE 11

Example continued

11

1 N So this was an ε-net ➞ we need O(1/ε log 1/ε) known samples. Q: How about if we add complements? Multi-dimensional ranges? etc. A: Actually we don’t care. All these things have finite VC dim. In fact this is actually PAC learning. PAC = Probably Approximately Correct.

slide-12
SLIDE 12

Database reconstuction

slide-13
SLIDE 13

Basic KKNO16 attack variant

13

1 N Less probable More probable Assume uniformly distributed range queries. Idea: count #times record is hit ➞ estimate probability it’s hit ➞ deduce its value Fact: to correctly deduce all values within δN you need to correctly estimate all probabilities within ε = δ2.

slide-14
SLIDE 14

Basic KKNO16 attack variant

14

1 N …so we need to estimate the probability of each value being hit, all within ε = δ2… This is an ε-sample. X = ranges 𝓓 ={{ranges ∋ x}: x ∈ [1,N]} so we need O(1/ε2 log 1/ε) known samples.

slide-15
SLIDE 15

Approximate KKNO attack

15

With uniformly distributed queries: All values are in the database are recovered within δN after

  • bserving the access pattern of O(1/δ4 log 1/δ) queries.

Remarks:

  • KKNO16: N4 log N ➞ Kenny’s talk: N log N with density

➞ this: O(1) for approximate reconstruction within 5%…

  • Setting δ = 1/N recovers KKNO’s attack.
  • Lower bound of Ω(1/δ4).
  • Direct application of VC theory.
slide-16
SLIDE 16

Extensions of this approach

16

In fact O(1/δ2 log 1/δ) queries suffice under very reasonable assumptions. e.g. there exists record in DB with value within [N/8,3N/8]. Other query types:

  • Prefix queries on strings, wildcard queries, etc.
  • “Meta-theorem”: all these have finite VC dim…
  • This is WIP

. One limitation:

  • VC theory gives bad constants.

It says something of general behavior. Need experiments.

slide-17
SLIDE 17

Limitation of previous result

17

So far we are assuming uniformly distributed queries. This is not just an assumption about adversarial knowledge. This is an assumption that queries are independent identically distributed (i.i.d.). This is quite unrealistic. What can you learn without that hypothesis?

slide-18
SLIDE 18

PQ trees

slide-19
SLIDE 19

PQ trees

19

X: linearly ordered set. Order is unknown. You are given a set S containing some some intervals in X. A PQ tree is a compact (linear in |X|) representation of the set

  • f all permutations of X that are compatible with S.

As new sets are added to S, the PQ tree can be updated in linear time. Was used in DR13, didn’t target reconstruction.

slide-20
SLIDE 20

PQ trees

20

X = {a, b, c, d, e} P a b c = any permutation of {a, b, c}. a b c Q = ‘abc’ or ‘cba’. P d e a b c Q = ‘abc’ or ‘cba’, with ‘d’ and ‘e’ permuted in any way on either side. i.e. ‘abcde’, ‘abced’, ‘dabce’, ‘eabcd’, ‘deabc’, ‘edabc’, ‘cbade’ etc.

slide-21
SLIDE 21

… …

Database order reconstruction

21

P Q No information r1 r2 r3 … … r1 r2 r3 Full reconstruction N log N queries LMP18 (aka Kenny’s talk) reinterpreted: you fully recover

  • rder information with O(N log N) queries.

Density not required. Density was only to convert from order to values.

slide-22
SLIDE 22

… …

Approximate order reconstruction

22

P Q No information r1 r2 r3 … … r1 r2 r3 Full reconstruction … … Q Interval ≤ δN … … … Approximate (order) reconstruction = full order reconstruction, except for values that are very close (less than δN apart). Approximate reconstruction

slide-23
SLIDE 23

… …

Approximate order reconstruction

23

P Q No information r1 r2 r3 … … r1 r2 r3 Full reconstruction … … Q Interval ≤ δN … … … Approximate reconstruction N log N queries 1/δ log 1/δ queries The proof uses an ε-net…

slide-24
SLIDE 24

Converting from order to values

24

Known (approximation of) database value distribution ➞ frequency matching. Known (approximation of) query distribution, see previous attack. Some known records ➞ order allows to compare records to known values. …

slide-25
SLIDE 25

Some history

25

OPE/ORE were developed to allow range queries. Leak order by design. Led to devastating leakage-abuse attacks GSB+17, DDC16. Second-generation schemes eschew ORE to enable range queries without leaking order. We just saw access pattern leaks order… So if you leak access pattern it’s back to square one! (Difference: OPE/ORE attacks only required a snapshot adversary, now we need access pattern leakage.)

slide-26
SLIDE 26

Features of the approximate order attack

26

It is fully general:

  • Does not rely on i.i.d. queries.
  • No density assumption.
  • No dependency on N (for approximate order).

Also…

  • Only O(1/δ log 1/δ) queries!
  • Setting δ=1/N recovers LMP18. Without requiring density.
  • Not “all or nothing”: precision improves with #queries.
slide-27
SLIDE 27

Conclusion

27

Introduced approximate reconstruction. Leads to very powerful attacks. Approximate order attack is very efficient with truly minimal assumption. Clarifies the setting. Two techniques prove very potent in this setting:

  • VC theory.
  • PQ trees.

VC theory extends to other query classes (under investigation).