/32
Analog-to-Digital Compression
Oral PhD Exam Alon Kipnis
Fundamental performance limits of
Advisor: Andrea Goldsmith
1
Analog-to-Digital Compression Oral PhD Exam Alon Kipnis Advisor: - - PowerPoint PPT Presentation
Fundamental performance limits of Analog-to-Digital Compression Oral PhD Exam Alon Kipnis Advisor: Andrea Goldsmith 1 /32 Outline analog digital quantization 010010011001001000 sampling (lossy compression) 0100101010010001
/32
Oral PhD Exam Alon Kipnis
Fundamental performance limits of
Advisor: Andrea Goldsmith
1
/32
Motivation — Factors affecting analog-to-digital conversion
Main problem — Combined problem sampling and lossy compression
Corollary — Optimal sampling under compression constraints Summary — Toward a unified spectral theory of analog signal processing and lossy compression
2
sampling
analog
quantization (lossy compression)
010010011001001000 0100101010010001…
digital
/32
010010011001 001000010000 1000100111…
information loss
A/D conversion
The analog-to-digital (A/D) conversion problem:
3
/32
A/D parameters
Minimal distortion in A/D:
4
analog
010010011001001 000010010101001
digital
reconstruction
analog
quantization (lossy compression)
sampling
/32
5
sampling
quantization (lossy compression)
digital analog
. . .
0 . . . 00 0 . . . 01
1 . . . 11
RT
. . .
T X(t)
R
bitrate:
[bits/sec]
The Source Coding Theorem [Shannon ‘48]:
D(R)
Shannon’s distortion-rate function
=
Theoretic lower bound for distortion in A/D Ignores effect of sampling
=
probability distributions
reconstruction
analog
Enc
/32
fs
The Sampling Theorem [Whittaker, Kotelinkov, Shannon]:
distortion
sampling rate fs
t
X(t)
fs > fNyq , 2fB
t
sinc(t)
6
fNyq = 2fB
Y [n] = X(t/fs)
Ignores effect of quantization
sampling
quantization (lossy compression)
digital analog
Shannon’s distortion-rate function
D(R)
/32
7
sampling
compression
digital
analog
D(fs, R)
=
Minimal distortion under sampling and lossy compression
distortion
sampling rate fs
u n l i m i t e d b i t r a t e
Shannon’s distortion-rate function
D(R)
unlimited sampling rate
fNyq
/32
Can we attain D(R) by sampling below Nyquist ?
The Sampling Theorem [Whittaker, Kotelinkov, Shannon]
t X(t)
Y [n] = X(t/fs)
fs > fNyq , 2fB t
sinc(t)
8
“we are not interested in exact transmission when we have a continuous [amplitude] source, but only in transmission to within a given tolerance” [Shannon ’48]
D(fs, R)
=
distortion
sampling rate fs
D(R)
fNyq = 2fB
/32
2) Can we attain D(R) by sampling below Nyquist ?
9
sampling rate fs
fNyq = 2fB
D(R)
D(fs, R)
=
1) What is the minimal distortion in sampling and lossy compression?
u n l i m i t e d b i t r a t e
unlimited sampling rate
/32
, inf
enc−dec,T
1 T Z T E ⇣ X(t) − b X(t) ⌘2 dt D(fs, R)
is zero mean Gaussian stationary with PSD
X(t) SX(f)
SX(f)
f
SX(f)
Y [n] = X(n/fs)
10
sampling lossy compression
reconstruction
Enc Dec
fs Y [·]
X(t) b X(t) R
/32
Enc Dec
Y [·]
fs > fNyq
fs
⇒ = D(R) D(fs, R)
[Pinsker ’54]
Dθ(R) = Z ∞
−∞
min {SX(f), θ} d f Rθ = 1 2 Z ∞
−∞
log+ [SX(f)/θ] d f
θ
SX(f) f
θ
SX(f) f
R
D(R)
(water-filling) X(t) b X(t)
11
R
/32
Y [·]
Enc
R
fs
R → ∞ ⇒ mmse(X|Y ) = D(fs, R) mmse(fs) =
MMSE in sub-Nyquist sampling [Chan & Donaldson ‘71, Matthews ’00]
X
k∈Z
SX(f − fsk)
e SX|Y (f) = P
k S2 X(f − fsk)
P
k SX(f − fsk)
fs
f
SX(f)
SX(f − fs)
S
X
( f + f
s
)
b X(t)
Dec
X(t)
12
/32
e SX|Y (f)
f
fs
Distortion due to sampling Distortion due to bitrate constraint
Theorem*[K., Goldsmith, Eldar, Weissman ‘13]
D(fs, R) mmse(fs) = + WF ⇣ e SX|Y ⌘
(*) A. Kipnis, A. J. Goldsmith, T. Weissman and
Nyquist sampled Gaussian sources corrupted by noise’, Allerton 2013
13
Enc Dec
fs Y [·]
R X(t) b X(t)
/32
D(fs, R)
f
SX(f)
fB
distortion
fs
D(R)
D(fs, R) vs fs (R = 1)
mmse(fs)
fNyq = 2fB
14
/32
Enc Dec
fs Y [·]
D(fs, R) = + Y [·]
estimator
E [X(t)|Y [·]]
e X(·)
Enc Enc Enc
mmse(fs) WF ⇣ e SX|Y ⌘
transformation
e X∆2[·]
e X∆k[·]
e X∆1[·]
. . .
*
15
(*) A. Kipnis, A. J. Goldsmith and
Gaussian processes’, (under review) 2016
R X(t) b X(t)
X
i
Ri ≤ R R1
R2
Rk
/32
Enc Dec
fs
H(f)
e SX|Y (f)
fs
θ
without pre-sampling filter
Linear pre-processing can reduce distortion
fs
θ
with pre-sampling filter
e SX|Y (f)
fs D(R) distortion
16
b X(t) X(t) R
H(f) ≡ 1
H(f)
/32
Theorem* [K., Goldsmith, Eldar, Weissman ’14] The optimal pre-sampling filter is (i) anti-aliasing (ii) maximizes passband energy
(*) A. Kipnis, A. J. Goldsmith,
sampled Gaussian sources’, IEEE Trans. on Information Theory, January, 2016 SX(f)
fs
θ
H?(f)
SX(f)
fs
θ
H?(f)
no aliasing
D?(fs, R) = mmse?(fs) + WF ⇣ |H?|2 SX ⌘
17
/32
(*) A. Kipnis, A. J. Goldsmith,
sampled Gaussian sources’, IEEE Trans. on Information Theory, January, 2016
Theorem* [K., Goldsmith, Eldar, Weissman ’14] The optimal pre-sampling filter is (i) anti-aliasing (ii) maximizes passband energy fs
f
low-pass is optimal
fs fs fs
f
maximal aliasing-free set is optimal
18
/32
X1 ∼ N
1
2
X2 + = Y h1 h2
fs fs fs f
1(σ1 > σ2) 1(σ1 < σ2)
= = ∗ ∗ Answer:
19
h1 h2
argmin ? = Question: {mmse(X1|Y ) + mmse(X2|Y )} mmse(Xi|Y ) = E (Xi − E[Xi|Y ])2
fs
/32
D?(R, fs) vs fs
D(R)
fNyq
fR
mmse(fs)
θ
fs
θ
fs
θ
fs
fs
distortion
(R is fixed) Sub-Nyquist sampling achieves optimal distortion-rate performance
D?(fs, R) = D(R) fs ≥ fR
SX(f)
20
/32
Theorem* [K., Goldsmith, Eldar ’15]
D?(fs, R) = D(R) fs ≥ fR
θ
fR
(+) A. Kipnis, A. J. Goldsmith and
Nyquist nonuniform sampling’, Allerton 2014
Extends Kotelnikov-Whittaker-Shannon sampling theorem:
Incorporates lossy compression Valid when input signal is not band limited
Alignment of degrees of freedom
Holds under non-uniform sampling
+
(*) A. Kipnis, A. J. Goldsmith and
Information Theory Workshop (ITW), 2015
21
/32
R 1 fR fNyq
critical sub-sampling ratio vs R
22
/32
23
/32
Degrees of freedom alignment in other sampling models ?
24
Enc Dec
{0, 1}nR b X
Example: compressed sensing
X Y sampler ∈ Rn
/32
(*) A. Kipnis, A. J. Goldsmith, T. Weissman and
Nyquist sampled Gaussian sources corrupted by noise’, Allerton 2013
25
Enc Dec
fs
sampler
Y [·]
X(t) b X(t)
H(f)
+
η(t)
R
Theorem*[K. Goldsmith, Weissman, Eldar ’13]
∗
fs
θ e SX|Y (f)
P
k S2 X (f − fsk) |H(f − fsk)|2
P
k (SX(f − fsk) + Sη (f − fsk)) |H(f − fsk)|2
=
sampling quantization (lossy compression) digital
analog
noise
/32
e SX|Y (f)
fs
θ
H(f)
+
η(t)
Enc Dec
26
Lossy compression Sampling Linear filtering
Does not incorporate time-flow
/32
Incorporating time-flow and lossy compression
[Kolmogorov ’56]: “Since a function with a bounded spectrum is always singular in the sense of my work and the
not related … to the stationary flow
not remain completely clear”
θ
SX(f)
f
27
Example: minimal distortion in causal estimation under bitrate constraint
X(t) t past future
/32
[3] A. Kipnis, A. J. Goldsmith and
processes’, (under review) 2016 [2] A. Kipnis,
compression’, (under review) 2015 [1] A. Kipnis, A. J. Goldsmith,
sampled Gaussian sources’, IEEE Trans. on Information Theory, January, 2016
function under sub-Nyquist nonuniform sampling’, Allerton 2015
function of sub-Nyquist sampled Gaussian sources corrupted by noise’, Allerton 2013
Analog-to-digital compression:
sampled Wiener processes’, ISIT 2016
sampling rate and quantization precision in Sigma-Delta A/D conversion’, SampTA 2015
Between Sampling Rate and Quantization Precision in A/D conversion’, Allerton 2015
28
/32
Lossy source coding:
[4] A. Kipnis, A. J. Goldsmith and
Gaussian processes’, (under review) 2016 [5] A. Kipnis, S. Rini and A. J. Goldsmith, ‘Multiterminal compress-and-estimate rate-distortion’, (in progress)
stationary Gaussian processes’, ISIT 2014
function of a binary i.i.d source’, ITW 2015
estimate source coding’, ISIT 2016
29
/32
Yonina Eldar Tsachy Weissman
WSLers and ISLers Stefano Yuxin Milind Mainak Alexandros Nima Mahnoosh Yonathan Nariman Jiantao Kartik Idoia Miguel
30
/32
31
/32
e SX|Y (f)
fs
θ
32
D?(R, fs) vs fs
mmse(fs)
fs
distortion
D(R)
fNyq
fR
/32
[Berger ’68]: Joint typicality with respect to continuous-time waveform [Yaglom-Pinsker ‘57, Gallager ’68]: Karhunen–Loève transform [Shannon ’49]: Degrees of freedom = time X bandwidth [Berger ’71, Neuhoff & Pradhan 2013]: Analog distortion-rate function by discrete-time time approximations
X(t)
fs
Sampler
Y [n] continuous-time discrete-time
LTI
Constrained by hardware Constrained in bandwidth Modelling constraint
33
/32
digital
Remote source coding [Dubroshin&Tsybakov ’62, Wolf&Ziv ’70]:
Enc Dec
X(0 : T)
information source reconstruction
b X(0 : T)
fs
Sampler Y [·]
M ∈ {0, 1}bT Rc
(*) A. Kipnis, A. J. Goldsmith and
Gaussian Processes’, (under review) 2016
cyclo-stationary∗ e n c
e c
D(fs, R)
e n c
e c
D(R)
X(t)
sampling
Y [n]
b X(t)
enc-dec
D
e X
( R )
b e X(t)
kx b xk2
T
m m s e ( f
s
)
estimate
e X(t) = E [X(t)|Y [·]]
Decomposition:
=
+
mmse(fs)
D e
X(R)
D(fs, R)
34
/32
(analog-to-digital compression for the Wiener process)
X(t) EX(t)X(s) = min{t, s}
Dec Enc
fs
M ∈ {0, 1}bT Rc
YT [n] = XT (n/fs)
X(0 : T) b X(0 : T)
T
S(t)
Z t dS(t) S(t) = µt + σX(t)
35
/32
digital
(*) A. Kipnis, A. J. Goldsmith and
Processes’, (under review) 2016
Theorem*
“Estimate-and-compress” is optimal for the Wiener process
e n c
e c
D(fs, R)
e n c
e c
D(R)
X(t)
sampling
Y [n]
b X(t)
=
+
mmse(fs)
D e
X(R)
D(fs, R)
enc-dec
D e
X(R)
b e X(t) e X(t)
m m s e ( f
s
)
estimate
≤
D(R) D(R)
[Berger ’70]:
= 2 π2 ln 2R−1
kx b xk2
T
enc-dec DY ( ¯ R)
b Y [n]
estimate
36
/32
(*) A. Kipnis, A. J. Goldsmith and
Processes’, (under review) 2016
Theorem*
D(fs, R) = mmse(fs) + 1 fs Z 1 min n e S(φ), θ
R(θ) = fs 2 Z 1 log+ h e S(φ)/θ i dφ
e S(φ) = 1 4 sin2(πφ/2) − 1 6
mmse(fs) = 1 6fs
θ φ
1
37
/32
38
fs=1 fixed
distortion
1 6fs
mmse(fs)
D ( f
s
, R )
D(R)
/32
t1
t2
t3 t4
t5 t6
t7
· · · · · · Λ
Sampler
X(·)
Y [n] = X(tn)
tn ∈ Λ
h(t, τ)
Theorem*
D? d−(Λ), R
(*) A. Kipnis,
Compression’, (under review) 2015
d−(Λ) = lim
r→∞ inf u∈R
|Λ ∩ [u, u + r)| r
is the lower Beurling density of Λ
Nonuniform sampling cannot improve over uniform
[Landau ’67]: necessary and sufficient condition for zero interpolation error:
d−(Λ) ≥ µ(suppSX)
39
/32
digital analog
X(t)
anti-aliasing filter
XLP F (t)
fs
Y [n] = XLP F (n/fs)
¯ R-bit quantizer
YQ[n] estimator b X(t)
DP CM(fs, R) = 1 T Z T E ⇣ X(t) − b X(t) ⌘2 dt
(*) A. Kipnis,
Compression’, (under review) 2015
Theorem* (stationary input, linear estimation)
DP CM(fs, R) = mmse(X|Y ) + DQ( ¯ R, fs)
DQ( ¯ R, fs) = c0 2fB fs σ22−2 ¯
R
mmse(X|Y ) = σ2 − Z
fs 2
− fs
2
SX(f)d f
R = ¯ Rfs
40
/32
DP CM(fs, R) = mmse(X|Y ) + DQ( ¯ R, fs)
DQ( ¯ R, fs) = c0 2fB fs σ22−2 ¯
R
mmse(X|Y ) = σ2 − Z
fs 2
− fs
2
SX(f)d f
Optimal sampling rate in PCM is smaller than Nyquist (!)
(R is fixed) DP CM(fs, R)
fs fNyq
1
σ2
distortion
41