The Kikuchi Hierarchy and Tensor PCA
Alex Wein
Courant Institute, NYU
Joint work with: Ahmed El Alaoui
Stanford
Cris Moore
Santa Fe Institute
1 / 19
The Kikuchi Hierarchy and Tensor PCA Alex Wein Courant Institute, - - PowerPoint PPT Presentation
The Kikuchi Hierarchy and Tensor PCA Alex Wein Courant Institute, NYU Joint work with: Ahmed El Alaoui Cris Moore Stanford Santa Fe Institute 1 / 19 Statistical Physics of Inference High-dimensional inference problems: compressed
1 / 19
2 / 19
2 / 19
2 / 19
◮ Known/believed to be optimal in many settings 2 / 19
◮ Known/believed to be optimal in many settings ◮ Sharp results: exact MMSE, phase transitions 2 / 19
◮ Known/believed to be optimal in many settings ◮ Sharp results: exact MMSE, phase transitions
a ’11, Lesieur-Krzakala-Zdeborov´ a ’15] 2 / 19
◮ Known/believed to be optimal in many settings ◮ Sharp results: exact MMSE, phase transitions
a ’11, Lesieur-Krzakala-Zdeborov´ a ’15]
2 / 19
3 / 19
3 / 19
3 / 19
3 / 19
3 / 19
3 / 19
3 / 19
3 / 19
4 / 19
5 / 19
5 / 19
5 / 19
5 / 19
6 / 19
6 / 19
6 / 19
6 / 19
6 / 19
6 / 19
7 / 19
7 / 19
7 / 19
7 / 19
7 / 19
[Hopkins-Shi-Steurer ’15, Hopkins-Kothari-Potechin-Raghavendra-Schramm-Steurer ’17] 7 / 19
[Hopkins-Shi-Steurer ’15, Hopkins-Kothari-Potechin-Raghavendra-Schramm-Steurer ’17]
7 / 19
8 / 19
[Raghavendra-Rao-Schramm ’16, Bhattiprolu-Guruswami-Lee ’16] 8 / 19
[Raghavendra-Rao-Schramm ’16, Bhattiprolu-Guruswami-Lee ’16]
8 / 19
[Raghavendra-Rao-Schramm ’16, Bhattiprolu-Guruswami-Lee ’16]
8 / 19
[Raghavendra-Rao-Schramm ’16, Bhattiprolu-Guruswami-Lee ’16]
8 / 19
[Raghavendra-Rao-Schramm ’16, Bhattiprolu-Guruswami-Lee ’16]
8 / 19
9 / 19
9 / 19
9 / 19
[Barak-Hopkins-Kelner-Kothari-Moitra-Potechin ’16, Hopkins-Steurer ’17, Hopkins-Kothari-Potechin-Raghavendra-Schramm-Steurer ’17, Hopkins PhD thesis ’18] 9 / 19
[Barak-Hopkins-Kelner-Kothari-Moitra-Potechin ’16, Hopkins-Steurer ’17, Hopkins-Kothari-Potechin-Raghavendra-Schramm-Steurer ’17, Hopkins PhD thesis ’18]
9 / 19
[Barak-Hopkins-Kelner-Kothari-Moitra-Potechin ’16, Hopkins-Steurer ’17, Hopkins-Kothari-Potechin-Raghavendra-Schramm-Steurer ’17, Hopkins PhD thesis ’18]
9 / 19
[Barak-Hopkins-Kelner-Kothari-Moitra-Potechin ’16, Hopkins-Steurer ’17, Hopkins-Kothari-Potechin-Raghavendra-Schramm-Steurer ’17, Hopkins PhD thesis ’18]
9 / 19
[Barak-Hopkins-Kelner-Kothari-Moitra-Potechin ’16, Hopkins-Steurer ’17, Hopkins-Kothari-Potechin-Raghavendra-Schramm-Steurer ’17, Hopkins PhD thesis ’18]
9 / 19
10 / 19
◮ Analogous to SoS hierarchy 10 / 19
◮ Analogous to SoS hierarchy
◮ Both for poly-time and for subexponential-time tradeoff 10 / 19
◮ Analogous to SoS hierarchy
◮ Both for poly-time and for subexponential-time tradeoff
10 / 19
◮ Analogous to SoS hierarchy
◮ Both for poly-time and for subexponential-time tradeoff
10 / 19
◮ Analogous to SoS hierarchy
◮ Both for poly-time and for subexponential-time tradeoff
10 / 19
◮ Analogous to SoS hierarchy
◮ Both for poly-time and for subexponential-time tradeoff
10 / 19
11 / 19
11 / 19
11 / 19
11 / 19
11 / 19
11 / 19
11 / 19
12 / 19
12 / 19
12 / 19
12 / 19
12 / 19
12 / 19
12 / 19
12 / 19
13 / 19
a ’14] 13 / 19
a ’14]
13 / 19
a ’14]
13 / 19
a ’14]
13 / 19
a ’14]
13 / 19
a ’14]
13 / 19
a ’14]
13 / 19
a ’14]
13 / 19
14 / 19
14 / 19
14 / 19
14 / 19
14 / 19
14 / 19
15 / 19
ℓ) where vS is an estimate of xS :=
15 / 19
ℓ) where vS is an estimate of xS :=
15 / 19
ℓ) where vS is an estimate of xS :=
15 / 19
16 / 19
16 / 19
16 / 19
16 / 19
16 / 19
17 / 19
17 / 19
17 / 19
[Raghavendra-Rao-Schramm ’16, Bhattiprolu-Guruswami-Lee ’16] 17 / 19
[Raghavendra-Rao-Schramm ’16, Bhattiprolu-Guruswami-Lee ’16]
17 / 19
[Raghavendra-Rao-Schramm ’16, Bhattiprolu-Guruswami-Lee ’16]
17 / 19
[Raghavendra-Rao-Schramm ’16, Bhattiprolu-Guruswami-Lee ’16]
17 / 19
18 / 19
◮ Similar construction (symmetric difference matrix) with
◮ Hamiltonian of system of bosons 18 / 19
◮ Similar construction (symmetric difference matrix) with
◮ Hamiltonian of system of bosons
◮ A different form of “redemption” for local algorithms ◮ Replicated gradient descent 18 / 19
19 / 19
19 / 19
◮ E.g. gradient descent, AMP 19 / 19
◮ E.g. gradient descent, AMP ◮ Keep track of an n-dimensional state 19 / 19
◮ E.g. gradient descent, AMP ◮ Keep track of an n-dimensional state ◮ Nearly-linear runtime 19 / 19
◮ E.g. gradient descent, AMP ◮ Keep track of an n-dimensional state ◮ Nearly-linear runtime
19 / 19
◮ E.g. gradient descent, AMP ◮ Keep track of an n-dimensional state ◮ Nearly-linear runtime
◮ Soft threshold: optimal algorithm cannot be nearly-linear time 19 / 19
◮ E.g. gradient descent, AMP ◮ Keep track of an n-dimensional state ◮ Nearly-linear runtime
◮ Soft threshold: optimal algorithm cannot be nearly-linear time ◮ For p-way data, need p-way algorithm? 19 / 19
◮ E.g. gradient descent, AMP ◮ Keep track of an n-dimensional state ◮ Nearly-linear runtime
◮ Soft threshold: optimal algorithm cannot be nearly-linear time ◮ For p-way data, need p-way algorithm?
19 / 19
◮ E.g. gradient descent, AMP ◮ Keep track of an n-dimensional state ◮ Nearly-linear runtime
◮ Soft threshold: optimal algorithm cannot be nearly-linear time ◮ For p-way data, need p-way algorithm?
◮ Hierarchy of message-passing algorithms: symm. diff. matrices 19 / 19
◮ E.g. gradient descent, AMP ◮ Keep track of an n-dimensional state ◮ Nearly-linear runtime
◮ Soft threshold: optimal algorithm cannot be nearly-linear time ◮ For p-way data, need p-way algorithm?
◮ Hierarchy of message-passing algorithms: symm. diff. matrices ◮ Keep track of beliefs about higher-order correlations 19 / 19
◮ E.g. gradient descent, AMP ◮ Keep track of an n-dimensional state ◮ Nearly-linear runtime
◮ Soft threshold: optimal algorithm cannot be nearly-linear time ◮ For p-way data, need p-way algorithm?
◮ Hierarchy of message-passing algorithms: symm. diff. matrices ◮ Keep track of beliefs about higher-order correlations ◮ Minimize Kikuchi free energy 19 / 19
◮ E.g. gradient descent, AMP ◮ Keep track of an n-dimensional state ◮ Nearly-linear runtime
◮ Soft threshold: optimal algorithm cannot be nearly-linear time ◮ For p-way data, need p-way algorithm?
◮ Hierarchy of message-passing algorithms: symm. diff. matrices ◮ Keep track of beliefs about higher-order correlations ◮ Minimize Kikuchi free energy ◮ Matches SoS (conjectured optimal) 19 / 19
◮ E.g. gradient descent, AMP ◮ Keep track of an n-dimensional state ◮ Nearly-linear runtime
◮ Soft threshold: optimal algorithm cannot be nearly-linear time ◮ For p-way data, need p-way algorithm?
◮ Hierarchy of message-passing algorithms: symm. diff. matrices ◮ Keep track of beliefs about higher-order correlations ◮ Minimize Kikuchi free energy ◮ Matches SoS (conjectured optimal) ◮ Proof is much simpler than prior work 19 / 19
◮ E.g. gradient descent, AMP ◮ Keep track of an n-dimensional state ◮ Nearly-linear runtime
◮ Soft threshold: optimal algorithm cannot be nearly-linear time ◮ For p-way data, need p-way algorithm?
◮ Hierarchy of message-passing algorithms: symm. diff. matrices ◮ Keep track of beliefs about higher-order correlations ◮ Minimize Kikuchi free energy ◮ Matches SoS (conjectured optimal) ◮ Proof is much simpler than prior work
19 / 19
◮ E.g. gradient descent, AMP ◮ Keep track of an n-dimensional state ◮ Nearly-linear runtime
◮ Soft threshold: optimal algorithm cannot be nearly-linear time ◮ For p-way data, need p-way algorithm?
◮ Hierarchy of message-passing algorithms: symm. diff. matrices ◮ Keep track of beliefs about higher-order correlations ◮ Minimize Kikuchi free energy ◮ Matches SoS (conjectured optimal) ◮ Proof is much simpler than prior work
◮ Unify statistical physics and SoS? 19 / 19
◮ E.g. gradient descent, AMP ◮ Keep track of an n-dimensional state ◮ Nearly-linear runtime
◮ Soft threshold: optimal algorithm cannot be nearly-linear time ◮ For p-way data, need p-way algorithm?
◮ Hierarchy of message-passing algorithms: symm. diff. matrices ◮ Keep track of beliefs about higher-order correlations ◮ Minimize Kikuchi free energy ◮ Matches SoS (conjectured optimal) ◮ Proof is much simpler than prior work
◮ Unify statistical physics and SoS? ◮ Systematically obtain optimal spectral methods in general? 19 / 19
◮ E.g. gradient descent, AMP ◮ Keep track of an n-dimensional state ◮ Nearly-linear runtime
◮ Soft threshold: optimal algorithm cannot be nearly-linear time ◮ For p-way data, need p-way algorithm?
◮ Hierarchy of message-passing algorithms: symm. diff. matrices ◮ Keep track of beliefs about higher-order correlations ◮ Minimize Kikuchi free energy ◮ Matches SoS (conjectured optimal) ◮ Proof is much simpler than prior work
◮ Unify statistical physics and SoS? ◮ Systematically obtain optimal spectral methods in general?
19 / 19