SLIDE 1
Gradient Descent Finds Global Minima of Deep Neural Networks
Simon S. Du, Jason D. Lee, Haochuan Li, Liwei Wang, Xiyu Zhai
1
Gradient Descent Finds Global Minima of Deep Neural Networks Simon - - PowerPoint PPT Presentation
Gradient Descent Finds Global Minima of Deep Neural Networks Simon S. Du, Jason D. Lee, Haochuan Li, Liwei Wang, Xiyu Zhai 1 Empirical Observations on Empirical Risk Zhang et al, 2017, Understanding Deep Learning Requires Rethinking
1
2
3
i=1 , xi ∈ Rd, yi ∈ R
n
i=1
4
5
L
`=1
ij(t) = 1
m→∞ L
`=1
m→∞ L
`=1
L
`=1
2 exp (λ0t) ku(0) yk2 2, λ0 = λmin (H∞)
6
12 3456(8) , then with high probability over
@<(=(0)).
7
01 23 ,
;7(8(0)).
8