Beta. Content is under active construction and has not been peer-reviewed. Report errors on GitHub.Disclaimer

Neural Network Optimization Landscape

3 questionsDifficulty 6-7View topic
Intermediate
0 / 3
1 intermediate2 advancedAdapts to your performance
1 / 3
intermediate (6/10)conceptual
Consider the saddle point at the origin of . Gradient descent initialized at exactly gets stuck because the gradient is zero. What property of SGD noise helps escape such saddle points?