Beta. Content is under active construction and has not been peer-reviewed. Report errors on GitHub.Disclaimer

Gradient Flow and Vanishing Gradients

4 questionsDifficulty 5-7View topic
Intermediate
0 / 4
3 intermediate1 advancedAdapts to your performance
1 / 4
intermediate (5/10)counterexample
In a deep network with sigmoid activations, gradients in early layers can become exponentially small. Which mechanism is primarily responsible for vanishing gradients?