Vanishing and exploding gradients are common problems that arise as a side effect of backpropagation.
The gradient is the slope of the loss function: it measures how much the error changes as each weight changes.
- Vanishing gradients occur when the gradient is too small: as it is propagated backward through the layers it shrinks further, so the weight updates become insignificant and the network effectively stops learning (see the sketch after this list).
- Exploding gradients occur when the gradient grows too large, producing huge weight updates that destabilize the model (e.g., NaN losses).
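
As a minimal sketch of the vanishing case (assuming PyTorch; the depth, width, and dummy loss are arbitrary choices for illustration), a deep stack of sigmoid layers makes the per-layer gradient norms shrink toward the input end:

```python
import torch
import torch.nn as nn

# A deep stack of small linear layers with sigmoid activations --
# a classic setup in which gradients vanish as depth grows,
# because the sigmoid's derivative is at most 0.25.
depth, width = 20, 32
layers = []
for _ in range(depth):
    layers += [nn.Linear(width, width), nn.Sigmoid()]
model = nn.Sequential(*layers)

x = torch.randn(8, width)        # dummy input batch
loss = model(x).pow(2).mean()    # arbitrary scalar loss for illustration
loss.backward()

# Print each Linear layer's weight-gradient norm; earlier layers
# (smaller indices) show much smaller norms than later ones.
for i, layer in enumerate(model):
    if isinstance(layer, nn.Linear):
        print(f"layer {i:2d}  grad norm = {layer.weight.grad.norm():.3e}")
```

Replacing the sigmoids with an activation that does not squash gradients, or initializing the weights with a large gain, flips the problem: the same loop then shows norms growing layer by layer, the exploding case.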