/notes/

Recent

  • Net Neutrality Discussion

    Feb 04, 2026

    • hist216
  • Automatic Differentiation

    Jan 27, 2026

    • amath449
    • dl
  • Library Discussion

    Jan 27, 2026

    • hist216

See 1585 more →

Home

❯

ML

❯

Optimizing Neural Networks

Optimizing Neural Networks

Jan 28, 20241 min read

  • ml

An optimizer takes in a gradient and decides how to update the parameters based on the gradients. The simplest form is Stochastic Gradient Descent.

  • Adaptive step-size
    • Running Averages
    • Momentum (ML)
  • Adadelta
  • Adaptive moment estimation

Graph View

Created with Quartz v4.4.0 © 2026

  • Main Website
  • GitHub
  • LinkedIn