/notes/

Recent

  • Japanese Denim Chords

    Jul 04, 2026

    • Decoder Model

      Jul 03, 2026

      • dl
    • Encoder Model

      Jul 03, 2026

      • dl

    See 1742 more →

    Home

    ❯

    ML

    ❯

    Optimizing Neural Networks

    Optimizing Neural Networks

    Jan 28, 20241 min read

    • ml

    An optimizer takes in a gradient and decides how to update the parameters based on the gradients. The simplest form is Stochastic Gradient Descent.

    • Adaptive step-size
      • Running Averages
      • Momentum (ML)
    • Adadelta
    • Adaptive moment estimation

    Graph View

    Created with Quartz v4.4.0 © 2026

    • Main Website
    • GitHub
    • LinkedIn