Maximum Likelihood Estimation

To determine the parameters in a probability distribution using an observed data set, known as the maximum likelihood, is to find the parameters that maximize the likelihood function, which usually take some form like this:

p (x ∣ μ, σ^{2}) = n = 1 \prod N N (x_{n} ∣ μ, σ^{2})

The most convenient way to do this is to take the log of the likelihood function; since logarithms are monotonically increasing, maximizing the log of a function is equivalent to maximizing the function itself, and lets us simplify the mathematical analysis. It’s also easier to do programmatically because products of small numbers can cause underflow.

This $ln p (x ∣ μ, σ^{2})$ expression can then be maximized with several methods:

Analytical solution: Find partial derivatives with respect to $μ$ and $σ^{2}$ , then set to zero and solve.
- See Gaussian Maximum Likehood Estimation for an example of this method.
Learning solution: Define an error function and minimize.

/notes/

Recent

Sources of Test Error

UDL Chapter 8 Problems

Parameter Initialization

Maximum Likelihood Estimation

Graph View

Backlinks