Likelihood Function

Suppose we have a data set of observations represented by row vector $x = (x_{1}, \dots, x_{n})$ , representing $N$ observations of a scalar variable $x$ . These observations are drawn from a Gaussian whose parameters, mean $μ$ and variance $σ^{2}$ , are unknown. Given our observations, we want to estimate these parameters to find the distribution that they came from.

Data points that are drawn independently from the same distribution are independent and identically distributed (IID or i.i.d). The joint probability of independent events is given by the product of the marginal probabilities for each event separately. Because our dataset is i.i.d, we can therefore write the probability of the dataset, given $μ$ and $σ^{2}$ , as

p (x ∣ μ, σ^{2}) = n = 1 \prod N N (x_{n} ∣ μ, σ^{2})

When viewed as a function of $μ$ and $σ^{2}$ , this is called a likelihood function for the Gaussian.

In the diagram, 2.55 refers to the equation above.

/notes/

Recent

Backpropagation Algorithm

Backpropagation Intuition

Backpropagation Toy Example

Likelihood Function

Graph View

Backlinks