Transformation of Densities

How does a probability density transform under a non-linear change of variable? Probability densities have different behavior than simple functions under such transforms.

Consider a single variable $x$ , and we make a change of variables $x = g (y)$ , such that $f (x)$ becomes a new function $\tilde{f} (y)$ such that

\tilde{f} (y) = f (g (y))

For a probability density $p_{x} (x)$ , if we want to find a density for a new variable $y$ , such that $x = g (y)$ . This density is expressed as $p_{y} (y)$ . To make this transformation, we consider the probabilities of $x$ and $y$ falling into infinitesimally small ranges.

The probability that $x$ in the range $(x, x + δ x)$ is $p_{x} (x) δ x$ (see probability density).
Similarly, the probability that $x$ in the range $(y, y + δy)$ is $p_{y} (y) δy$ .

Now, since $x$ and $y$ are related by $x = g (y)$ , we can say that a small change in $y$ will cause a corresponding small change in $x$ . This can be expressed mathematically by considering that probability is conserved when we change variables, such that:

p_{x} (x) δ x \approx p_{y} (y) δy

This becomes exactly equal when we take the limit of $δ x \to 0$ and $δy \to 0$ :

δ x \to 0 lim p_{x} (x) δ x = δy \to 0 lim p_{y} (y) δy

We can then turn this into:

p_{y} (y) = p_{x} (x) \frac{d x}{d y} = p_{x} (g (y)) \frac{d g}{d y}

Here we’re using the modulus $∣ \cdot ∣$ because the derivative could be negative, but we want to scale the density by the proportion of lengths, which is a positive value.

This sort of procedure is very powerful, as any density $p (y)$ can be obtained from a fixed density $q (x)$ by making a non-linear change of variable $y = f (x)$ in which $f (x)$ is monotonic so that $0 \leq f^{'} (x) < \infty$ , and $q (x)$ is non-zero everywhere. However, it can also make things more complicated, such as wen trying to find Maximum of Transformed Density.

This property is important to Normalizing Flows.

/notes/

Recent

Sources of Test Error

UDL Chapter 8 Problems

Parameter Initialization

Transformation of Densities

Graph View

Backlinks