A convex function satisfies

where and , for any set of points . This is known as Jensen’s Inequality.

If we interpret as the probability distribution over a discrete variable taking the values , then we can rewrite the above as:

where denotes the expectation.

Continuous Form

For continuous variables, Jensen’s Inequality takes the form