A convex function satisfies
where and , for any set of points . This is known as Jensen’s Inequality.
If we interpret as the probability distribution over a discrete variable taking the values , then we can rewrite the above as:
where denotes the expectation.
Continuous Form
For continuous variables, Jensen’s Inequality takes the form