Core idea:

Use a feature function to transform original data from to some other

Basic example of this is to transform a Linear Classifier through the origin to one not through the origin.

We can even turn un-separable datasets into separable ones!

  • This is a very useful and generalizable strategy – it serves as the basis for kernel methods.

  • One systematic strategy for constructing a new feature space is to use a polynomial basis.

  • Features can also be improved through Feature Engineering.