Support Vector Machine

A supervised learning algorithm for linear classifiers that aims to find the hyperplane decision boundary that maximizes the margin to the data.

Specifically, SVM aims to choose the hyperplane so that the distance from the hyperplane to the nearest data point on each side is maximized. These nearest points are called support vectors because they “support” the hyperplane.
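The distance in question can be made concrete. As a sketch, assuming the hyperplane passes through the origin (no bias term, matching the $θ \cdot x$ form used in this note), the distance from a point $x$ to the hyperplane $θ \cdot x = 0$ is:

$$
d(x) = \frac{|θ \cdot x|}{\lVert θ \rVert}
$$

For example, with the illustrative values $θ = (3, 4)$ and $x = (1, 1)$, we get $θ \cdot x = 7$ and $\lVert θ \rVert = 5$, so $d(x) = 7/5 = 1.4$.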

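If $θ$ is scaled so that the support vectors lie on the two margin hyperplanes $θ \cdot x = \pm 1$, the distance between these hyperplanes is $\frac{2}{\lVert θ \rVert}$. So, to maximize the margin, $\lVert θ \rVert$ needs to be minimized. This is why objective functions for SVMs typically include a $\frac{λ}{2} \lVert θ \rVert^{2}$ regularizer. A typical training objective would be: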

$$J(θ) = \frac{1}{n} \sum_{i=1}^{n} L_{h}\left(y^{(i)} \, θ \cdot x^{(i)}\right) + \frac{λ}{2} \lVert θ \rVert^{2}$$

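where $L_{h}$ is Hinge Loss, $L_{h}(z) = \max(0, 1 - z)$. Hinge loss suits support vector machines because it is zero only when $y \, θ \cdot x \geq 1$, that is, when the point is correctly classified and lies on or outside the margin, and it grows linearly as a point moves inside the margin or onto the wrong side. Minimizing it together with the regularizer therefore matches the margin maximization objective here.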
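The objective above can be sketched in a few lines of NumPy. This is a minimal illustration, not a reference implementation: it assumes labels in $\{-1, +1\}$, no bias term, and plain subgradient descent as the optimizer (the note does not specify one). The function names, toy data, learning rate, and step count are all hypothetical choices made for the example.

```python
import numpy as np

def hinge_loss(z):
    """Hinge loss L_h(z) = max(0, 1 - z), applied elementwise."""
    return np.maximum(0.0, 1.0 - z)

def objective(theta, X, y, lam):
    """J(theta) = mean hinge loss + (lam / 2) * ||theta||^2."""
    agreement = y * (X @ theta)
    return hinge_loss(agreement).mean() + 0.5 * lam * (theta @ theta)

def subgradient(theta, X, y, lam):
    """One subgradient of J at theta."""
    agreement = y * (X @ theta)
    # Only points inside the margin or misclassified contribute to the loss term.
    active = agreement < 1.0
    grad_loss = -(y[active, None] * X[active]).sum(axis=0) / len(y)
    return grad_loss + lam * theta

def train(X, y, lam=0.01, lr=0.1, steps=2000):
    """Plain subgradient descent from theta = 0 (assumed optimizer)."""
    theta = np.zeros(X.shape[1])
    for _ in range(steps):
        theta -= lr * subgradient(theta, X, y, lam)
    return theta

# Toy data, linearly separable through the origin (hypothetical values).
X = np.array([[2.0, 2.0], [1.0, 3.0], [3.0, 1.0],
              [-2.0, -2.0], [-1.0, -3.0], [-3.0, -1.0]])
y = np.array([1.0, 1.0, 1.0, -1.0, -1.0, -1.0])

theta = train(X, y)
predictions = np.sign(X @ theta)
margin_width = 2.0 / np.linalg.norm(theta)  # distance between theta.x = +1 and -1
```

With a fixed step size the iterate does not settle exactly on the minimizer, so `margin_width` is only an approximation of the maximum margin; a decaying step size or a dedicated solver would be the usual next step.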
