Artificial Neuron

The basic element of a neural network is called a neuron. It is also referred to as a “unit” or “node”, and sometimes as a “perceptron”, though I prefer to reserve perceptron for the learning algorithm.

The neuron is essentially a non-linear function that maps an input vector $x \in R^{m}$ to a single value $a \in R$. It is parametrized by:

A vector of weights $(w_{1}, \dots, w_{m}) \in R^{m}$ and an offset/threshold $w_{0} \in R$.
An activation function $f : R \to R$, which gives us non-linearity.

In total, the function represented by the neuron can be summarized as:

$$a = f(z) = f\left(\sum_{j=1}^{m} x_{j} w_{j} + w_{0}\right) = f(w^{T} x + w_{0})$$

The final form is simply the activation function applied to a linear function of the input, the same $w^{T} x + w_{0}$ used by a linear classifier.
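As a minimal sketch of this computation, the snippet below evaluates $a = f(w^{T} x + w_{0})$ for one input. The function names `neuron` and `sigmoid`, the choice of a sigmoid activation, and the example numbers are all assumptions made for illustration; the notes do not fix a particular activation.

```python
import numpy as np

def neuron(x, w, w0, f):
    """Return a = f(w^T x + w0) for a single input vector x."""
    z = np.dot(w, x) + w0  # pre-activation: weighted sum plus offset
    return f(z)            # the activation supplies the non-linearity

def sigmoid(z):
    # One common activation choice (an assumption, not prescribed by the notes).
    return 1.0 / (1.0 + np.exp(-z))

# Illustrative values only.
x = np.array([1.0, 2.0])
w = np.array([0.5, -0.25])
w0 = 0.0

a = neuron(x, w, w0, sigmoid)  # here z = 0.5 - 0.5 + 0 = 0
```

Passing the activation in as an argument keeps the linear part and the non-linearity separate, which mirrors the two-part parametrization above.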

Training

How do we train a single unit? Given a loss function $L(\text{guess}, \text{actual})$ and a dataset $\{(x^{(1)}, y^{(1)}), \dots, (x^{(n)}, y^{(n)})\}$, we can do gradient descent, adjusting the weights $w, w_{0}$ to minimize:

$$J(w, w_{0}) = \sum_{i=1}^{n} L\left(NN(x^{(i)}; w, w_{0}), y^{(i)}\right)$$

where $NN (\cdot)$ is the output of our neural net for a given input.

Linear classifiers with hinge loss and regression with quadratic loss, which we’ve studied, are two special cases of the neuron; both use the identity activation function $f(z) = z$.
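To make the training procedure concrete, here is a rough sketch of gradient descent on $J$ for the regression special case: identity activation with squared loss $L(g, y) = (g - y)^{2}$. The name `train_unit`, the step size `lr`, the number of steps, and the toy data are assumptions chosen for illustration, not values from the notes.

```python
import numpy as np

def train_unit(X, y, lr=0.02, steps=2000):
    """Gradient descent on J(w, w0) = sum_i (w^T x_i + w0 - y_i)^2."""
    n, m = X.shape
    w = np.zeros(m)
    w0 = 0.0
    for _ in range(steps):
        a = X @ w + w0               # identity activation, so a = z
        err = a - y                  # guess minus actual, per example
        grad_w = 2.0 * (X.T @ err)   # dJ/dw
        grad_w0 = 2.0 * err.sum()    # dJ/dw0
        w -= lr * grad_w             # step against the gradient
        w0 -= lr * grad_w0
    return w, w0

# Toy data generated from y = 2x + 1 (illustrative only).
X = np.array([[0.0], [1.0], [2.0], [3.0]])
y = 2.0 * X[:, 0] + 1.0

w, w0 = train_unit(X, y)
```

Because $J$ here is a sum rather than a mean, the gradient grows with the dataset size, so a step size that works for this toy set would likely need to shrink for larger ones.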
