Convolutional Neural Networks consist of a sequence of convolutional layers. For a given hidden unit in the network, its receptive field is the region of the original input that feeds into it.

Consider a convolutional network where each convolutional layer has kernel size 3.

  • The hidden units in the first layer take the weighted sum of the three closest inputs, so have receptive fields of size 3.
  • The units in the second layer take a weighted sum of the three closest positions in the first layer, which are themselves weighted sums of three inputs. Hence, the hidden units in the second layer have a receptive field of size 5.

Thus, the receptive field of units in successive layers increases, and information from across the input is gradually integrated.