January 2019
Intermediate to advanced
386 pages
11h 13m
English
Let's start with a quote from professor Hinton himself:
What he means is that the CNNs are translation-invariant. To understand this, let's imagine a picture with a face, located in the right half of the picture. Translation invariance means that a CNN is very good at telling us that the picture contains a face, but it cannot tell us whether the face is in the left or right part of the image. The main culprit for this behavior is the pooling layers. Every pooling layer introduces a little translation invariance. For example, the max pooling routes forward the activation of only ...