The output of a capsule is a vector, compared to the output of a neuron, which is a scalar value. The capsule output vector carries the following meaning:

  • The elements of the vector represent the pose and other properties of the object.
  • The length of the vector is in the (0, 1) range and represents the probability of detecting the feature at that location. As a reminder, the length of a vector is , where vi are the vector elements.

Let's consider a capsule that detects faces. If we start moving a face across an image, the values of the capsule vector will change to reflect the change in the position. However, its length will always ...

Get Advanced Deep Learning with Python now with O’Reilly online learning.

O’Reilly members experience live online training, plus books, videos, and digital content from 200+ publishers.