Let's first discuss what the conv layer computes intuitively. The Conv layer's parameters consist of a set of learnable filters (also called tensors). Each filter is small spatially (depth, width, and height), but extends through the full depth of the input volume (image). A filter on the first layer of a ConvNet typically has a size of 5 x 5 x 3 (that is, five pixels width and height, and three for depth, because images have three depths for color channels). During the forward pass, filters slide (or convolve) across the width and height of the input volume and compute the dot product between the entries of the filter and the input at any point. As the filter slides over the width and height of the input volume, ...
Single Conv Layer Computation
Get Neural Network Programming with TensorFlow now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.