8 Training neural networks: Forward propagation and backpropagation

This chapter covers

  • Sigmoid functions as differentiable surrogates for Heaviside step functions (see the sketch after this list)
  • Layering in neural networks: expressing linear layers as matrix-vector multiplication
  • Regression loss, forward and backward propagation, and their math
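
As a quick preview of the first topic, the sketch below (a minimal illustration, not code from the book) compares the Heaviside step with the sigmoid σ(x) = 1 / (1 + e^(-x)). The sigmoid tracks the step closely, but unlike the step it is smooth everywhere, so it has a usable derivative, which is exactly what gradient-based training needs.

    import numpy as np

    def heaviside(x):
        """Heaviside step: 0 for x < 0, 1 for x >= 0. Its derivative is 0
        everywhere it exists -- useless for gradient descent."""
        return (x >= 0).astype(float)

    def sigmoid(x):
        """Smooth surrogate for the step: sigma(x) = 1 / (1 + e^(-x))."""
        return 1.0 / (1.0 + np.exp(-x))

    xs = np.array([-4.0, -1.0, 0.0, 1.0, 4.0])
    print(heaviside(xs))   # [0. 0. 1. 1. 1.]
    print(sigmoid(xs))     # approx [0.018 0.269 0.5 0.731 0.982]

    # The sigmoid's derivative has a closed form, sigma'(x) = sigma(x) * (1 - sigma(x)),
    # which is what makes backpropagation workable.
    print(sigmoid(xs) * (1 - sigmoid(xs)))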

So far, we have seen that neural networks make complicated real-life decisions by modeling the decision-making process with mathematical functions. These functions can become arbitrarily involved, but fortunately, we have a simple building block called a perceptron that can be repeated systematically to model an arbitrary function. We need not even know the function being modeled explicitly, in closed form. All we need is a reasonably ...
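
To make the building-block idea concrete, here is a minimal sketch (an assumed setup for illustration, not the book's code) of a single perceptron and of a layer of perceptrons written as one matrix-vector multiplication, the form the chapter develops.

    import numpy as np

    def sigmoid(x):
        return 1.0 / (1.0 + np.exp(-x))

    # One perceptron: a weighted sum of inputs squashed by the sigmoid.
    def perceptron(w, b, x):
        return sigmoid(np.dot(w, x) + b)

    # A linear layer stacks many perceptrons' weight vectors into a matrix W,
    # so the whole layer reduces to one matrix-vector multiplication.
    def linear_layer(W, b, x):
        return sigmoid(W @ x + b)

    rng = np.random.default_rng(0)
    x = rng.normal(size=3)                                 # 3 input features

    w, b = rng.normal(size=3), rng.normal()
    print(perceptron(w, b, x))                             # one neuron's output in (0, 1)

    W1, b1 = rng.normal(size=(4, 3)), rng.normal(size=4)   # layer 1: 3 -> 4
    W2, b2 = rng.normal(size=(1, 4)), rng.normal(size=1)   # layer 2: 4 -> 1

    # Repeating the building block: one layer's output feeds the next.
    h = linear_layer(W1, b1, x)
    y = linear_layer(W2, b2, h)
    print(y)                                               # length-1 array: the network's prediction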
