August 2021
Intermediate to advanced
752 pages
21h 59m
English
This appendix is related to Chapter 2, “Gradient-Based Learning.”
The prevailing method for adjusting the weights in deep learning (DL) is gradient descent, an iterative method for minimizing the value of a function. We believe that many readers are already familiar with a different iterative minimization method known as Newton-Raphson. We have included this appendix for readers who are curious about how the two methods relate to each other.
We feel bad for poor Raphson, whose name is often left out: the method is more commonly referred to as just Newton’s method.
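As a rough illustration of how the two methods relate, the sketch below minimizes the same one-dimensional function with both. It is not taken from the book: the function f(x) = x² + e⁻ˣ, the starting point, the learning rate, and all function names are assumptions chosen for the example. Gradient descent steps against the first derivative scaled by a fixed learning rate, while Newton-Raphson, when used for minimization, is applied to the derivative and scales each step by the inverse of the second derivative.

```python
import math


def f_prime(x):
    # First derivative of the example function f(x) = x**2 + exp(-x).
    return 2.0 * x - math.exp(-x)


def f_double_prime(x):
    # Second derivative of the example function; always positive, so f is convex.
    return 2.0 + math.exp(-x)


def gradient_descent(df, x0, learning_rate=0.1, tol=1e-10, max_steps=10_000):
    """Step against the derivative, scaled by a fixed learning rate."""
    x = x0
    for step in range(max_steps):
        g = df(x)
        if abs(g) < tol:
            return x, step
        x -= learning_rate * g
    return x, max_steps


def newton_raphson(df, d2f, x0, tol=1e-10, max_steps=100):
    """Find a zero of the derivative; the step is scaled by 1 / f''(x)."""
    x = x0
    for step in range(max_steps):
        g = df(x)
        if abs(g) < tol:
            return x, step
        x -= g / d2f(x)
    return x, max_steps


gd_x, gd_steps = gradient_descent(f_prime, x0=2.0)
nr_x, nr_steps = newton_raphson(f_prime, f_double_prime, x0=2.0)
print(f"gradient descent: x = {gd_x:.8f} after {gd_steps} steps")
print(f"Newton-Raphson:   x = {nr_x:.8f} after {nr_steps} steps")
```

On this convex example both methods arrive at the same minimum, and Newton-Raphson needs far fewer iterations because it uses curvature information to size each step. That comes at the cost of requiring the second derivative, and on non-convex functions a zero of the derivative can also be a maximum or a saddle point.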
We describe Newton-Raphson in a single dimension, similarly to how we introduced gradient descent in Chapter ...