February 2019
Intermediate to advanced
260 pages
6h 3m
English
We've learned about the sigmoid function so far, but it is used comparatively less in the modern era of deep learning. The reason for this is because the tanh function works much better than the sigmoid function. The tanh function is grahically represented as follows:

If you look at the graph, you can see that this function looks similar to the sigmoid function, but is centered at the zero. The reason it works better is because it's easier to center your data around 0 than around 0.5.
However, they both share a downside: when the weights become bigger or smaller, this slope in the graph becomes smaller, to almost zero, ...
Read now
Unlock full access