February 2019
Beginner to intermediate
308 pages
7h 42m
English
The Forget gate (f) of an LSTM is highlighted in the following diagram:

The Forget gate (f) forms the first part of the LSTM repeating unit, and its role is to decide how much data we should forget or remember from the previous cell state. It does so by first concatenating the Previous Hidden State (ht−1) and the current Input (xt), then passing the concatenated vector through a sigmoid function. Recall that the sigmoid function outputs a vector with values between 0 and 1. A value of 0 means to stop the information from passing through (forget), and a value of 1 means to pass the information through (remember).
The output of the ...