January 2018
Intermediate to advanced
470 pages
11h 9m
English
The QLearning class encapsulates the Q-learning algorithm, more specifically the action-value updating equation. It is a data transformation of type ETransform (we will see this later on) with an explicit configuration of type QLConfig. This class is a generic parameterized class that implements the QLearning algorithm. The Q-learning model is initialized and trained during the instantiation of the class so it can be in the correct state for runtime prediction.
Therefore, the class instances have only two states: successfully trained and failed training (we'll see this later).
The implementation does not assume that every episode (or training cycle) will be successful. At the completion of training, the ...
Read now
Unlock full access