Tuning hyperparameters

We have now defined our environment and iterated over all possible actions and results from any given state to calculate the quality value of every move and stored these values in our Q object. At this point, we can now begin to tune the options for this model to see how it impacts performance.

If we recall, there are three parameters for reinforcement learning, and these are alpha, gamma, and epsilon. The following list describes the role of each parameter and the impact of adjusting their value:

  • Alpha: The alpha rate for reinforcement learning is the same as the learning rate for many other machine learning models. It is the constant value used to control how quickly probabilities are updated as calculations are ...

Get Hands-On Deep Learning with R now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.