Deep Q-learning solution
As we said in the Deep Q-learning section, Deep Q-learning identifies a reinforcement learning method of the approximation of a function. It therefore represents an evolution of the basic Q-learning method since the state-action table is replaced by a neural network, with the aim of approximating the optimal value function.
Compared to the previous approaches, where it was used to structure the network in order to request both input and action, and provided its expected return, Deep Q-learning revolutionizes the structure to request only the state of the environment and supply as many status-action values as there are actions that can be performed in the environment.
In the following code, a Deep Q-learning solution ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Read now
Unlock full access