April 2017
Intermediate to advanced
406 pages
10h 15m
English
Now that we have learned the world's simplest game, let's try learning something a bit more dynamic. The cart pole task is a classic reinforcement learning problem. The agent must control a cart, on which is balanced a pole, attached to the cart via a joint. At every step, the agent can choose to move the cart left or right, and it receives a reward of 1 every time step that the pole is balanced. If the pole ever deviates by more than 15 degrees from upright, then the game ends:

Figure 5: The cart pole task
To run the cart pole task, we will use OpenAIGym, an open source project set up in 2015, which gives a way to run reinforcement ...