January 2020
Intermediate to advanced
432 pages
10h 18m
English
TDL for first step or TD(0) then essentially simplifies to Q-learning. To do a full comparison of this method against DP and MC, we will first revisit the FrozenLake environment from Gym. Open up example code Chapter_4_4.py and follow the exercise:
from os import system, namefrom time import sleepimport numpy as npimport gymimport randomfrom tqdm import tqdm
env = gym.make("FrozenLake-v0")env.render()action_size = env.action_space.n ...Read now
Unlock full access