November 2019
Intermediate to advanced
304 pages
8h 40m
English
Sample: MalmoEnv mdp = new MalmoEnv("cliff_walking_rl4j.xml", actionSpace, observationSpace, obsPolicy);
Sample: double rewards = 0; for (int i = 0; i < 10; i++) { double reward = pol.play(mdp, new HistoryProcessor(MALMO_HPROC)); rewards += reward; Logger.getAnonymousLogger().info("Reward: " + reward); }