All about Rainbow DQN
Throughout this book, we have learned how the various threads in Reinforcement Learning (RL) combined to form modern RL and then advanced to Deep Reinforcement Learning (DRL) with the inclusion of Deep Learning (DL). Like most other specialized fields from this convergence, we now see a divergence back to specialized methods for specific classes of environments. We started to see this in the chapters where we covered Policy Gradient (PG) methods and the environments it specialized on are continuous control. The flip side of this is the more typical episodic game environment, which is episodic with some form of discrete control mechanism. These environments typically perform better with DQN but the problem then becomes ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Read now
Unlock full access