N. SanghiDeep Reinforcement Learning with Pythonhttps://doi.org/10.1007/979-8-8688-0273-7_5

5. Function Approximation and Deep Learning

Nimish Sanghi¹

(1)

Bangalore, India

The previous three chapters looked at various approaches to planning and control—first at the dynamic programming (DP), then at the Monte Carlo approach (MC), and finally at the temporal difference (TD) approach. In all these approaches, you saw problems where the state space and actions were discrete. Only in the previous chapter, toward the end, did I talk about Q-learning in a continuous state space. You discretized the state values using an arbitrary approach and trained a learning ...

Get Deep Reinforcement Learning with Python: RLHF for Chatbots and Large Language Models now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.

Start your free trial

Deep Reinforcement Learning with Python: RLHF for Chatbots and Large Language Models by Nimish Sanghi

5. Function Approximation and Deep Learning

Don’t leave empty-handed

It’s yours, free.

Check it out now on O’Reilly