Use these questions and exercises to reinforce the material you just learned. The exercises may be fun to attempt, so be sure to try atleast two to four questions/exercises:
Questions:
- What are the names of the main components of an RL system? Hint, the first one is Environment.
- Name the four elements of an RL system. Remember that one element is optional.
- Name the three main threads that compose modern RL.
- What makes a Markov state a Markov property?
- What is a policy?
Exercises:
- Using Chapter_1_2.py, alter the code so the agent pulls from a bandit with 1,000 arms. What code changes do you need to make?
- Using Chapter_1_3.py, alter the code so that the agent pulls from the average value, not greedy/max. How did this affect the ...