Summary
Our task for this chapter was to use machine learning to teach the robot how to use its robot arm. We used two techniques with some variations. We used a variety of reinforcement learning, called Q-learning, to develop a movement path by selecting individual actions based on the robot's arm state. Each motion was scored individually as a reward, and as part of the overall path as a value. The process stored the results of learning into a Q-matrix that could be used to generate a path. We improved our first cut at the reinforcement learning program by indexing, or encoding, the motions from a 27-element array of possible combinations of motors to a number from 0 to 26, and likewise indexing the robot state to a state lookup table. ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Read now
Unlock full access