Practical Simulations for Machine Learning

by Paris Buttfield-Addison, Mars Buttfield-Addison, Tim Nugent, Jon Manning
June 2022
Beginner to intermediate
331 pages
7h 15m
English
O'Reilly Media, Inc.

Chapter 6. Introducing Imitation Learning

In this chapter, we’re going to look at imitation learning (IL). Imitation learning is slightly different from other forms of machine learning because the intent of IL isn’t to achieve a specific goal. Instead, the intent is to copy the behavior of something else. That something else? Probably a human.

To explore IL, we’ll be making another ball-based agent that can roll around, and we’ll be training it to seek and pick up a coin (a classic video game–style pickup). But instead of training it to do what we want by reinforcing the behavior using reward signals, we’ll train it using our own human brains.

This means that, initially, we’ll be moving the agent around ourselves, using the keyboard, just like when we’ve used the heuristic behavior to control agents in previous chapters. The difference is that while we drive the agent around this time, ML-Agents will be watching us, and once we’ve finished, we’ll use IL to let the agent work out how to copy our behavior.
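To make the "watch, then copy" idea concrete, here is a minimal sketch of one simple form of imitation, often called behavioral cloning: record pairs of observation and action from a demonstrator, then have the agent pick the action whose recorded observation is closest to what it currently sees. This is an illustration only, not how ML-Agents implements IL. The `demonstrator` function is a hypothetical stand-in for a human at the keyboard, and the observation format (the offset from the ball to the coin), the action names, and the nearest-neighbor lookup are all assumptions made for the example.

```python
import math
import random

# Hypothetical discrete actions for a ball rolling on a flat plane.
ACTIONS = {"left": (-1, 0), "right": (1, 0), "up": (0, 1), "down": (0, -1)}


def demonstrator(obs):
    """Stand-in for the human player: steer along the larger axis of the
    offset (dx, dy) from the ball to the coin."""
    dx, dy = obs
    if abs(dx) >= abs(dy):
        return "right" if dx > 0 else "left"
    return "up" if dy > 0 else "down"


def record_demonstrations(n, seed=0):
    """Collect n (observation, action) pairs while the demonstrator 'plays'."""
    rng = random.Random(seed)
    demos = []
    for _ in range(n):
        obs = (rng.uniform(-5, 5), rng.uniform(-5, 5))
        demos.append((obs, demonstrator(obs)))
    return demos


def cloned_policy(demos, obs):
    """Imitate the demonstrator: return the action recorded for the
    demonstration whose observation is nearest to the current one."""
    _, action = min(demos, key=lambda d: math.dist(d[0], obs))
    return action


def agreement(demos, n_trials=1000, seed=1):
    """Fraction of unseen observations where the clone matches the demonstrator."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(n_trials):
        obs = (rng.uniform(-5, 5), rng.uniform(-5, 5))
        hits += cloned_policy(demos, obs) == demonstrator(obs)
    return hits / n_trials


demos = record_demonstrations(500)
print(f"clone agrees with demonstrator on {agreement(demos):.0%} of new observations")
```

Notice that no reward appears anywhere in the sketch: the cloned policy is only ever scored on how closely it matches the demonstrator, which is the defining feature of the approach described above.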

Note

IL not only lets you create more humanlike behaviors, it can also be used to essentially jump-start training. Some tasks have very high initial learning curves, and training to get over these early hurdles can be quite slow. If a human can show the agent how to do a task, the agent can use that as guidance when getting started and then optimize the approach from there. Luckily for us, humans are pretty good at plenty of things, and IL lets you take advantage of this. A disadvantage ...


Publisher Resources

ISBN: 9781492089919