Machine Vision, GANs, and Deep Reinforcement Learning

Video description

6 Hours of Video Instruction

An intuitive introduction to the latest superhuman capabilities facilitated by Deep Learning.


Machine Vision, GANs, Deep Reinforcement Learning LiveLessons is an introduction to three of the most exciting topics in Deep Learning today. Modern machine vision involves automated systems outperforming humans on image recognition, object detection, and image segmentation tasks. Generative Adversarial Networks cast two Deep Learning networks against each other in a “forger-detective” relationship, enabling the fabrication of stunning, photorealistic images with flexible, user-specifiable elements. Deep Reinforcement Learning has produced equally surprising advances, including the bulk of the most widely-publicized “artificial intelligence” breakthroughs. Deep RL involves training an “agent” to become adept in given “environments,” enabling algorithms to meet or surpass human-level performance on a diverse range of complex challenges, including Atari video games, the board game Go, and subtle hand-manipulation tasks. Throughout these lessons, essential theory is brought to life with intuitive explanations and interactive, hands-on Jupyter notebook demos. Examples feature Python and straightforward Keras layers in TensorFlow 2, the most popular Deep Learning library.

About the Instructor

Jon Krohn is Chief Data Scientist at the machine learning company untapt. He presents a popular series of deep learning tutorials published by Addison-Wesley and is the author of the bestselling book Deep Learning Illustrated. Jon teaches his deep learning curriculum in-classroom at the New York City Data Science Academy, as well as guest lecturing at Columbia University and New York University. He holds a doctorate in neuroscience from Oxford University and has been publishing on machine learning in leading journals since 2010.

Skill Level

  • Intermediate

Learn How To

  • Understand the high-level theory and key language around machine vision, deep reinforcement learning, and generative adversarial networks
  • Create state-of-the art models for image recognition, object detection, and image segmentation
  • Architect GANs that create convincing images in the style of human-drawn illustrations
  • Build deep RL agents that become adept at performing in a wide variety of environments, such as those provided by OpenAI Gym
  • Run automated experiments for optimizing deep reinforcement learning agent hyperparameters, such as its artificial-neural-network configuration
  • Appreciate what the current limitations of “artificial intelligence” are and how they may be overcome in the near future

Who Should Take This Course

  • Perfectly suited to software engineers, data scientists, analysts, and statisticians with an interest in applying Deep Learning to natural language data
  • Code examples are provided in Python, so familiarity with it or another object-oriented programming language would be helpful

Course Requirements

  • The author’s Deep Learning with TensorFlow, Keras, and PyTorch LiveLessons, or familiarity with the topics covered in Chapters 5 through 9 of his book Deep Learning Illustrated, are a prerequisite.

Lesson Descriptions

Lesson 1: Orientation
Lesson 1 starts with a quick review of how to run the code in the lessons. It then reviews the foundational deep learning theory that is essential for building machine vision, GANs, and deep reinforcement learning specializations. The lesson wraps up with a sneak peek at the cutting-edge capabilities that will be developed over the course of all five LiveLessons.

Lesson 2: Convolutional Neural Networks for Machine Vision
Lesson 2 introduces convolutional layers. Convolutional layers are first used to create ConvNets in TensorFlow. The lesson then covers the gamut of machine learning applications, including residual networks, image segmentation, object detection, transfer learning, and capsule networks.

Lesson 3: Generative Adversarial Networks for Creativity
Lesson 3 begins with the applications of and essential theory behind generative adversarial networks (GANs). You then are shown the Quick Draw! Game, which is used as a source of hundreds of thousands of hand-drawn images from a single class for a GAN to learn how to imitate. The rest of the lesson is spent developing the intricate code for the three primary components of a GAN: the discriminator network, the generator network, and the adversarial network that pits them against each other.

Lesson 4: Deep Reinforcement Learning
Lesson 4 begins with a definition of deep reinforcement learning, then turns to an overview of its applications. The lesson wraps up with a discussion of the relationship of deep reinforcement learning to artificial intelligence.

Lesson 5: Deep Q-Learning and Beyond
Lesson 5 starts with the Cart-Pole game, which is used in the lesson to train the deep reinforcement learning algorithms. Then it turns to the theory behind deep reinforcement learning as well as deep Q-learning, a popular type of deep reinforcement learning agent.

With this theory under your belt you will be able to understand at an intuitive level the code that you subsequently when you define your own DQN agent and have it excel at the Cart-Pole game. The lesson wraps up by exploring a tool for optimizing deep reinforcement learning agent hyperparameters, introducing agents beyond DQN agents, outlining some possible project ideas, and return to the discussion of artificial intelligence to address the limitations of contemporary deep learning approaches.

About Pearson Video Training

Pearson publishes expert-led video tutorials covering a wide selection of technology topics designed to teach you the skills you need to succeed. These professional and personal technology videos feature world-leading author instructors published by your trusted technology brands: Addison-Wesley, Cisco Press, Pearson IT Certification, Prentice Hall, Sams, and Que Topics include: IT Certification, Network Security, Cisco Technology, Programming, Web Development, Mobile Development, and more. Learn more about Pearson Video training at

Table of contents

  1. Introduction
    1. Machine Vision, GANs, Deep Reinforcement Learning: Introduction 00:02:48
  2. Lesson 1: Orientation
    1. Topics 00:00:30
    2. 1.1 Running the Hands-On Code Examples in Jupyter Notebooks 00:13:19
    3. 1.2 Review of Prerequisite Deep Learning Theory 00:14:52
    4. 1.3 A Sneak Peak 00:06:23
  3. Lesson 2: Convolutional Neural Networks for Machine Vision
    1. Topics 00:00:28
    2. 2.1 Convolutional Layers 00:27:48
    3. 2.2 Convolutional Filter Hyperparameters 00:07:05
    4. 2.3 Activation Pooling and Flattening 00:05:32
    5. 2.4 Building A ConvNet in TensorFlow 00:19:33
    6. 2.5 ConvNet Model Architectures 00:10:21
    7. 2.6 Residual Networks 00:09:51
    8. 2.7 Image Segmentation 00:16:07
    9. 2.8 Object Detection 00:07:34
    10. 2.9 Transfer Learning 00:16:16
    11. 2.10 Capsule Networks 00:01:20
  4. Lesson 3: Generative Adversarial Networks for Creativity
    1. Topics 00:00:44
    2. 3.1 A Boozy All-Nighter 00:04:23
    3. 3.2 Latent Space: Arithmetic on Fake Human Faces 00:05:35
    4. 3.3 Style Transfer: Converting Photos into Monet (and Vice Versa) 00:03:32
    5. 3.4 Applications of GANs 00:02:42
    6. 3.5 Essential GAN Theory 00:10:04
    7. 3.6 The "Quick, Draw!" Dataset 00:12:54
    8. 3.7 The Discriminator Network 00:11:29
    9. 3.8 The Generator Network 00:10:40
    10. 3.9 Training the Adversarial Network 00:19:35
  5. Lesson 4: Deep Reinforcement Learning
    1. Topics 00:00:23
    2. 4.1 Three Categories of Machine Learning Problems 00:07:18
    3. 4.2 When Reinforcement Learning Becomes Deep 00:02:47
    4. 4.3 Applications to Video Games 00:04:04
    5. 4.4 Applications to Board Games 00:13:16
    6. 4.5 Real-World Applications 00:02:32
    7. 4.6 Reinforcement Learning Environments 00:03:52
    8. 4.7 Three Categories of Artificial Intelligence 00:03:58
  6. Lesson 5: Deep Q-Learning and Beyond
    1. Topics 00:01:03
    2. 5.1 The Cart-Pole Game 00:03:06
    3. 5.2 Essential Reinforcement Learning Theory 00:12:14
    4. 5.3 Deep Q-Learning Networks 00:10:17
    5. 5.4 Defining a DQN Agent 00:24:12
    6. 5.5 Interacting with an Environment 00:07:54
    7. 5.6 Hyperparameter Optimization with SLM Lab 00:05:47
    8. 5.7 Agents Beyond DQN 00:05:49
    9. 5.8 Datasets, Project Ideas, and Resources for Self-Study 00:08:46
    10. 5.9 Approaching Artificial General Intelligence 00:05:58
  7. Summary
    1. Machine Vision, GANs, Deep Reinforcement Learning: Summary 00:01:10

Product information

  • Title: Machine Vision, GANs, and Deep Reinforcement Learning
  • Author(s): Jon Krohn
  • Release date: April 2020
  • Publisher(s): Addison-Wesley Professional
  • ISBN: 0136620183