Practical AI on the Google Cloud Platform

Name: Practical AI on the Google Cloud Platform
Author: Micheal Lanham
ISBN: 9781492075813

by Micheal Lanham

October 2020

Beginner to intermediate

391 pages

10h 22m

English

O'Reilly Media, Inc.

Preface
  Who Should Read This Book
  Why I Wrote This Book
  Navigating This Book
  A Note on the Google AI Platform
  Things You Need for This Book
  Conventions Used in This Book
  Using Code Examples
  O’Reilly Online Learning
  How to Contact Us
  Acknowledgments
1. Data Science and Deep Learning
  What Is Data Science?
  Classification and Regression
  Regression
  Goodness of Fit
  Classification with Logistic Regression
  Multivariant Regression and Classification
  Data Discovery and Preparation
  Bad Data
  Training, Test, and Validation Data
  Good Data
  Preparing Data
  Questioning Your Data
  The Basics of Deep Learning
  The Perceptron Game
  Understanding How Networks Learn
  Backpropagation
  Optimization and Gradient Descent
  Vanishing or Exploding Gradients
  SGD and Batching Samples
  Batch Normalization and Regularization
  Activation Functions
  Loss Functions
  Building a Deep Learner
  Optimizing a Deep Learning Network
  Overfitting and Underfitting
  Network Capacity
  Conclusion
  Game Answers
2. AI on the Google Cloud Platform
  AI Services on GCP
  The AI Hub
  AI Platform
  AI Building Blocks
  Google Colab Notebooks
  Building a Regression Model with Colab
  AutoML Tables
  The Cloud Shell
  Managing Cloud Data
  Conclusion
3. Image Analysis and Recognition on the Cloud
  Deep Learning with Images
  Enter Convolution Neural Networks
  Image Classification
  Set Up and Load Data
  Inspecting Image Data
  Channels and CNN
  Building the Model
  Training the AI Fashionista to Discern Fashions
  Improving Fashionista AI 2.0
  Transfer Learning Images
  Identifying Cats or Dogs
  Transfer Learning a Keras Application Model
  Training Transfer Learning
  Retraining a Better Base Model
  Object Detection and the Object Detection Hub API
  YOLO for Object Detection
  Generating Images with GANs
  Conclusion
4. Understanding Language on the Cloud
  Natural Language Processing, with Embeddings
  Understanding One-Hot Encoding
  Vocabulary and Bag-of-Words
  Word Embeddings
  Understanding and Visualizing Embeddings
  Recurrent Networks for NLP
  Recurrent Networks for Memory
  Classifying Movie Reviews
  RNN Variations
  Neural Translation and the Translation API
  Sequence-to-Sequence Learning
  Translation API
  AutoML Translation
  Natural Language API
  BERT: Bidirectional Encoder Representations from Transformers
  Semantic Analysis with BERT
  Document Matching with BERT
  BERT for General Text Analysis
  Conclusion
5. Chatbots and Conversational AI
  Building Chatbots with Python
  Developing Goal-Oriented Chatbots with Dialogflow
  Building Text Transformers
  Loading and Preparing Data
  Understanding Attention
  Masking and the Transformer
  Encoding and Decoding the Sequence
  Training Conversational Chatbots
  Compiling and Training the Model
  Evaluation and Prediction
  Using Transformer for Conversational Chatbots
  Conclusion
6. Video Analysis on the Cloud
  Downloading Video with Python
  Video AI and Video Indexing
  Building a Webcam Face Detector
  Understanding Face Embeddings
  Recognizing Actions with TF Hub
  Exploring the Video Intelligence API
  Conclusion
7. Generators in the Cloud
  Unsupervised Learning with Autoencoders
  Mapping the Latent Space with VAE
  Generative Adversarial Network
  Exploring the World of Generators
  A Path for Exploring GANs
  Translating Images with Pix2Pix and CycleGAN
  Attention and the Self-Attention GAN
  Understanding Self-Attention
  Self-Attention for Image Colorization—DeOldify
  Conclusion
8. Building AI Assistants in the Cloud
  Needing Smarter Agents
  Introducing Reinforcement Learning
  Multiarm Bandits and a Single State
  Adding Quality and Q Learning
  Exploration Versus Exploitation
  Understanding Temporal Difference Learning
  Building an Example Agent with Expected SARSA
  Using SARSA to Drive a Taxi
  Learning State Hierarchies with Hierarchical Reinforcement Learning
  Bringing Deep to Reinforcement Learning
  Deep Q Learning
  Optimizing Policy with Policy Gradient Methods
  Conclusion
9. Putting AI Assistants to Work
  Designing an Eat/No Eat AI
  Selecting and Preparing Data for the AI
  Training the Nutritionist Model
  Optimizing Deep Reinforcement Learning
  Building the Eat/No Eat Agent
  Testing the AI Agent
  Commercializing the AI Agent
  Identifying App/AI Issues
  Involving Users and Progressing Development
  Future Considerations
  Conclusion

10. Commercializing AI
  The Ethics of Commercializing AI
  Packaging Up the Eat/No Eat App
  Reviewing Options for Deployment
  Deploying to GitHub
  Deploying with Google Cloud Deploy
  Exploring the Future of Practical AI
  Conclusion
Index

Content preview from Practical AI on the Google Cloud Platform

Chapter 6. Video Analysis on the Cloud

With the advances in image analysis, it was only a matter of time before the same techniques were applied to video. If we ignore the audio, video is for the most part just a stack of pictures, arranged in a sequence that describes some order of events or context. As we learned from NLP, context can matter, and in video it certainly does.
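To make the "stack of pictures" idea concrete, here is a minimal sketch in plain Python. It is not code from the book: the synthetic clip and the helper names `make_clip`, `bright_column`, and `horizontal_motion` are assumptions invented for illustration. Each frame is a small grayscale grid, and the only thing that reveals the direction of motion is the order of the frames.

```python
from typing import List

Frame = List[List[int]]  # one grayscale image: rows of pixel intensities


def make_clip(num_frames: int = 5, height: int = 3, width: int = 6) -> List[Frame]:
    """Build a synthetic clip (hypothetical data): one bright pixel moving left to right."""
    clip = []
    for t in range(num_frames):
        frame = [[0] * width for _ in range(height)]
        frame[height // 2][t] = 255  # the "object" sits in column t at time t
        clip.append(frame)
    return clip


def bright_column(frame: Frame) -> float:
    """Mean column index of the non-zero pixels in a single frame."""
    cols = [x for row in frame for x, value in enumerate(row) if value > 0]
    return sum(cols) / len(cols)  # assumes the frame has at least one bright pixel


def horizontal_motion(clip: List[Frame]) -> str:
    """Classify motion by comparing the object's position in the first and last frames."""
    shift = bright_column(clip[-1]) - bright_column(clip[0])
    if shift > 0:
        return "moving right"
    if shift < 0:
        return "moving left"
    return "static"


clip = make_clip()
forward = horizontal_motion(clip)         # frames in their recorded order
backward = horizontal_motion(clip[::-1])  # same frames, reversed order
print(forward, "|", backward)
```

The two calls see exactly the same set of pictures, yet they give opposite answers, because the sequence carries information that no single frame holds.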

In this chapter we look at the process of analyzing video for a variety of applications, ranging from video indexing, which captures or tags the content in videos, to using the event sequence itself to recognize activity. Video indexing, while just an extension of image analysis, has wide-ranging use cases, from security to streaming. Indexing video is like asking the question “Who are they?” while identifying motion or action in video can answer the question “What are they doing?”
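One way to picture video indexing is as a lookup table from a label to the time spans where that label appears. The sketch below is an illustration under stated assumptions, not the book's implementation: `build_index` is a hypothetical helper, and the per-frame label sets stand in for the output of whatever detector tags each frame.

```python
from collections import defaultdict
from typing import Dict, List, Set, Tuple

Span = Tuple[float, float]  # (start, end) in seconds


def build_index(frame_labels: List[Set[str]], fps: float) -> Dict[str, List[Span]]:
    """Map each label to the time spans where it appears in consecutive frames."""
    index = defaultdict(list)
    open_spans = {}  # label -> frame number where its current run began
    # An empty sentinel frame at the end closes any run still open.
    for i, labels in enumerate(frame_labels + [set()]):
        for label in list(open_spans):
            if label not in labels:
                start = open_spans.pop(label)
                index[label].append((start / fps, i / fps))
        for label in labels:
            open_spans.setdefault(label, i)
    return dict(index)


# Hypothetical detector output: one set of labels per frame, at 1 frame per second.
frame_labels = [{"alice"}, {"alice", "bob"}, {"bob"}, set(), {"alice"}]
index = build_index(frame_labels, fps=1.0)
print(index)
```

With an index like this, answering "when is this person on screen?" becomes a dictionary lookup rather than a scan of every frame.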

We will first look at how to load and analyze video with Python on Colab. Then we look at applications of AI to video, in particular the task of automatic video indexing. From video indexing, we will move on to using a webcam to detect faces. Finally, we finish with a practical example in which we use a TF Hub human-motion detector to identify human activity in videos.

The following is a list of the main topics we will cover in this chapter:

Downloading Video with Python
Video AI and Video Indexing
Building a Webcam Face Detector
Recognizing Actions with ...


Publisher Resources

ISBN: 9781492075806
Errata Page
