book

Applied Machine Learning and AI for Engineers

Name: Applied Machine Learning and AI for Engineers
Author: Jeff Prosise
ISBN: 9781492098058

by Jeff Prosise

November 2022

Intermediate to advanced

425 pages

11h 25m

English

O'Reilly Media, Inc.

Read now

Unlock full access

Foreword
Preface
Who Should Read This BookWhy I Wrote This BookRunning the Book’s Code SamplesNavigating This BookConventions Used in This BookUsing Code ExamplesO’Reilly Online LearningHow to Contact UsAcknowledgments
I. Machine Learning with Scikit-Learn
1. Machine Learning
What Is Machine Learning?Machine Learning Versus Artificial IntelligenceSupervised Versus Unsupervised LearningUnsupervised Learning with k-Means ClusteringApplying k-Means Clustering to Customer DataSegmenting Customers Using More Than Two DimensionsSupervised Learningk-Nearest NeighborsUsing k-Nearest Neighbors to Classify FlowersSummary
2. Regression Models
Linear RegressionDecision TreesRandom ForestsGradient-Boosting MachinesSupport Vector MachinesAccuracy Measures for Regression ModelsUsing Regression to Predict Taxi FaresSummary
3. Classification Models
Logistic RegressionAccuracy Measures for Classification ModelsCategorical DataBinary ClassificationClassifying Passengers Who Sailed on the TitanicDetecting Credit Card FraudMulticlass ClassificationBuilding a Digit Recognition ModelSummary
4. Text Classification
Preparing Text for ClassificationSentiment AnalysisNaive BayesSpam FilteringRecommender SystemsCosine SimilarityBuilding a Movie Recommendation SystemSummary
5. Support Vector Machines
How Support Vector Machines WorkKernelsKernel TricksHyperparameter TuningData NormalizationPipeliningUsing SVMs for Facial RecognitionSummary
6. Principal Component Analysis
Understanding Principal Component AnalysisFiltering NoiseAnonymizing DataVisualizing High-Dimensional DataAnomaly DetectionUsing PCA to Detect Credit Card FraudUsing PCA to Predict Bearing FailureMultivariate Anomaly DetectionSummary
7. Operationalizing Machine Learning Models
Consuming a Python Model from a Python ClientVersioning Pickle FilesConsuming a Python Model from a C# ClientContainerizing a Machine Learning ModelUsing ONNX to Bridge the Language GapBuilding ML Models in C# with ML.NETSentiment Analysis with ML.NETSaving and Loading ML.NET ModelsAdding Machine Learning Capabilities to ExcelSummary

II. Deep Learning with Keras and TensorFlow
8. Deep Learning
Understanding Neural NetworksTraining Neural NetworksSummary
9. Neural Networks
Building Neural Networks with Keras and TensorFlowSizing a Neural NetworkUsing a Neural Network to Predict Taxi FaresBinary Classification with Neural NetworksMaking PredictionsTraining a Neural Network to Detect Credit Card FraudMulticlass Classification with Neural NetworksTraining a Neural Network to Recognize FacesDropoutSaving and Loading ModelsKeras CallbacksSummary
10. Image Classification with Convolutional Neural Networks
Understanding CNNsUsing Keras and TensorFlow to Build CNNsTraining a CNN to Recognize Arctic WildlifePretrained CNNsUsing ResNet50V2 to Classify ImagesTransfer LearningUsing Transfer Learning to Identify Arctic WildlifeData AugmentationImage Augmentation with ImageDataGeneratorImage Augmentation with Augmentation LayersApplying Image Augmentation to Arctic WildlifeGlobal PoolingAudio Classification with CNNsSummary
11. Face Detection and Recognition
Face DetectionFace Detection with Viola-JonesUsing the OpenCV Implementation of Viola-JonesFace Detection with Convolutional Neural NetworksExtracting Faces from PhotosFacial RecognitionApplying Transfer Learning to Facial RecognitionBoosting Transfer Learning with Task-Specific WeightsArcFacePutting It All Together: Detecting and Recognizing Faces in PhotosHandling Unknown Faces: Closed-Set Versus Open-Set ClassificationSummary
12. Object Detection
R-CNNsMask R-CNNYOLOYOLOv3 and KerasCustom Object DetectionTraining a Custom Object Detection Model with the Custom Vision ServiceUsing the Exported ModelSummary
13. Natural Language Processing
Text PreparationWord EmbeddingsText ClassificationAutomating Text VectorizationUsing TextVectorization in a Sentiment Analysis ModelFactoring Word Order into PredictionsRecurrent Neural Networks (RNNs)Using Pretrained Models to Classify TextNeural Machine TranslationLSTM Encoder-DecodersTransformer Encoder-DecodersBuilding a Transformer-Based NMT ModelUsing Pretrained Models to Translate TextBidirectional Encoder Representations from Transformers (BERT)Building a BERT-Based Question Answering SystemFine-Tuning BERT to Perform Sentiment AnalysisSummary
14. Azure Cognitive Services
Introducing Azure Cognitive ServicesKeys and EndpointsCalling Azure Cognitive Services APIsAzure Cognitive Services ContainersThe Computer Vision ServiceThe Language ServiceThe Translator ServiceThe Speech ServicePutting It All Together: Contoso TravelSummary
Index
About the Author

Content preview from Applied Machine Learning and AI for Engineers

Chapter 6. Principal Component Analysis

Principal component analysis, or PCA, is one of the minor miracles of machine learning. It’s a dimensionality reduction technique that reduces the number of dimensions in a dataset without sacrificing a commensurate amount of information. While that might seem underwhelming on the face of it, it has profound implications for engineers and software developers working to build predictive models from their data.

What if I told you that you could take a dataset with 1,000 columns, use PCA to reduce it to 100 columns, and retain 90% or more of the information in the original dataset? That’s relatively common, believe it or not. And it lends itself to a variety of practical uses, including:

Reducing high-dimensional data to two or three dimensions so that it can be plotted and explored
Reducing the number of dimensions in a dataset and then restoring the original number of dimensions, which finds application in anomaly detection and noise filtering
Anonymizing datasets so that they can be shared with others without revealing the nature or meaning of the data

And that’s not all. A side effect of applying PCA to a dataset is that less important features—columns of data that have less relevance to the outcome of a predictive model—are removed, while dependencies between columns is eliminated. And in datasets with a low ratio of samples (rows) to features (columns), PCA can be used to increase that ratio. As a rule of thumb, you typically ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

O’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.

Julian F.

Head of Cybersecurity

I wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.

Addison B.

Field Engineer

I’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.

Amir M.

Data Platform Tech Lead

I'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.

Mark W.

Embedded Software Engineer

Publisher Resources

ISBN: 9781492098041Errata Page

Cloud Computing

Data Engineering

Data Science

AI & ML

Programming Languages

Software Architecture

IT/Ops

Security

Design

Business

Soft Skills

Applied Machine Learning and AI for Engineers

by Jeff Prosise

Chapter 6. Principal Component Analysis

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.