Predictive Analytics for the Modern Enterprise

Name: Predictive Analytics for the Modern Enterprise
Author: Nooruddin Abbas Ali
ISBN: 9781098136864

by Nooruddin Abbas Ali

May 2024

Beginner to intermediate

360 pages

9h 2m

English

O'Reilly Media, Inc.

Audio summary available

Includes

Quizzes

Preface
    Who Is This Book For?
    How This Book Is Organized
    Conventions Used in This Book
    Using Code Examples
    O’Reilly Online Learning
    How to Contact Us
    Acknowledgments
1. Data Analytics in the Modern Enterprise
    The Evolution of Data Analytics
    Different Types of Data Analytics
    Descriptive Analytics
    Diagnostic Analytics
    Predictive Analytics
    Prescriptive Analytics
    Knowledge Acquisition, Machine Learning, and the Role of Predictive Analytics
    Tools, Frameworks, and Platforms in the Predictive Analytics World
    Languages and Libraries
    Services
    Conclusion
2. Predictive Analytics: An Operational Necessity
    The Move from “Data Producing” to “Data Driven”
    Challenges to Using Predictive Analytics
    People
    Data
    Technology
    Vertical Industry Use Cases for Predictive Analytics
    Finance
    Healthcare
    Automotive
    Entertainment
    Conclusion
3. The Mathematics and Algorithms Behind Predictive Analytics
    Statistics and Linear Algebra
    Regression
    What Is Regression Analysis?
    Regression Techniques
    R-squared and P-value
    Selecting a Regression Model
    Decision Trees
    Training Decision Trees
    Using Decision Trees to Solve Regression Problems: Regression Trees
    Tuning Decision Trees
    Other Algorithms
    Random Forests
    Neural Networks
    Support Vector Machines
    Naive Bayes Classifier
    Other Learning Patterns in Machine Learning
    Conclusion
4. Working with Data
    Understanding Data
    Data Preprocessing and Feature Engineering
    Handling Missing Data
    Categorical Data Encoding
    Data Transformation
    Outlier Management
    Handling Imbalanced Data
    Combining Data
    Feature Selection
    Splitting Preprocessed Data
    Understanding Bias
    The Predictive Analytics Pipeline
    The Data Stage
    The Model Stage
    The Serving Stage
    Other Components
    Selecting the Right Model
    Conclusion
5. Python and scikit-learn for Predictive Analytics
    Anaconda and Jupyter Notebooks
    NumPy in Python
    Introduction to NumPy
    Generating Arrays
    Array Slicing
    Array Transformation
    Other Array Operations
    Exploring a Business Example Using Pandas
    Pandas in Python
    Import and View Data
    Visualize the Data
    Data Cleaning and Modification
    Reading from Different Data Sources
    Data Filtering and Grouping
    Scikit-learn
    Training and Predicting with a Linear Regression Model
    Using a Random Forest Classifier
    Training a Decision Tree
    A Clustering Example (Unsupervised Learning)
    Conclusion
6. TensorFlow and Keras for Predictive Analytics
    TensorFlow Fundamentals
    Linear Regression Using TensorFlow
    Data Preparation
    Model Creation and Training
    Predictions and Model Evaluation
    Deep Neural Networks in TensorFlow
    Conclusion
7. Predictive Analytics for Business Problem-Solving
    Prediction-Based Optimal Retail Price Recommendations
    Using a Simple Linear Regression Model
    Using a Polynomial Regression Model
    Using Multivariate Regression
    An Introduction to Recommender Systems
    Building Recommender Systems Using surprise scikit in Python
    Credit Card Fraud Classification
    Credit Card Fraud Baseline Analysis Using Artificial Neural Networks
    Credit Card Fraud Weighted Analysis Using Artificial Neural Networks
    Credit Card Analysis with Multiple Hidden Layers in the Artificial Neural Network
    Conclusion
8. Exploring AWS Cloud Provider Services for AI/ML
    To Cloud or Not to Cloud
    Exploring AWS SageMaker
    Prerequisites
    Data Ingest and Exploration
    Data Transformation
    Model Training and Prediction
    Cleanup
    Exploring Amazon Forecast
    Import Data
    Train the Predictor
    Create a Forecast
    What-If Analysis
    Cleanup
    Conclusion
9. Food for Thought
    A Few More Use Cases
    Navigation and Traffic Management
    Credit Scoring
    The Social Impact of Predictions
    Conclusion

Index
About the Author

Content preview from Predictive Analytics for the Modern Enterprise

Chapter 5. Python and scikit-learn for Predictive Analytics

We started our journey with a brief history of data analytics. We discussed the importance of predictive analytics in the modern enterprise, and we covered some industry use cases to appreciate the real-world implications of its implementation. We then took a moderately deep dive into the statistics and mathematics behind different predictive analytics algorithms (if you are a diver, think of it as a 10-meter dive rather than a 100-meter deep-sea exploration). I am a big proponent of strong foundations. I believe that once you have a firm grasp of the fundamentals, you can learn and understand the details much more easily, even as those details evolve over time. Now that we have established the analytics foundation, in this chapter we will get our hands dirty with some actual predictions.

Anaconda and Jupyter Notebooks

This is a hands-on chapter. If you are a data science professional or student, the content should be familiar to you. However, even if you are new to data science, the material and sample code should be clear enough for you to understand, so long as you have a basic grasp of computer programming.

We’ll need a few prerequisites in place before we begin. These are shown in Table 5-1.

Table 5-1. Prerequisites
Serial #	Name	Description	Version used in this chapter	URL
1	Python	Python is a high-level programming language used heavily in data science.	V3.9.13	https://www.python.org
2	Anaconda ...

Publisher Resources

ISBN: 9781098136857
