book

Python Machine Learning Cookbook

Name: Python Machine Learning Cookbook
ISBN: 9781786464477

by Prateek Joshi, Vahid Mirjalili

June 2016

Beginner to intermediate

304 pages

6h 24m

English

Packt Publishing

Read now

Unlock full access

Python Machine Learning Cookbook
Table of Contents
Python Machine Learning Cookbook
Credits
About the Author
About the Reviewer
www.PacktPub.com
eBooks, discount offers, and moreWhy Subscribe?
Preface
What this book covers
What you need for this book
Who this book is for

Sections
Getting readyHow to do it…How it works…There's more…See also
Conventions
Reader feedback
Customer support
Downloading the example codeDownloading the color images of this bookErrataPiracyQuestions
1. The Realm of Supervised Learning
Introduction
Preprocessing data using different techniques
Getting readyHow to do it…Mean removalScalingNormalizationBinarizationOne Hot Encoding
Label encoding
How to do it…
Building a linear regressor
Getting readyHow to do it…
Computing regression accuracy
Getting readyHow to do it…
Achieving model persistence
How to do it…
Building a ridge regressor
Getting readyHow to do it…
Building a polynomial regressor
Getting readyHow to do it…
Estimating housing prices
Getting readyHow to do it…
Computing the relative importance of features
How to do it…
Estimating bicycle demand distribution
Getting readyHow to do it…There's more…
2. Constructing a Classifier
Introduction
Building a simple classifier
How to do it…There's more…
Building a logistic regression classifier
How to do it…
Building a Naive Bayes classifier
How to do it…
Splitting the dataset for training and testing
How to do it…
Evaluating the accuracy using cross-validation
Getting ready…How to do it…
Visualizing the confusion matrix
How to do it…
Extracting the performance report
How to do it…
Evaluating cars based on their characteristics
Getting readyHow to do it…
Extracting validation curves
How to do it…
Extracting learning curves
How to do it…
Estimating the income bracket
How to do it…
3. Predictive Modeling
Introduction
Building a linear classifier using Support Vector Machine (SVMs)
Getting readyHow to do it…
Building a nonlinear classifier using SVMs
How to do it…
Tackling class imbalance
How to do it…
Extracting confidence measurements
How to do it…
Finding optimal hyperparameters
How to do it…
Building an event predictor
Getting readyHow to do it…
Estimating traffic
Getting readyHow to do it…
4. Clustering with Unsupervised Learning
Introduction
Clustering data using the k-means algorithm
How to do it…
Compressing an image using vector quantization
How to do it…
Building a Mean Shift clustering model
How to do it…
Grouping data using agglomerative clustering
How to do it…
Evaluating the performance of clustering algorithms
How to do it…
Automatically estimating the number of clusters using DBSCAN algorithm
How to do it…
Finding patterns in stock market data
How to do it…
Building a customer segmentation model
How to do it…
5. Building Recommendation Engines
Introduction
Building function compositions for data processing
How to do it…
Building machine learning pipelines
How to do it…How it works…
Finding the nearest neighbors
How to do it…
Constructing a k-nearest neighbors classifier
How to do it…How it works…
Constructing a k-nearest neighbors regressor
How to do it…How it works…
Computing the Euclidean distance score
How to do it…
Computing the Pearson correlation score
How to do it…
Finding similar users in the dataset
How to do it…
Generating movie recommendations
How to do it…
6. Analyzing Text Data
Introduction
Preprocessing data using tokenization
How to do it…
Stemming text data
How to do it…How it works…
Converting text to its base form using lemmatization
How to do it…
Dividing text using chunking
How to do it…
Building a bag-of-words model
How to do it…How it works…
Building a text classifier
How to do it…How it works…
Identifying the gender
How to do it…
Analyzing the sentiment of a sentence
How to do it…How it works…
Identifying patterns in text using topic modeling
How to do it…How it works…
7. Speech Recognition
Introduction
Reading and plotting audio data
How to do it…
Transforming audio signals into the frequency domain
How to do it…
Generating audio signals with custom parameters
How to do it…
Synthesizing music
How to do it…
Extracting frequency domain features
How to do it…
Building Hidden Markov Models
How to do it…
Building a speech recognizer
How to do it…
8. Dissecting Time Series and Sequential Data
Introduction
Transforming data into the time series format
How to do it…
Slicing time series data
How to do it…
Operating on time series data
How to do it…
Extracting statistics from time series data
How to do it…
Building Hidden Markov Models for sequential data
Getting readyHow to do it…
Building Conditional Random Fields for sequential text data
Getting readyHow to do it…
Analyzing stock market data using Hidden Markov Models
How to do it…
9. Image Content Analysis
Introduction
Operating on images using OpenCV-Python
How to do it…
Detecting edges
How to do it…
Histogram equalization
How to do it…
Detecting corners
How to do it…
Detecting SIFT feature points
How to do it…
Building a Star feature detector
How to do it…
Creating features using visual codebook and vector quantization
How to do it…
Training an image classifier using Extremely Random Forests
How to do it…
Building an object recognizer
How to do it…
10. Biometric Face Recognition
Introduction
Capturing and processing video from a webcam
How to do it…
Building a face detector using Haar cascades
How to do it…
Building eye and nose detectors
How to do it…
Performing Principal Components Analysis
How to do it…
Performing Kernel Principal Components Analysis
How to do it…
Performing blind source separation
How to do it…
Building a face recognizer using Local Binary Patterns Histogram
How to do it…
11. Deep Neural Networks
Introduction
Building a perceptron
How to do it…
Building a single layer neural network
How to do it…
Building a deep neural network
How to do it…
Creating a vector quantizer
How to do it…
Building a recurrent neural network for sequential data analysis
How to do it…
Visualizing the characters in an optical character recognition database
How to do it…
Building an optical character recognizer using neural networks
How to do it…
12. Visualizing Data
Introduction
Plotting 3D scatter plots
How to do it…
Plotting bubble plots
How to do it…
Animating bubble plots
How to do it…
Drawing pie charts
How to do it…
Plotting date-formatted time series data
How to do it…
Plotting histograms
How to do it…
Visualizing heat maps
How to do it…
Animating dynamic signals
How to do it…
Index

Content preview from Python Machine Learning Cookbook

Estimating bicycle demand distribution

Let's use a different regression method to solve the bicycle demand distribution problem. We will use the random forest regressor to estimate the output values. A random forest is a collection of decision trees. This basically uses a set of decision trees that are built using various subsets of the dataset, and then it uses averaging to improve the overall performance.

Getting ready

We will use the bike_day.csv file that is provided to you. This is also available at https://archive.ics.uci.edu/ml/datasets/Bike+Sharing+Dataset. There are 16 columns in this dataset. The first two columns correspond to the serial number and the actual date, so we won't use them for our analysis. The last three columns correspond ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

O’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.

Julian F.

Head of Cybersecurity

I wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.

Addison B.

Field Engineer

I’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.

Amir M.

Data Platform Tech Lead

I'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.

Mark W.

Embedded Software Engineer

Python Machine Learning Cookbook - Second Edition

Publisher Resources

ISBN: 9781786464477

Cloud Computing

Data Engineering

Data Science

AI & ML

Programming Languages

Software Architecture

IT/Ops

Security

Design

Business

Soft Skills

Python Machine Learning Cookbook

by Prateek Joshi, Vahid Mirjalili

Estimating bicycle demand distribution

Getting ready

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.