book

Advanced Machine Learning with Python

Name: Advanced Machine Learning with Python
Author: John Hearty
ISBN: 9781784398637

by John Hearty

July 2016

Intermediate to advanced

278 pages

6h 48m

English

Packt Publishing

Read now

Unlock full access

Advanced Machine Learning with Python
Table of Contents
Advanced Machine Learning with Python
Credits
About the Author
About the Reviewers
www.PacktPub.com
eBooks, discount offers, and moreWhy subscribe?
Preface
What is advanced machine learning?
What should you expect from this book?
What this book covers

What you need for this book
Who this book is for
Conventions
Reader feedback
Customer support
Downloading the example codeDownloading the color images of this bookErrataPiracyQuestions
1. Unsupervised Machine Learning
Principal component analysisPCA – a primerEmploying PCA
Introducing k-means clustering
Clustering – a primerKick-starting clustering analysisTuning your clustering configurations
Self-organizing maps
SOM – a primerEmploying SOM
Further reading
Summary
2. Deep Belief Networks
Neural networks – a primerThe composition of a neural networkNetwork topologies
Restricted Boltzmann Machine
Introducing the RBMTopologyTrainingApplications of the RBMFurther applications of the RBM
Deep belief networks
Training a DBNApplying the DBNValidating the DBN
Further reading
Summary
3. Stacked Denoising Autoencoders
AutoencodersIntroducing the autoencoderTopologyTrainingDenoising autoencodersApplying a dA
Stacked Denoising Autoencoders
Applying the SdAAssessing SdA performance
Further reading
Summary
4. Convolutional Neural Networks
Introducing the CNNUnderstanding the convnet topologyUnderstanding convolution layersUnderstanding pooling layersTraining a convnetPutting it all togetherApplying a CNN
Further Reading
Summary
5. Semi-Supervised Learning
Introduction
Understanding semi-supervised learning
Semi-supervised algorithms in action
Self-trainingImplementing self-trainingFinessing your self-training implementationImproving the selection processContrastive Pessimistic Likelihood Estimation
Further reading
Summary
6. Text Feature Engineering
Introduction
Text feature engineering
Cleaning text dataText cleaning with BeautifulSoupManaging punctuation and tokenizingTagging and categorising wordsTagging with NLTKSequential taggingBackoff taggingCreating features from text dataStemmingBagging and random forestsTesting our prepared data
Further reading
Summary
7. Feature Engineering Part II
Introduction
Creating a feature set
Engineering features for ML applicationsUsing rescaling techniques to improve the learnability of featuresCreating effective derived variablesReinterpreting non-numeric featuresUsing feature selection techniquesPerforming feature selectionCorrelationLASSORecursive Feature EliminationGenetic models
Feature engineering in practice
Acquiring data via RESTful APIsTesting the performance of our modelTwitterTranslink TwitterConsumer commentsThe Bing Traffic APIDeriving and selecting variables using feature engineering techniquesThe weather API
Further reading
Summary
8. Ensemble Methods
Introducing ensemblesUnderstanding averaging ensemblesUsing bagging algorithmsUsing random forestsApplying boosting methodsUsing XGBoostUsing stacking ensemblesApplying ensembles in practice
Using models in dynamic applications
Understanding model robustnessIdentifying modeling risk factorsStrategies to managing model robustness
Further reading
Summary
9. Additional Python Machine Learning Tools
Alternative development toolsIntroduction to LasagneGetting to know LasagneIntroduction to TensorFlowGetting to know TensorFlowUsing TensorFlow to iteratively improve our modelsKnowing when to use these libraries
Further reading
Summary
A. Chapter Code Requirements
Index

Content preview from Advanced Machine Learning with Python

Summary

In this chapter, we've reviewed three techniques with a broad range of applications for preprocessing and dimensionality reduction. In doing so, you learned a lot about an unfamiliar dataset.

We started out by applying PCA, a widely-utilized dimensionality reduction technique, to help us understand and visualize a high-dimensional dataset. We then followed up by clustering the data using k-means clustering, identifying means of improving and measuring our k-means analysis through performance metrics, the elbow method, and cross-validation. We found that k-means on the digits dataset, taken as is, didn't deliver exceptional results. This was due to class overlap that we spotted through PCA. We overcame this weakness by applying PCA as a ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

O’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.

Julian F.

Head of Cybersecurity

I wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.

Addison B.

Field Engineer

I’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.

Amir M.

Data Platform Tech Lead

I'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.

Mark W.

Embedded Software Engineer

Machine Learning for Time-Series with Python

Publisher Resources

ISBN: 9781784398637

Cloud Computing

Data Engineering

Data Science

AI & ML

Programming Languages

Software Architecture

IT/Ops

Security

Design

Business

Soft Skills

Advanced Machine Learning with Python

by John Hearty

Summary

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.