Machine Learning with Regression in Python: With Ordinary Least Squares, Ridge, Decision Trees and Neural Networks
on-demand course


with Michael Keith
September 2020
Intermediate
44m
English
Apress
Closed Captioning available in English

Overview

In this video, you will learn regression techniques in Python using ordinary least squares, ridge, lasso, decision trees, and neural networks.

We start with a census dataset that captures a business's sales in counties across the United States. After a brief exploration of the data, we move on to model assumptions and feature engineering. We then fit a linear regression, a simple model that is easy to interpret, and work through progressively more complex models, ending with a neural network, to see which makes the best predictions on our dataset. To avoid overfitting, we split the dataset into training and test sets; to optimize predictions, we tune hyperparameters with k-fold cross-validation. We then take the model with the lowest error metrics on the test dataset and use it to make predictions on a new dataset. Using these predictions, we recommend to the company's shareholders, who want to expand the business, which counties to expand into next.
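The split, tune, and compare workflow described above can be sketched roughly as follows. This is a minimal illustration, not the course's notebook: it assumes scikit-learn is available, uses synthetic data in place of the census dataset, and compares only ordinary least squares and ridge. The variable names, the alpha grid, and the 5-fold setting are all assumptions made for the example.

```python
import numpy as np
from sklearn.linear_model import LinearRegression, Ridge
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import GridSearchCV, train_test_split

# Synthetic stand-in data (hypothetical; the course uses a county-level census dataset).
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 5))
y = X @ np.array([3.0, -2.0, 0.0, 1.5, 0.5]) + rng.normal(scale=0.5, size=200)

# Hold out a test set so the final comparison uses data no model was fit on.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0
)

# Baseline: ordinary least squares has no hyperparameters to tune.
ols = LinearRegression().fit(X_train, y_train)

# Ridge: choose the penalty strength alpha with 5-fold cross-validation
# on the training set only (the grid values are illustrative).
search = GridSearchCV(
    Ridge(),
    param_grid={"alpha": [0.01, 0.1, 1.0, 10.0]},
    cv=5,
    scoring="neg_mean_squared_error",
)
search.fit(X_train, y_train)

# Score each model on the held-out test set and keep the lowest RMSE.
test_rmse = {
    "ols": mean_squared_error(y_test, ols.predict(X_test)) ** 0.5,
    "ridge": mean_squared_error(y_test, search.predict(X_test)) ** 0.5,
}
best_model = min(test_rmse, key=test_rmse.get)
print(test_rmse, "best:", best_model)
```

The same pattern would extend to the other model families mentioned in the overview by adding an estimator and a parameter grid for each; the cross-validation is run on the training split so that the test split stays untouched until the final comparison.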

This modeling process is done in Python 3 in a Jupyter notebook, so it's a good idea to have Anaconda installed on your computer so that you can follow along. We structure the notebook so that it is easy to read for others on our team who may want to expand on our analysis.

What You Will Learn
  • Explore a dataset with Pandas 
  • Transform variables in a dataset to account for non-linearities and optimize predictions 
  • Tune model hyperparameters and score model performance to determine the best model for a given dataset 
  • Use statistical modeling to make recommendations to shareholders
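
As a rough illustration of the first two points, the sketch below explores a small pandas DataFrame and applies a log transform to a skewed predictor. The data is synthetic and the column names (population, sales) are hypothetical; the course's actual variables and transformations may differ.

```python
import numpy as np
import pandas as pd

# Hypothetical county-level data: sales grow with the log of population.
rng = np.random.default_rng(1)
population = rng.lognormal(mean=10, sigma=1, size=300)
sales = 50 * np.log(population) + rng.normal(scale=5, size=300)
df = pd.DataFrame({"population": population, "sales": sales})

# Quick exploration: summary statistics for each column.
print(df.describe())

# Log-transform the skewed predictor so its relationship with sales is closer to linear.
df["log_population"] = np.log(df["population"])

# Compare the linear correlation with sales before and after the transform.
raw_corr = df["population"].corr(df["sales"])
log_corr = df["log_population"].corr(df["sales"])
print(f"raw: {raw_corr:.3f}, log: {log_corr:.3f}")
```

In this synthetic setup the transformed column correlates more strongly with sales than the raw one, which is the kind of improvement a linear model can exploit.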

Who This Video Is For
Software professionals with knowledge of Python basics and data scientists looking to apply data science to industry. 


Publisher Resources

ISBN: 9781484265833