Machine Learning 101 with Scikit-learn and StatsModels

Video description

Machine Learning is one of the fundamental skills you need to become a data scientist. It’s the steppingstone that will help you understand deep learning and modern data analysis techniques.

In this course, you’ll explore the three fundamental machine learning topics - linear regression, logistic regression, and cluster analysis. Even neural networks geeks (like us) can’t help but admit that it’s these three simple methods that data science revolves around. So, in this course, we will make the otherwise complex subject matter easy to understand and apply in practice. This course supports statistics theory with practical application of these quantitative methods in Python to help you develop skills in the context of data science.

We’ve developed this course with not one but two machine learning libraries: StatsModels and sklearn. You’ll be eager to complete this course and get ready to become a successful data scientist!

What You Will Learn

  • Confidently work with two of the leading ML packages: statsmodels and sklearn
  • Understand how to perform a linear regression
  • Become familiar with the ins and outs of logistic regression
  • Get to grips with carrying out cluster analysis (both flat and hierarchical)
  • Apply your skills to real-life business cases
  • Get insights into the underlying ideas behind ML models

Audience

If you want to get acquainted with fundamental machine learning methods, become a successful data scientist, or just get started with building valuable skills in machine learning and data science, this course is for you.

About The Author

365 Careers Ltd.: 365 Careers’ courses have been taken by more than 203,000 students in 204 countries. People working at world-class firms such as Apple, PayPal, and Citibank have completed 365 Careers trainings. By choosing 365 Careers, you make sure you will learn from proven experts who have a passion for teaching, and can take you from beginner to pro in the shortest possible amount of time.

If you want to become a financial analyst, a finance manager, an FP&A analyst, an investment banker, a business executive, an entrepreneur, a business intelligence analyst, a data analyst, or a data scientist, 365 Careers’ courses are the perfect place to start.

Table of contents

  1. Chapter 1 : Introduction
    1. What Does the Course Cover?
  2. Chapter 2 : Setting Up the Working Environment
    1. Setting Up the Environment - An Introduction (Do Not Skip, Please)!
    2. Why Python and Why Jupyter?
    3. Installing Anaconda
    4. The Jupyter Dashboard - Part 1
    5. The Jupyter Dashboard - Part 2
    6. Installing sklearn
  3. Chapter 3 : Linear Regression with StatsModels
    1. Introduction to Regression Analysis
    2. The Linear Regression Model
    3. Correlation vs Regression
    4. Geometrical Representation
    5. Python Packages Installation
    6. Simple Linear Regression in Python
    7. What is Seaborn?
    8. What Does the StatsModels Summary Regression Table Tell us?
    9. SST, SSR, and SSE
    10. The Ordinary Least Squares (OLS)
    11. Goodness of Fit: The R-Squared
    12. The Multiple Linear Regression Model
    13. Adjusted R-Squared
    14. F-Statistic and F-Test for a Linear Regression
    15. Assumptions of the OLS Framework
    16. A1: Linearity
    17. A2: No Endogeneity
    18. A3: Normality and Homoscedasticity
    19. A4: No Autocorrelation
    20. A5: No Multicollinearity
    21. Dealing with Categorical Data
    22. Making Predictions
  4. Chapter 4 : Linear Regression with Sklearn
    1. What is sklearn?
    2. Game Plan for sklearn
    3. Simple Linear Regression with sklearn
    4. Simple Linear Regression with sklearn - Summary Table
    5. Multiple Linear Regression with sklearn
    6. Adjusted R-Squared
    7. Feature Selection through p-values (F-regression)
    8. Creating a Summary Table with the p-values
    9. Feature Scaling
    10. Feature Selection through Standardization
    11. Making Predictions with Standardized Coefficients
    12. Underfitting and Overfitting
    13. Training and Testing
  5. Chapter 5 : Linear Regression - Practical Example
    1. Practical Example (Part 1)
    2. Practical Example (Part 2)
    3. Practical Example (Part 3)
    4. Practical Example (Part 4)
    5. Practical Example (Part 5)
  6. Chapter 6 : Logistic Regression
    1. Introduction to Logistic Regression
    2. A Simple Example of a Logistic Regression in Python
    3. What is the Difference Between a Logistic and a Logit Function?
    4. Your First Logistic Regression
    5. A Coding Tip (optional)
    6. Going through the Regression Summary Table
    7. Interpreting the Odds Ratio
    8. Dummies in a Logistic Regression
    9. Assessing the Accuracy of a Classification Model
    10. Underfitting and Overfitting
    11. Testing our Model and Bulding a Confusion Matrix
  7. Chapter 7 : Cluster Analysis
    1. Introduction to Cluster Analysis
    2. Examples of Clustering
    3. Classification vs Clustering
    4. Math Concepts Needed to Proceed
    5. K-Means Clustering
    6. A Hands-on Example of K-Means
    7. Categorical Data in Cluster Analysis
    8. The Elbow Method or How to Choose the Number of Clusters
    9. Pros and Cons of K-Means
    10. Standardization of Features when Clustering
    11. Cluster Analysis and Regression Analysis
    12. Practical Example: Market Segmentation (Part 1)
    13. Practical Example: Market Segmentation (Part 2)
    14. What Can be Done with Cluster Analysis?
  8. Chapter 8 : Cluster Analysis: Additional Topics
    1. Other Types of Clustering
    2. The Dendrogram
    3. Heatmaps

Product information

  • Title: Machine Learning 101 with Scikit-learn and StatsModels
  • Author(s): 365 Careers
  • Release date: July 2019
  • Publisher(s): Packt Publishing
  • ISBN: 9781838987671