Video description
Machine Learning is one of the fundamental skills you need to become a data scientist. It’s the steppingstone that will help you understand deep learning and modern data analysis techniques.
In this course, you’ll explore the three fundamental machine learning topics - linear regression, logistic regression, and cluster analysis. Even neural networks geeks (like us) can’t help but admit that it’s these three simple methods that data science revolves around. So, in this course, we will make the otherwise complex subject matter easy to understand and apply in practice. This course supports statistics theory with practical application of these quantitative methods in Python to help you develop skills in the context of data science.
We’ve developed this course with not one but two machine learning libraries: StatsModels and sklearn. You’ll be eager to complete this course and get ready to become a successful data scientist!
What You Will Learn
- Confidently work with two of the leading ML packages: statsmodels and sklearn
- Understand how to perform a linear regression
- Become familiar with the ins and outs of logistic regression
- Get to grips with carrying out cluster analysis (both flat and hierarchical)
- Apply your skills to real-life business cases
- Get insights into the underlying ideas behind ML models
Audience
If you want to get acquainted with fundamental machine learning methods, become a successful data scientist, or just get started with building valuable skills in machine learning and data science, this course is for you.
About The Author
365 Careers Ltd.: 365 Careers’ courses have been taken by more than 203,000 students in 204 countries. People working at world-class firms such as Apple, PayPal, and Citibank have completed 365 Careers trainings. By choosing 365 Careers, you make sure you will learn from proven experts who have a passion for teaching, and can take you from beginner to pro in the shortest possible amount of time.
If you want to become a financial analyst, a finance manager, an FP&A analyst, an investment banker, a business executive, an entrepreneur, a business intelligence analyst, a data analyst, or a data scientist, 365 Careers’ courses are the perfect place to start.
Table of contents
- Chapter 1 : Introduction
- Chapter 2 : Setting Up the Working Environment
-
Chapter 3 : Linear Regression with StatsModels
- Introduction to Regression Analysis
- The Linear Regression Model
- Correlation vs Regression
- Geometrical Representation
- Python Packages Installation
- Simple Linear Regression in Python
- What is Seaborn?
- What Does the StatsModels Summary Regression Table Tell us?
- SST, SSR, and SSE
- The Ordinary Least Squares (OLS)
- Goodness of Fit: The R-Squared
- The Multiple Linear Regression Model
- Adjusted R-Squared
- F-Statistic and F-Test for a Linear Regression
- Assumptions of the OLS Framework
- A1: Linearity
- A2: No Endogeneity
- A3: Normality and Homoscedasticity
- A4: No Autocorrelation
- A5: No Multicollinearity
- Dealing with Categorical Data
- Making Predictions
-
Chapter 4 : Linear Regression with Sklearn
- What is sklearn?
- Game Plan for sklearn
- Simple Linear Regression with sklearn
- Simple Linear Regression with sklearn - Summary Table
- Multiple Linear Regression with sklearn
- Adjusted R-Squared
- Feature Selection through p-values (F-regression)
- Creating a Summary Table with the p-values
- Feature Scaling
- Feature Selection through Standardization
- Making Predictions with Standardized Coefficients
- Underfitting and Overfitting
- Training and Testing
- Chapter 5 : Linear Regression - Practical Example
-
Chapter 6 : Logistic Regression
- Introduction to Logistic Regression
- A Simple Example of a Logistic Regression in Python
- What is the Difference Between a Logistic and a Logit Function?
- Your First Logistic Regression
- A Coding Tip (optional)
- Going through the Regression Summary Table
- Interpreting the Odds Ratio
- Dummies in a Logistic Regression
- Assessing the Accuracy of a Classification Model
- Underfitting and Overfitting
- Testing our Model and Bulding a Confusion Matrix
-
Chapter 7 : Cluster Analysis
- Introduction to Cluster Analysis
- Examples of Clustering
- Classification vs Clustering
- Math Concepts Needed to Proceed
- K-Means Clustering
- A Hands-on Example of K-Means
- Categorical Data in Cluster Analysis
- The Elbow Method or How to Choose the Number of Clusters
- Pros and Cons of K-Means
- Standardization of Features when Clustering
- Cluster Analysis and Regression Analysis
- Practical Example: Market Segmentation (Part 1)
- Practical Example: Market Segmentation (Part 2)
- What Can be Done with Cluster Analysis?
- Chapter 8 : Cluster Analysis: Additional Topics
Product information
- Title: Machine Learning 101 with Scikit-learn and StatsModels
- Author(s):
- Release date: July 2019
- Publisher(s): Packt Publishing
- ISBN: 9781838987671
You might also like
video
Advanced Machine Learning with scikit-learn
In this Advanced Machine Learning with scikit-learn training course, expert author Andreas Mueller will teach you …
video
Machine Learning Projects with TensorFlow 2.0
TensorFlow is the world’s most widely adopted framework for Machine Learning and Deep Learning. TensorFlow 2.0 …
video
Practical Machine Learning with TensorFlow 2.0 and Scikit-Learn
Have you been looking for a course that teaches you effective machine learning in scikit-learn and …
video
Programming Foundations of Classification and Regression LiveLessons (Machine Learning with Python for Everyone Series), Part 1
4+ Hours of Video Instruction Code-along sessions move you from introductory machine learning concepts to concrete …