Data Science Prerequisites - Numpy, Matplotlib, and Pandas in Python

Video description

Welcome to the course where you will learn about the NumPy stack in Python, which is an important prerequisite for deep learning, machine learning, and data science.

In this self-paced course, you will learn how to use NumPy, Matplotlib, Pandas, and SciPy to perform critical tasks related to data science and machine learning. These tasks include performing numerical computation and representing data, visualizing data with plots, loading and manipulating data using DataFrames, working with statistics and probability, and building machine learning models for classification and regression.

We will start with NumPy. We will look at the benefits of NumPy arrays and then work through some more involved matrix operations, such as products, inverses, determinants, and solving linear systems.
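As a rough illustration of the matrix operations mentioned above, here is a minimal sketch using NumPy. The matrix `A` and vector `b` are made-up values chosen for this example and are not taken from the course material.

```python
import numpy as np

# Illustrative values only (assumed for this sketch, not from the course).
A = np.array([[2.0, 1.0],
              [1.0, 3.0]])
b = np.array([3.0, 5.0])

product = A @ A                    # matrix product
inverse = np.linalg.inv(A)         # matrix inverse
determinant = np.linalg.det(A)     # determinant
x = np.linalg.solve(A, b)          # solve the linear system A x = b

print(product)
print(inverse)
print(determinant)
print(x)
```

For solving a linear system, `np.linalg.solve` is generally preferred over multiplying by the explicit inverse, since it avoids forming the inverse at all.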

Then we will cover Matplotlib. In this section, we will go over some common plots, namely the line chart, scatter plot, and histogram. We will also look at how to show images using Matplotlib.
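The plot types listed above might look something like the following sketch. The data is randomly generated for illustration, and the non-interactive `Agg` backend is an assumption made so that the example runs without a display. In an interactive session you would typically call `plt.show()` instead.

```python
import matplotlib
matplotlib.use("Agg")  # assumption: headless environment, no display needed
import matplotlib.pyplot as plt
import numpy as np

rng = np.random.default_rng(0)  # synthetic data for illustration only
x = np.linspace(0, 2 * np.pi, 100)

fig, axes = plt.subplots(2, 2, figsize=(8, 6))

axes[0, 0].plot(x, np.sin(x))                        # line chart
axes[0, 0].set_title("Line chart")

axes[0, 1].scatter(rng.normal(size=200),
                   rng.normal(size=200), s=10)       # scatter plot
axes[0, 1].set_title("Scatter plot")

axes[1, 0].hist(rng.normal(size=1000), bins=30)      # histogram
axes[1, 0].set_title("Histogram")

axes[1, 1].imshow(rng.random((16, 16)), cmap="gray") # image display
axes[1, 1].set_title("Image")

fig.tight_layout()
```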

Next, we will talk about Pandas. We will look at how much easier it is to load a dataset using Pandas than to do it manually. Then we will look at some DataFrame operations useful in machine learning, such as filtering by column, filtering by row, and the apply function.
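To give a feel for these DataFrame operations, here is a small sketch. The inline CSV text, column names, and the derived `income_k` column are all hypothetical and exist only to keep the example self-contained. In practice you would pass a file path to `pd.read_csv`.

```python
import io
import pandas as pd

# Hypothetical dataset, inlined so the example needs no external file.
csv_text = """name,age,income
Ana,34,52000
Ben,27,48000
Cy,45,91000
"""

df = pd.read_csv(io.StringIO(csv_text))    # load the dataset

subset = df[["name", "income"]]            # filter by column
over_30 = df[df["age"] > 30]               # filter by row with a boolean mask

# apply: run a function on every value of a column
df["income_k"] = df["income"].apply(lambda v: v / 1000)

print(subset)
print(over_30)
print(df)
```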

Later, you will learn about SciPy. In this section, you will learn how to do common statistics calculations, including getting the PDF value, the CDF value, sampling from a distribution, and statistical testing.
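The statistics calculations named above could be sketched with `scipy.stats` as follows. The standard normal distribution, the sample size, the seed, and the choice of a one-sample t-test are assumptions made for this illustration. The course may use different distributions or tests.

```python
import numpy as np
from scipy import stats

# Assumed for illustration: a standard normal distribution.
dist = stats.norm(loc=0.0, scale=1.0)

pdf_at_zero = dist.pdf(0.0)   # density at 0, equal to 1 / sqrt(2 * pi)
cdf_at_zero = dist.cdf(0.0)   # P(X <= 0), which is 0.5 by symmetry

# Draw samples from the distribution (seeded for reproducibility).
samples = dist.rvs(size=1000, random_state=0)

# One-sample t-test of the null hypothesis that the mean is 0.
t_stat, p_value = stats.ttest_1samp(samples, popmean=0.0)

print(pdf_at_zero, cdf_at_zero)
print(t_stat, p_value)
```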

Finally, we will also cover some basics of machine learning that will help us start our deep learning journey.

By the end of the course, you will be able to use the NumPy stack confidently in deep learning and data science.

What You Will Learn

  • Understand supervised machine learning with real-world examples
  • Understand and code using the NumPy stack
  • Make use of NumPy, SciPy, Matplotlib, and Pandas to implement numerical algorithms
  • Understand the pros and cons of various machine learning models
  • Get a brief introduction to classification and regression
  • Learn how to calculate the PDF and CDF under the normal distribution


This course is designed for anyone who is interested in data science and machine learning, who knows Python and wants to take the next step into Python libraries for data science, or who is interested in acquiring tools to implement machine learning algorithms.

You need decent Python programming skills and a basic understanding of linear algebra and probability to take this course.

About The Author

Lazy Programmer:

The Lazy Programmer is an AI and machine learning engineer with a focus on deep learning, who also has experience in data science, big data engineering, and full-stack software engineering. With a background in computer engineering and specialization in machine learning, he holds two master’s degrees in computer engineering and statistics with applications to financial engineering. His expertise in online advertising and digital media includes work as both a data scientist and big data engineer.

He has created deep learning models for prediction and has experience in recommendation systems using reinforcement learning and collaborative filtering. He is a skilled instructor who has taught at universities including Columbia, NYU, Hunter College, and The New School. He has web programming expertise, with experience in technologies such as Python, Ruby/Rails, PHP, and Angular, and has provided his services to multiple businesses.

Product information

  • Title: Data Science Prerequisites - Numpy, Matplotlib, and Pandas in Python
  • Author(s): Lazy Programmer
  • Release date: February 2023
  • Publisher(s): Packt Publishing
  • ISBN: 9781803241616