Chapter 8: Population Vs. Sample

This chapter covers populations and samples. What we learn here forms the basis for the next chapter on hypothesis testing.

When working on a machine learning project, you rarely get to work with the complete data set. The reasons vary: time pressure, cost constraints, corrupted data, or a lack of storage space.

Instead, you work on a part of the complete data set, known as a sample. We analyze the sample and extend the inferences drawn from it to the larger data set, known as the population. That is, we extrapolate from the sample to the population.
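To make the idea concrete, here is a minimal sketch in Python that uses only the standard library. The population is synthetic: the 100,000 purchase amounts, their mean of 50 and spread of 12, the sample size of 500, and the random seed are all assumptions made for illustration and do not come from any real data set. The sketch draws a simple random sample and compares the sample mean with the population mean.

```python
import random
import statistics

# Fixed seed so the illustration is repeatable (assumed value).
rng = random.Random(42)

# Hypothetical population: 100,000 synthetic purchase amounts.
# The mean (50.0) and standard deviation (12.0) are made up for this sketch.
population = [rng.gauss(50.0, 12.0) for _ in range(100_000)]

# Simple random sample of 500 values, drawn without replacement.
sample = rng.sample(population, 500)

# In practice the population mean is usually unknown; we compute it here
# only because the population is synthetic and fully available.
population_mean = statistics.mean(population)
sample_mean = statistics.mean(sample)

print(f"population mean: {population_mean:.2f}")
print(f"sample mean:     {sample_mean:.2f}")
```

The two printed values should be close but not identical. That gap between a sample statistic and the corresponding population value is what inference from a sample has to account for.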

Let us say you are trying to understand buying behavior in your school or workplace. ...
