Chapter 4: Sampling and Inferential Statistics

In this chapter, we focus on several difficult sampling techniques and basic inferential statistics associated with each of them. This chapter is crucial because in real life, the data we have is, most likely, only a small portion of a whole set. Sometimes, we also need to perform sampling on a given large dataset. Common reasons for sampling are listed as follows:

  • The analysis can run quicker when the dataset is small.
  • Your model doesn't benefit much from having gazillions of pieces of data.

Sometimes, you also don't want sampling. For example, sampling a small dataset with sub-categories may be detrimental. Understanding how sampling works will help you to avoid various kinds of pitfalls.

Get Essential Statistics for Non-STEM Data Analysts now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.