The central limit theorem

The classical theory of sampling is based on the following fundamental theorem.


If the distribution of a population has finite variance, then the distribution of the arithmetic mean of random samples drawn from it is approximately normal, provided the sample size is sufficiently large.
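In symbols, one common form of this statement can be written as follows. This is the Lindeberg-Lévy version, which assumes independent and identically distributed draws; the notation is introduced here only for illustration.

```latex
\[
X_1, \dots, X_n \ \text{i.i.d.},\quad
\mathbb{E}[X_i] = \mu,\quad
0 < \operatorname{Var}(X_i) = \sigma^2 < \infty
\;\Longrightarrow\;
\sqrt{n}\,\frac{\bar{X}_n - \mu}{\sigma} \xrightarrow{\ d\ } \mathcal{N}(0, 1)
\quad (n \to \infty),
\qquad
\bar{X}_n = \frac{1}{n}\sum_{i=1}^{n} X_i .
\]
```

Read informally, for large n the sample mean behaves approximately like a normal variable with mean μ and variance σ²/n.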

A rigorous proof of this theorem usually takes about 3-6 pages and relies on advanced, measure-theoretic mathematics. Rather than working through this mathematical exercise, we carry out the "proof" by simulation, which also helps in understanding the central limit theorem and thus the basics of statistics.

The following setup is necessary:

  • We draw samples from populations. This means that we know the populations. This is not the case in practice, but we show that the population can have ...
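A simulation of this kind could be sketched as follows, in Python using only the standard library. Everything specific here is an assumption made for illustration and is not taken from the text: the exponential population, the sample sizes, the number of replications, and the helper names `sample_means`, `skewness` and `draw_exponential`. The sketch draws many samples from a known, clearly non-normal population and checks how the distribution of the sample means changes as the sample size grows.

```python
import math
import random
import statistics


def sample_means(draw, n, reps, rng):
    """Draw `reps` samples of size `n` and return the mean of each sample."""
    return [statistics.fmean(draw(rng) for _ in range(n)) for _ in range(reps)]


def skewness(values):
    """Standardised third moment; 0 for a symmetric (e.g. normal) distribution."""
    m = statistics.fmean(values)
    s = statistics.pstdev(values)
    return statistics.fmean(((v - m) / s) ** 3 for v in values)


def draw_exponential(rng):
    """One draw from the known population Exp(1): mean 1, variance 1, right-skewed."""
    return rng.expovariate(1.0)


rng = random.Random(1)  # fixed seed so the run is reproducible
REPS = 5000             # samples drawn per sample size (arbitrary choice)
results = {}

for n in (1, 5, 30, 200):
    means = sample_means(draw_exponential, n, REPS, rng)
    m = statistics.fmean(means)
    s = statistics.pstdev(means)
    # Share of sample means below their own average; 0.5 for a symmetric distribution.
    below = sum(x < m for x in means) / REPS
    results[n] = {"mean": m, "sd": s, "skew": skewness(means), "below": below}
    print(f"n={n:4d}  mean={m:.3f}  sd={s:.3f} (theory {1 / math.sqrt(n):.3f})  "
          f"skew={results[n]['skew']:.3f}  share below mean={below:.3f}")
```

For n = 1 the "sample means" are just the raw exponential draws, so they stay strongly skewed. As n grows, the skewness should shrink toward 0, the share below the mean should approach 0.5, and the standard deviation should track 1/sqrt(n), which is what the theorem leads us to expect.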
