Chapter 4. Statistics

Statistics is what data science used to be called before the widespread use of computers. But, that use has not diminished the importance of statistical principles to the analysis of data. This chapter examines those principles.

Descriptive statistics

A descriptive statistic is a function that computes a numeric value which in some way summarizes the data in a numeric dataset.

We saw two statistics in Chapter 3, Data Visualization: the sample mean, Descriptive statistics, and the sample standard deviation, s. Their formulas are:

Descriptive statistics

The mean summarizes the ...

Get Java Data Analysis now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.