Skip to Content
Data Science with Java
book

Data Science with Java

by Michael R. Brzustowicz
June 2017
Beginner to intermediate
233 pages
5h 57m
English
O'Reilly Media, Inc.
Content preview from Data Science with Java

Chapter 3. Statistics

Applying the basic principles of statistics to data science provides vital insight into our data. Statistics is a powerful tool. Used correctly, it enables us to be sure of our decision-making process. However, it is easy to use statistics incorrectly. One example is Anscombe’s quartet (Figure 3-1), which demonstrates how four distinct datasets can have nearly identical statistics. In many cases, a simple plot of the data can alert us right away to what is really going on with the data. In the case of Anscombe’s quartet, we can instantly pick out these features: in the upper-left panel, x and appear to be linear, but noisy. In the upper-right panel, we see that x and y form a peaked relationship that is nonlinear. In the lower-left panel, x and y are precisely linear, except for one outlier. The lower-right panel shows that is statistically distributed for and that there ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Mastering Java for Data Science

Mastering Java for Data Science

Alexey Grigorev
Java: Data Science Made Easy

Java: Data Science Made Easy

Richard M. Reese, Jennifer L. Reese, Alexey Grigorev

Publisher Resources

ISBN: 9781491934104Errata PageSupplemental Content