Chapter 7. Statistical Data Analysis

In this chapter, we will cover the following topics:

  • Exploring a dataset with pandas and Matplotlib
  • Getting started with statistical hypothesis testing – a simple z-test
  • Getting started with Bayesian methods
  • Estimating the correlation between two variables with a contingency table and a chi-squared test
  • Fitting a probability distribution to data with the maximum likelihood method
  • Estimating a probability distribution nonparametrically with kernel density estimation
  • Fitting a Bayesian model by sampling from a posterior distribution with a Markov chain Monte Carlo method
  • Analyzing data with the R programming language in the Jupyter Notebook


In the previous chapters, we reviewed technical aspects of high-performance ...
