Chapter 7. Statistical Data Analysis

In this chapter, we will cover the following topics:

  • Exploring a dataset with pandas and matplotlib
  • Getting started with statistical hypothesis testing – a simple z-test
  • Getting started with Bayesian methods
  • Estimating the correlation between two variables with a contingency table and a chi-squared test
  • Fitting a probability distribution to data with the maximum likelihood method
  • Estimating a probability distribution nonparametrically with a kernel density estimation
  • Fitting a Bayesian model by sampling from a posterior distribution with a Markov chain Monte Carlo method
  • Analyzing data with the R programming language in the IPython notebook

Introduction

In the previous chapters, we reviewed technical aspects of high-performance ...

Get IPython Interactive Computing and Visualization Cookbook now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.