What you need for this book

The following software is required for this book:

  • Ubuntu OS, preferably 14.04
  • Python 2.7
  • The pandas 0.16.2 library
  • The NumPy 1.9.2 library
  • The SciPy 0.16 library
  • IPython 4.0
  • The scikit-learn 0.16.1 module
  • The statsmodels 0.6.1 module
  • The matplotlib 1.4.3 library
  • Apache Hadoop CDH4 (Cloudera Hadoop 4) with MRv1 (MapReduce version 1)
  • Apache Spark 1.4.0
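
To see how an existing Python environment compares with the versions listed above, a short check such as the following may help. It is only a minimal sketch: the import names (for example, `sklearn` for the SciKit module) and the helper names `installed_version` and `check_requirements` are assumptions made for illustration, and the comparison is a crude prefix match rather than a real version comparison. Hadoop and Spark are not Python modules and are not covered.

```python
import importlib

# Versions taken from the list above. The import names are assumptions,
# e.g. "sklearn" is assumed to be the import name of the SciKit module.
REQUIRED = {
    "pandas": "0.16.2",
    "numpy": "1.9.2",
    "scipy": "0.16",
    "IPython": "4.0",
    "sklearn": "0.16.1",
    "statsmodels": "0.6.1",
    "matplotlib": "1.4.3",
}


def installed_version(module_name):
    """Return the module's __version__ string, or None if it is unavailable."""
    try:
        module = importlib.import_module(module_name)
    except Exception:
        # Covers a missing module as well as an install that fails on import.
        return None
    return getattr(module, "__version__", None)


def check_requirements(required=REQUIRED):
    """Map each module name to a (wanted, found) pair of version strings."""
    return {name: (wanted, installed_version(name))
            for name, wanted in required.items()}


if __name__ == "__main__":
    for name, (wanted, found) in sorted(check_requirements().items()):
        # Prefix match only: "0.16" accepts any 0.16.x release.
        status = "ok" if found and found.startswith(wanted) else "check"
        print("{:<12} wanted {:<7} found {} [{}]".format(
            name, wanted, found, status))
```

A module reported as `None` is either not installed or does not expose a `__version__` attribute; a `check` status simply means the installed version differs from the one listed here.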
