What you need for this book

The following software is required for this book:

  • Ubuntu OS, preferably 14.04
  • Python 2.7
  • The pandas 0.16.2 library
  • The NumPy 1.9.2 library
  • The SciPy 0.16 library
  • IPython 4.0
  • The scikit-learn 0.16.1 module
  • The statsmodels 0.6.1 module
  • The matplotlib 1.4.3 library
  • Apache Hadoop from CDH4 (Cloudera's Hadoop distribution, version 4) with MRv1 (MapReduce version 1)
  • Apache Spark 1.4.0
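
One way to compare an existing Python environment against the list above is a short version check. The following is a minimal sketch, not part of the book's own code: the helper names `installed_version` and `check_versions` are hypothetical, the import names (for example `sklearn` for scikit-learn) are the conventional ones, and it assumes each package exposes a `__version__` attribute. It covers only the Python libraries, not Ubuntu, Hadoop, or Spark.

```python
from __future__ import print_function  # keeps print() working on Python 2.7

import importlib

# Versions as listed above, keyed by the conventional import name
# (an assumption; e.g. scikit-learn is imported as "sklearn").
REQUIRED = {
    "pandas": "0.16.2",
    "numpy": "1.9.2",
    "scipy": "0.16",
    "IPython": "4.0",
    "sklearn": "0.16.1",
    "statsmodels": "0.6.1",
    "matplotlib": "1.4.3",
}


def installed_version(module_name):
    """Return the module's __version__ string, or None if it cannot be imported."""
    try:
        module = importlib.import_module(module_name)
    except Exception:
        # Missing packages raise ImportError; broken installs may raise others.
        return None
    # Fall back to "unknown" when the module has no __version__ attribute.
    return getattr(module, "__version__", "unknown")


def check_versions(required=REQUIRED):
    """Map each module name to a (listed_version, found_version) pair."""
    return {name: (wanted, installed_version(name))
            for name, wanted in required.items()}


if __name__ == "__main__":
    for name, (wanted, found) in sorted(check_versions().items()):
        status = found if found is not None else "not installed"
        print("{0:<12} listed: {1:<8} found: {2}".format(name, wanted, status))
```

The script only reports what it finds; it does not judge whether a newer version is compatible with the book's examples.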
