In this book, we will try to use as many varied datasets as possible. This depends on the availability of the data. Unfortunately, this means that the subject of the data might not exactly match your interests. Every dataset has its own quirks, but the general skills you acquire in this book should transfer to your own field. In this chapter, we will load a number of Comma-separated Value (CSV) files into NumPy arrays in order to analyze the data.
To load the data, we will use the NumPy
loadtxt() function as follows:
The code for this example can be found in
basic_stats.py in the code bundle.
import numpy as np from scipy.stats import scoreatpercentile data = np.loadtxt("mdrtb_2012.csv", delimiter=',', ...