April 2016
Beginner to intermediate
384 pages
8h 36m
English
Binning the observations comes in handy when we want to check the shape of the distribution visually or we want to transform the data into an ordinal form.
To execute this recipe, you will need the pandas and NumPy modules.
No other prerequisites are required.
To bin your observations (as in a histogram), you can use the following code (data_binning.py file):
# create bins for the price that are based on the
# linearly spaced range of the price values
bins = np.linspace(
csv_read['price_mean'].min(),
csv_read['price_mean'].max(),
6
)
# and apply the bins to the data
csv_read['b_price'] = np.digitize(
csv_read['price_mean'],
bins
)First, we create bins. For our price (with the mean imputed ...