Index

Accuracy, 9, 158165, 167

Agglomerative hierarchical clustering, 111120, 154

adjusting cut-off distances, 116

creating clusters, 114116

example, 116120

grouping process, 111113

Aggregate table, 39

Aggregation, 31

Alternative hypothesis, 7374

Anomaly detection, 211

Artificial neural network, see Neural network

Association rules, see Associative rules

Associative rules, 129139

antecedent, 134

confidence, 134135

consequence, 134

example, 137139, 230

extracting rules, 132137

grouping, 130132

lift, 135137

support, 134

Analysis of variance, see One-way analysis of variance

Average, see Mean

Bagging, 168

Bar chart, 41

Bin, 30

Binary, see Variable, binary

Binning, 30

Black-box, 197

Boosting, 168

Box plots, 25, 4546, 52, 233

Box-and-whisker plots, see Box plots

Budget, 12, 1415

Business analyst, 10

Case study, 12

Central limits theorem, 63

Central tendency, 5557, 96

Charts, see Graphs

Chi-square, 39, 67, 8284, 91

critical value, 8384

degrees of freedom, 84

distribution, 243

expected frequencies, 83

observed frequencies, 83

Churn analysis, 210

Claim, 72

Classification, 158162

Classification and regression tree (CART), see Decision trees

Classification models, 158, 182, 199, 202, 233

Classification trees, 181184, 203

Cleaning, 2426, 32, 219220

Clustering, 25, 110129, 168

agglomerative hierarchical clustering, 111120, 154

bottom-up, 111

hierarchical, 110

k-means clustering, 120129, 154

nonhierarchical, 110, 120

outlier detection, 25

top-down, 120

Common subsets, ...

Get Making Sense of Data: A Practical Guide to Exploratory Data Analysis and Data Mining now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.