Index

  

  

  

  

  

  

  

  

  

  

  

  

  

  

  

  

  

  

  

Numbers & Symbols

\ (backward slash) as separator, 69

/ (forward slash) as separator, 69

1-itemsets, 147

2-itemsets, 148149

3 Vs (volume, variety, velocity), 23

3-itemsets, 149150

4-itemsets, 150151

A

accuracy, 225

ACF (autocorrelation function), 236237

ACME text analysis example, 259260

raw text collection, 260263

aggregates (SQL)

ordered, 351352

user-defined, 347351

aggregators of data, 18

AIE (Applied Information Economics), 28

algorithms

clustering, 134135

decision trees, 197200

C4.5, 203204

CART, 204

ID3, 203

Alphine Miner, 42

alternative hypothesis, 102103

analytic projects

Approach, 369371

BI analyst, 362

business users, 361

code, 362, 376377

communication, 360361

data engineer, 362

data scientists, 362

DBA (Database Administrator), 362

deliverables, 362364

audiences, 364365

core material, 364365

key points, 372

Main Findings, 367369

model description, 371

model details, 372374

operationalizing, 360361

outputs, 361

presentations, 362

Project Goals, 365367

project manager, 362

project sponsor, 361

recommendations, 374375

stakeholders, 361362

technical specifications, 376377

analytic sandboxes. See sandboxes

analytical architecture, 1315

analytics

business drivers, 11

examples, 2223

new approaches, 1619

ANOVA, 110114

Anscombe's quartet, 8283

aov( ) function, 78

Apache Hadoop. See Hadoop

APIs (application programming interfaces), Hadoop, 304305

apriori( ) function, 146, 152157 ...

Get Data Science and Big Data Analytics: Discovering, Analyzing, Visualizing and Presenting Data now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.