Index

A

Absolute location
Adders utilities
Akka
Altitude
Amazon Redshift
Amazon Simple Storage Service (Amazon S3)
Boto package
Python’s s3 module
Amazon Web Services
Analysis of variance (ANOVA)
Analytical models
columns
data field name verification
data pattern
data type of data column
histograms of column
maximum value
mean
median
minimum value
missing/unknown values
mode
quartiles
range
sample data set
skewness
standard deviation
unique identifier, data entry
Andrews’ curves
Animals
class
families
genera
kingdoms
orders
phyla
species
ANOVA
See Analysis of variance (ANOVA)
Apache Cassandra
Apache Hadoop
Luigi
Pydoop
Apache Hive
Apache Mesos
Apache Spark
Apriori algorithm
Area graph
Assess_Best_Logistics
Assess superstep
Clark Ltd
See Clark Ltd
data analysis
data profiling
data quality
erroneous data ...
