December 2016
Beginner to intermediate
256 pages
7h 26m
English
If you torture the data long enough, it will confess.
Ronald Coase, Economist
In This Chapter:
What data quality is, the different types of data quality issues that arise in data, and how to address them with Hadoop
The importance of feature generation, various types of features, and how to generate features for your model with Hadoop
Feature selection and dimensionality reduction and its importance in addressing the ...
Read now
Unlock full access