Chapter 8: Analytics for Data Quality
8.2 Correlation: Problem and Benefit
Imputing missing values based on other variables
Substituting the effect of unavailable or unusable variables
Multicollinearity or the need for independent variables
Derived variables from transactional data
Derived variables for customer behavior
Statistical variability and the significance of p-values
Instability of the business background and definitions
8.4 Distribution and Sparseness
Distribution of interval variables