Chapter 11: Data Preparation for Analysis
Common Issues and Appropriate Strategies
Scale Differences among Model Variables
Too Many Levels of a Categorical Variable
High Dimensionality: Abundance of Columns
Correlated or Redundant Variables
Missing or Sparse Observations across Columns
Partitioning into Training, Validation, and Test Sets
Aggregating Rows with Summary Tables
Some Date Functions: Extracting Parts
Row Functions Especially Useful in Time-Ordered Data
Elapsed Time and Date Arithmetic
Introduction
Once we have all ...
Get Preparing Data for Analysis with JMP now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.