Exploring multiple variables simultaneously

All right. You have arrived at the last section of exploratory data analysis. Now you will expand your exploration to multiple variables at once. Typical datasets have many variables, but a bivariate analysis limits you to pairwise comparisons. Exploring five variables, two at a time creates 10 pairs, 10 variables create 45, 20 variables create 190, 40 variables create 780, and so on. The impact on workflow is nearly exponential, as shown in the following diagram:

Exploring multiple variables simultaneously

As the number of features (variables) in your dataset grows, your strategy for exploratory data analysis must scale along with your data. Your ...

Get Introduction to R for Business Intelligence now with O’Reilly online learning.

O’Reilly members experience live online training, plus books, videos, and digital content from 200+ publishers.