Chapter 35. Breaking Down Complex Data Preparation Challenges

Previous chapters discussed techniques for determining the changes required to prepare a data set for analysis, albeit at a relatively simple level. But how do you approach a problem when the path isn’t straightforward? Complex challenges can include building solutions that require multiple steps, joining data sources on multiple conditions, or completing many reshaping steps throughout the data prep process. This chapter tackles exactly that scenario by taking on one of the most complicated challenges Preppin’ Data has covered to date: 2020: Week 3. The aim of this challenge, created by Jonathan Allenby (my fellow Dr. Prepper), is to turn National Basketball Association (NBA) game results into the detailed standings you commonly see on websites or in newspapers.

 

The Challenge

This challenge involves taking the results and building the full conference league tables, including rankings, wins, and losses; recent performance; and even winning streaks (Figure 35-1). It is a tough challenge with many facets, which makes it a great example for showing how to break down complex problems into smaller, more manageable pieces.

Figure 35-1. Challenge post for Preppin’ Data 2020: Week 3
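
To make the idea of splitting the problem into pieces concrete, here is a minimal Python sketch rather than the Tableau Prep flow the chapter builds. It is not the challenge's actual solution. The input layout, the team names, and the scores are all made up for illustration, and ties in win percentage are simply ranked in the order they appear. Each function handles one small piece: reshaping the results, counting wins and losses, finding the current streak, and ranking.

```python
from collections import defaultdict

# Made-up sample results; this column layout is an assumption, not the
# challenge's real input: (date, home_team, home_points, away_team, away_points)
games = [
    ("2020-01-01", "Team A", 101, "Team B", 99),
    ("2020-01-02", "Team B", 110, "Team C", 95),
    ("2020-01-03", "Team C", 90, "Team A", 104),
    ("2020-01-04", "Team B", 88, "Team A", 97),
]


def per_team_results(games):
    """Piece 1: reshape one row per game into one row per team per game."""
    rows = []
    for date, home, home_pts, away, away_pts in games:
        rows.append((home, date, home_pts > away_pts))
        rows.append((away, date, away_pts > home_pts))
    return rows


def win_loss(rows):
    """Piece 2: aggregate wins and losses for each team."""
    record = defaultdict(lambda: {"wins": 0, "losses": 0})
    for team, _date, won in rows:
        record[team]["wins" if won else "losses"] += 1
    return dict(record)


def current_streak(rows, team):
    """Piece 3: length of the team's most recent run of identical results."""
    # ISO-formatted date strings sort chronologically.
    results = [won for t, _d, won in sorted(rows, key=lambda r: r[1]) if t == team]
    if not results:
        return ""
    latest = results[-1]
    length = 0
    for won in reversed(results):
        if won != latest:
            break
        length += 1
    return f"{'W' if latest else 'L'}{length}"


def standings(games):
    """Piece 4: combine the earlier pieces and rank by win percentage."""
    rows = per_team_results(games)
    table = []
    for team, rec in win_loss(rows).items():
        played = rec["wins"] + rec["losses"]
        table.append({
            "team": team,
            "wins": rec["wins"],
            "losses": rec["losses"],
            "pct": rec["wins"] / played,
            "streak": current_streak(rows, team),
        })
    table.sort(key=lambda row: row["pct"], reverse=True)
    for rank, row in enumerate(table, start=1):
        row["rank"] = rank  # simple sequential rank; tie-breakers are not handled
    return table


for row in standings(games):
    print(row["rank"], row["team"], row["wins"], row["losses"], row["streak"])
```

Because each piece is a separate function, it can be checked on its own before the pieces are combined, which is the same habit that helps when building a long data prep flow.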

As covered in Chapter 3, developing an understanding of the input and ...
