15

Other Considerations – Measures, Calculations, Restatements, and Data Science Best Practices

In this chapter, we will elaborate on important data considerations, such as measures, calculations, data restatements (of derived data), and various data engineering designs for data science, as well as the notebook tooling needed for these designs.

We’ll begin with a metaphor. Think about making a loaf of bread. When a baker prepares, they begin by assembling all the materials needed so that everything that will be required is present, in equal if not greater amounts than required for the finished product. This careful planning makes it possible to not have to scurry around looking for an item (tool or ingredient) when it is needed in the recipe. ...

Get Data Engineering Best Practices now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.