10

Transformation Patterns, Cleansing, and Normalization

In this chapter, we’ll learn about transformation patterns and their role in data management. The Lambda, Kappa, and Microservice architectural patterns will be covered in the following sections. We’ll also cover important data transformation methods, such as cleansing, normalization, masking, de-duplication, enrichment, validation, and standardization.

Data workers, like you, must understand these transformation patterns and methods. In a data-driven world, the ability to analyze raw data is invaluable. This expertise is crucial for data scientists preparing data for machine learning models, analysts gaining insights, and database administrators assuring data governance and security. ...

Get The Definitive Guide to Data Integration now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.