August 2022
Beginner to intermediate
204 pages
4h 54m
English
In the previous chapter, we went into depth on our first data flow pattern in this book, walking through the common slowly changing dimensions pattern. In this chapter, we’ll continue the deep-dive exploration with another data flow pattern. This time we’ll cover data deduplication.
Part of the role of data engineering and ETL jobs is to ensure that the data being processed for business use is clean and contains a single source of truth. Deduping data is extremely important and ...
Read now
Unlock full access