Data standardization and matching
In general, populating a data warehouse is the process of integrating, consolidating, and aggregating data from disparate data sources. These sources are rarely designed by a single authority. More often, they are built in individual, siloed projects or inherited through mergers and acquisitions, and each one uses its own types of identifiers and maintains its own version of the truth. Trying to integrate such fragmented data into a single system presents a number of data quality (DQ) problems ...