Data standardization and matching
In general, populating a data warehouse is the process of integrating, consolidating, and aggregating data from various disparate data sources. These sources are rarely designed by a single authority. More often, they are designed through individual, siloed projects or introduced through mergers and acquisitions where each one uses its own types of identifiers and maintain its own version of the truth. Trying to integrate such fragmented data into a single system presents a number of data quality (DQ) problems ...

Get IBM Information Server: Integration and Governance for Emerging Data Warehouse Demands now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.