Skip to Content
Data Management at Scale, 2nd Edition
book

Data Management at Scale, 2nd Edition

by Piethein Strengholt
April 2023
Beginner to intermediate
412 pages
12h 48m
English
O'Reilly Media, Inc.
Book available
Content preview from Data Management at Scale, 2nd Edition

Chapter 10. Modern Master Data Management

In this book’s first edition, I offered the following advice about data and master data management (MDM): “If it’s fast and fluid, break it apart into smaller pieces and leave it up to the domains. If it’s stable and it truly matters, consider using MDM.” Three years later, I stand by this recommendation. Why? Because in a heavily distributed environment, applications are often intertwined. In previous chapters, we discussed the scale at which enterprises need to manage and distribute data. Large organizations typically have a multitude of domains, each taking ownership of its data assets and responsibility for sharing reusable data products with data consumers. This federation increases speed, but it’s a concern for several reasons:

  • Domains tend to take care of only their own data products and not how to work with other products, possibly making it difficult for consumers to combine data from multiple domains.

  • If each domain is building its own data transformation code, then there will be much duplication of effort, which makes the architecture costly and ineffective.

  • Having multiple domains that have aggregates or copies of data from other domains for performance reasons leads to data duplication and increased complexity.

  • There may be inconsistencies in contexts between domains, and therefore variance in the levels of data quality.

To address the challenges of data reusability and consistency, we need to add the discipline ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Data Governance: The Definitive Guide

Data Governance: The Definitive Guide

Evren Eryurek, Uri Gilad, Valliappa Lakshmanan, Anita Kibunguchy-Grant, Jessi Ashdown
Practical Statistics for Data Scientists, 2nd Edition

Practical Statistics for Data Scientists, 2nd Edition

Peter Bruce, Andrew Bruce, Peter Gedeck
Fundamentals of Data Engineering

Fundamentals of Data Engineering

Joe Reis, Matt Housley

Publisher Resources

ISBN: 9781098138851Errata Page