book

Implementing a Modern Data Catalog to Power Data Intelligence

by Fadi Maali, Jason Lim

June 2022

Beginner to intermediate

38 pages

51m

English

O'Reilly Media, Inc.

Read now

Unlock full access

What Is in a Data Catalog?Data Catalog Features and Example ApplicationsA Framework to Characterize Data CatalogsSummary
Tool-Adjunct Data CatalogsBroad ConnectivityIntelligenceActive GovernanceDomain-Specific CatalogsBroad ConnectivityIntelligenceActive GovernanceData Catalog PlatformsBroad ConnectivityIntelligenceActive GovernanceSummary
Data Catalog in an Enterprise Data StackEnterprise Data LakesThe Modern Data StackData MeshData FabricSuccessful Implementation of Data CatalogsAccommodate Existing Workflows for Data UsersFocus on PeopleFocus on Business and Technical MetadataHave an Adoption PlanMeasure Adoption and Impact of the Data CatalogSummary
Catalog Business ImpactCatalog Use CasesSelf-Service Business IntelligenceData Governance and Guided Data UsageData OperationsCloud and Multicloud MigrationSummary

Content preview from Implementing a Modern Data Catalog to Power Data Intelligence

Chapter 1. Data Catalogs

A data catalog is a collection of metadata describing data assets and their usage. Modern data catalogs provide relevant functionality to support metadata management, enrichment, and search. They not only help users find relevant data but guide them on proper use of that data. Data catalogs help answer the questions:

How can I find relevant data?
Once I find data, can I use it?
Should I use it?
How should I use it?

Cataloging and managing metadata in enterprises is not a new practice. Metadata repositories have existed since the 1970s and relational databases have had metadata catalogs since their early days. However, in the years since, the technology surrounding data and the role of data in the enterprise have both changed substantially.

Enterprise data landscapes have grown more sophisticated—the “3 Vs” of big data (volume, velocity, and variety) are widely known. And the legislative environment mandating compliant data usage continues to grow in complexity as more people (and AI-powered programs) access and use data in new ways.¹ Moreover, the growing adoption of cloud computing and SaaS results in more data residing outside the enterprise infrastructure and control. As a result, collecting, managing, and using comprehensive and accurate metadata has become paramount; and modern data catalogs are the tools that enable best practices.

Modern data catalogs have grown in maturity and sophistication to address new and increasingly complex challenges. ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Start your free trial

Architecting Modern Data Platforms

Jan Kunigk, Ian Buss, Paul Wilkinson, Lars George

Data Fabric and Data Mesh Approaches with AI: A Guide to AI-based Data Cataloging, Governance, Integration, Orchestration, and Consumption

Eberhard Hechler, Maryela Weihrauch, Yan (Catherine) Wu

Automating Data Quality Monitoring

Jeremy Stanley, Paige Schwartz

The Enterprise Data Catalog

Ole Olesen-Bagneux

Publisher Resources

ISBN: 9781492098751