Book description
The data lake was once heralded as the answer to the flood of big data that arrived in a variety of structured and unstructured formats. But, due to the ease of integration and the lack of governance, data lakes in many companies have devolved into unusable data swamps. This short ebook shows you how to solve this problem using an Operational Data Hub (ODH) to collect, store, index, cleanse, harmonize, and master data of all shapes and formats.
Gerhard Ungerer—CTO and co-founder of Random Bit LLC—explains how the ODH supports transactional integrity so that the hub can serve as integration point for enterprise applications. You’ll also learn how the ODH helps you leverage the investment in your data lake (or swamp), so that the data trapped there can finally be ingested, processed, and provisioned.
With this ebook, you’ll learn how an ODH:
- Allows you to focus on categorizing data for easy and fast retrieval
- Provides flexible storage models, indexing support, query capabilities, security, and a governance framework
- Delivers flexible storage models; support for indexing, scripting, and automation; query capabilities; transactional integrity; and security
- Includes a governance model to help you access, ingest, harmonize, materialize, provision, and consume data
Table of contents
-
Cleaning Up the Data Lake with an Operational Data Hub
- Introduction
- The End Goal: Enterprise Data Integration
- What Are the Challenges Faced by Enterprise Data Integration Efforts?
- Big Data for Enterprise Data Integration
- What Is a Data Lake?
- What Is a Data Swamp?
- What Is an ODH?
- The Benefits of an ODH
- Planning to Clear the Data Swamp
- Transforming the Data Swamp into a Hub
- Summary
Product information
- Title: Cleaning Up the Data Lake with an Operational Data Hub
- Author(s):
- Release date: March 2018
- Publisher(s): O'Reilly Media, Inc.
- ISBN: 9781492027379
You might also like
book
Operationalizing the Data Lake
Big data and advanced analytics have increasingly moved to the cloud as organizations pursue actionable insights …
book
Practical Enterprise Data Lake Insights: Handle Data-Driven Challenges in an Enterprise Big Data Lake
Use this practical guide to successfully handle the challenges encountered when designing an enterprise data lake …
video
Data Superstream: Data Lakes and Warehouses
Storing, processing, and moving data in the cloud efficiently and cost-effectively is a must for working …
book
Data Engineering with AWS
The missing expert-led manual for the AWS ecosystem — go from foundations to building data engineering …