Designing Cloud Data Platforms

by Lynda Partner, Danil Zburivsky
May 2021
Beginner to intermediate
336 pages
11h
English
Manning Publications
Content preview from Designing Cloud Data Platforms

4 Getting data into the platform

This chapter covers

  • Understanding databases, files, APIs, and streams
  • Ingesting data from RDBMSs using SQL versus change data capture
  • Parsing and ingesting data from various file formats
  • Developing strategies to deal with source schema changes
  • Designing an ingestion pipeline to handle the challenges of data streams
  • Building an ingestion pipeline for SaaS data
  • Implementing quality control and monitoring in your ingestion pipeline
  • Discussing network and security considerations for cloud data ingestion

If you’ve read the chapters up to this point, you’re able to architect a good, layered data lake. Now it’s time to start diving into a few of these layers in much greater detail.

In this chapter, we’ll focus on ...

ISBN: 9781617296444