book

Designing a Modern Application Data Stack

by Adam Morton, Brad Culberson, Kevin McGinley

October 2023

Intermediate to advanced

42 pages

58m

English

O'Reilly Media, Inc.

Read now

Unlock full access

Introduction
1. The Cloud Data Platform Is a Great Fit for Data-Intensive Apps
Benefits of a Modern Cloud Data PlatformCloud Platform–Specific ConsiderationsCross-Cloud Deployment and Application DevelopmentApplication Deployment ModelsExtensibilitySecuritySummary
2. Scalability
Single Versus Multi-TenancyResource Sharing and Workload IsolationBest Practices for Building a Scalable Data ApplicationStorage ConsiderationsVertical Versus Horizontal ScalingWorkload IsolationRow-Level SecuritySummary
3. Efficient Data Processing and Distribution
Key Considerations for Data ProcessingData Format Reducing Data MovementData Integrity and Timeliness Considerations for DistributionSecure Data Sharing with Third PartiesChoosing Where to Process the DataModernizing App Distribution Summary
Conclusion
About the Authors

Content preview from Designing a Modern Application Data Stack

Chapter 3. Efficient Data Processing and Distribution

Successful data applications deliver tangible insights to customers in the most efficient way possible. Modern data applications need to tap into an array of rapidly changing data sets and data formats while supporting a distribution model that delivers a consistent user experience.

It’s important to consider how to ingest and integrate data by building simple data pipelines that are easy to maintain and extend over time. In this chapter, we will look at how a cloud data platform allows you to reduce data movement and improve timeliness to deliver data to your application at scale.

Key Considerations for Data Processing

Processing data at scale presents a significant and complex challenge for many data teams. The primary objective of the data processing layer is to construct pipelines that efficiently and rapidly transfer data from source systems to the cloud data platform. These pipelines should be automated and resilient. They also often trigger subsequent processes that apply transformations to cleanse and standardize data, ensuring the consistent delivery of high-quality data in the required format. This gives your teams fast access to ready-to-use data, allowing them to focus their efforts on application development rather than maintaining data pipelines.

Let’s take a look at key considerations for efficient data processing, including data format, how to reduce data movement to improve data integrity and timeliness, ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

O’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.

Julian F.

Head of Cybersecurity

I wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.

Addison B.

Field Engineer

I’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.

Amir M.

Data Platform Tech Lead

I'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.

Mark W.

Embedded Software Engineer

Publisher Resources

ISBN: 9781098157524

Cloud Computing

Data Engineering

Data Science

AI & ML

Programming Languages

Software Architecture

IT/Ops

Security

Design

Business

Soft Skills

Designing a Modern Application Data Stack

by Adam Morton, Brad Culberson, Kevin McGinley

Chapter 3. Efficient Data Processing and Distribution

Key Considerations for Data Processing

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.