Skip to Content
Cost-Effective Data Pipelines
book

Cost-Effective Data Pipelines

by Sev Leonard
July 2023
Intermediate to advanced
286 pages
7h 52m
English
O'Reilly Media, Inc.

Overview

The low cost of getting started with cloud services can easily evolve into a significant expense down the road. That's challenging for teams developing data pipelines, particularly when rapid changes in technology and workload require a constant cycle of redesign. How do you deliver scalable, highly available products while keeping costs in check?

With this practical guide, author Sev Leonard provides a holistic approach to designing scalable data pipelines in the cloud. Intermediate data engineers, software developers, and architects will learn how to navigate cost/performance trade-offs and how to choose and configure compute and storage. You'll also pick up best practices for code development, testing, and monitoring.

By focusing on the entire design process, you'll be able to deliver cost-effective, high-quality products. This book helps you:

  • Reduce cloud spend with lower cost cloud service offerings and smart design strategies
  • Minimize waste without sacrificing performance by rightsizing compute resources
  • Drive pipeline evolution, head off performance issues, and quickly debug with effective monitoring
  • Set up development and test environments that minimize cloud service dependencies
  • Create data pipeline code bases that are testable and extensible, fostering rapid development and evolution
  • Improve data quality and pipeline operation through validation and testing
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

Data Engineering with AWS

Data Engineering with AWS

Gareth Eagar
Kafka Connect

Kafka Connect

Mickael Maison, Kate Stanley
Building Machine Learning Pipelines

Building Machine Learning Pipelines

Hannes Hapke, Catherine Nelson
Data Science on AWS

Data Science on AWS

Chris Fregly, Antje Barth

Publisher Resources

ISBN: 9781492098638Errata Page