Book description
The low cost of getting started with cloud services can easily evolve into a significant expense down the road. That's challenging for teams developing data pipelines, particularly when rapid changes in technology and workload require a constant cycle of redesign. How do you deliver scalable, highly available products while keeping costs in check?
With this practical guide, author Sev Leonard provides a holistic approach to designing scalable data pipelines in the cloud. Intermediate data engineers, software developers, and architects will learn how to navigate cost/performance trade-offs and how to choose and configure compute and storage. You'll also pick up best practices for code development, testing, and monitoring.
By focusing on the entire design process, you'll be able to deliver cost-effective, high-quality products. This book helps you:
- Reduce cloud spend with lower cost cloud service offerings and smart design strategies
- Minimize waste without sacrificing performance by rightsizing compute resources
- Drive pipeline evolution, head off performance issues, and quickly debug with effective monitoring
- Set up development and test environments that minimize cloud service dependencies
- Create data pipeline code bases that are testable and extensible, fostering rapid development and evolution
- Improve data quality and pipeline operation through validation and testing
Table of contents
- Preface
- 1. Designing Compute for Data Pipelines
- 2. Responding to Changes in Demand by Scaling Compute
- 3. Data Organization in the Cloud
- 4. Economical Pipeline Fundamentals
- 5. Setting Up Effective Development Environments
- 6. Software Development Strategies
- 7. Unit Testing
- 8. Mocks
- 9. Data for Testing
- 10. Logging
- 11. Finding Your Way with Monitoring
- 12. Essential Takeaways
- Appendix. Preparing a Cloud Budget
- Index
- About the Author
Product information
- Title: Cost-Effective Data Pipelines
- Author(s):
- Release date: July 2023
- Publisher(s): O'Reilly Media, Inc.
- ISBN: 9781492098645
You might also like
book
Tidy First?
Messy code is a nuisance. "Tidying" code, to make it more readable, requires breaking it up …
book
Flow Architectures
Software development today is embracing events and streaming data, which optimizes not only how technology interacts …
book
Building Micro-Frontends
What's the answer to today's increasingly complex web applications? Micro-frontends. Inspired by the microservices model, this …
book
Hypermodern Python Tooling
Keeping up with the Python ecosystem can be daunting. Its developer tooling doesn't provide the out-of-the-box …