Skip to Content
Learning and Operating Presto
book

Learning and Operating Presto

by Angelica Lo Duca, Tim Meehan, Vivek Bharathan, Ying Su
September 2023
Intermediate to advanced
191 pages
4h 32m
English
O'Reilly Media, Inc.
Content preview from Learning and Operating Presto

Chapter 9. Operating Presto at Scale

Scalability involves a Presto cluster handling increased demand or usage with minimal impact on performance, ensuring that the system’s response time remains consistent and acceptable even when the workload increases.

We won’t be implementing a specific scenario in this chapter, so you won’t find the code in the book’s GitHub repository since the scalability of your Presto cluster depends on your cluster workload. Instead, we’ll discuss general strategies for scaling your Presto cluster to enable you to adapt them to your specific conditions.

The chapter is organized into four parts. In the first part, we’ll introduce some basic concepts related to scalability, including reasons to scale a Presto cluster and some common issues related to a Presto cluster that needs to be scaled. In the second part, we’ll see some design considerations to consider when you want to scale your Presto cluster. These include availability, manageability, performance, protection, and configuration. Next, we’ll analyze popular approaches for scaling a Presto cluster, including multiple coordinators, Presto on Spark, and spilling. Finally, we’ll focus on how to scale a Presto cluster using a cloud service.

Introducing Scalability

Operating Presto at scale means adding more resources to the system to handle an increased workload. The concept of scalability is slightly different from that of performance tuning, which you learned in Chapter 8. In fact, performance tuning ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Ten Things to Know About ModelOps

Ten Things to Know About ModelOps

Thomas Hill, Mark Palmer, Larry Derany
What Employees Want Most in Uncertain Times

What Employees Want Most in Uncertain Times

Kristine W. Powers, Jessica B.B. Diaz
Data Superstream: Data Lakes and Warehouses

Data Superstream: Data Lakes and Warehouses

Alistair Croll, Lena Hall, Vini Jaiswal, Einat Orr, Wannes Rosiers, Jessica Larson, Ryan Blue, Tejas Chopra

Publisher Resources

ISBN: 9781098141844Errata PageSupplemental Content