Learning and Operating Presto

by Angelica Lo Duca, Tim Meehan, Vivek Bharathan, Ying Su
September 2023
Intermediate to advanced
191 pages
4h 32m
English
O'Reilly Media, Inc.

Chapter 3. Connectors

A Presto connector is a plug-in that connects the Presto engine to an external data source, which Presto then exposes as a catalog. Presto connectors are available for a wide variety of data sources, including relational databases, NoSQL databases, and filesystems.

In this chapter, you’ll learn how to implement a custom connector. First, we’ll describe the concept of the Service Provider Interface (SPI) provided by Presto, which defines the interfaces to build custom connectors, types, functions, and system access control in Presto.

Then, we’ll focus on the concept of connectors and how they are implemented in Presto. Next, you’ll learn how to implement a custom connector, as well as an alternative implementation based on Apache Thrift that lets Presto communicate with external servers written in any language supported by Thrift, such as Python or PHP.

Finally, we’ll focus on Apache Pinot, which is a real-time distributed datastore, and how to connect it to Presto.

Service Provider Interface

An SPI is an interface you can use to extend a framework with third-party components. Presto implements its own SPI, as shown in Figure 3-1. The Presto SPI, which is within the Presto server, manages the communication between the Presto core and the external service providers.

Figure 3-1. The SPI architecture
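
To make the SPI idea concrete, here is a minimal, self-contained sketch of the pattern in Java. The `Plugin`, `ConnectorFactory`, and `Connector` interfaces below are simplified stand-ins invented for this illustration, as are `ExamplePlugin`, `ExampleConnectorFactory`, `Engine`, and the `tables` configuration key. They are not the real Presto SPI types, which carry considerably more methods and parameters.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Illustrative sketch only: every type here is a hypothetical stand-in,
// not part of the actual Presto SPI.
class SpiSketch {
    /** What the engine expects from any data source. */
    interface Connector {
        List<String> listTables();
    }

    /** Builds a connector for a named catalog from its configuration. */
    interface ConnectorFactory {
        String getName();
        Connector create(String catalogName, Map<String, String> config);
    }

    /** Entry point that a service provider implements to contribute factories. */
    interface Plugin {
        List<ConnectorFactory> getConnectorFactories();
    }

    /** Provider side: a factory whose connectors expose a configured table list. */
    static class ExampleConnectorFactory implements ConnectorFactory {
        @Override
        public String getName() {
            return "example";
        }

        @Override
        public Connector create(String catalogName, Map<String, String> config) {
            // "tables" is an assumed configuration key, defaulting to one table.
            List<String> tables = List.of(config.getOrDefault("tables", "orders").split(","));
            return () -> tables;
        }
    }

    /** Provider side: the plug-in module that bundles the factory. */
    static class ExamplePlugin implements Plugin {
        @Override
        public List<ConnectorFactory> getConnectorFactories() {
            return List.of(new ExampleConnectorFactory());
        }
    }

    /** Engine side: depends only on the interfaces, never on a provider class. */
    static class Engine {
        private final Map<String, ConnectorFactory> factories = new HashMap<>();

        void installPlugin(Plugin plugin) {
            for (ConnectorFactory factory : plugin.getConnectorFactories()) {
                factories.put(factory.getName(), factory);
            }
        }

        Connector createCatalog(String catalogName, String connectorName, Map<String, String> config) {
            ConnectorFactory factory = factories.get(connectorName);
            if (factory == null) {
                throw new IllegalArgumentException("No factory for connector: " + connectorName);
            }
            return factory.create(catalogName, config);
        }
    }

    public static void main(String[] args) {
        Engine engine = new Engine();
        engine.installPlugin(new ExamplePlugin());
        Connector connector = engine.createCatalog(
                "mycatalog", "example", Map.of("tables", "orders,customers"));
        System.out.println(connector.listTables()); // prints [orders, customers]
    }
}
```

The point of the sketch is the direction of the dependencies: the engine knows only the interfaces, so a provider can be added without changing engine code.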

A service provider, wanting to communicate with Presto, must implement a plug-in module, ...

ISBN: 9781098141844