Book description
Perform fast interactive analytics against different data sources using the Trino high-performance distributed SQL query engine. With this practical guide, you'll learn how to conduct analytics on data where it lives, whether it's Hive, Cassandra, a relational database, or a proprietary data store. Analysts, software engineers, and production engineers will learn how to manage, use, and even develop with Trino.
Initially developed by Facebook, open source Trino is now used by Amazon, Google, LinkedIn, Lyft, Netflix, Pinterest, Salesforce, Shopify, and many other companies. Matt Fuller, Manfred Moser, and Martin Traverso show you how a single Trino query can combine data from multiple sources to allow for analytics across your entire organization.
- Get started: Explore Trino's use cases and learn about tools that will help you connect to Trino and query data
- Go deeper: Learn Trino's internal workings, including how to connect to and query data sources with support for SQL statements, operators, functions, and more
- Put Trino in production: Secure Trino, monitor workloads, tune queries, and connect more applications; learn how other organizations apply Trino
Publisher resources
Table of contents
- Foreword
- Preface
- I. Getting Started with Trino
-
1. Introducing Trino
- The Problems with Big Data
- Trino to the Rescue
-
Trino Use Cases
- One SQL Analytics Access Point
- Access Point to Data Warehouse and Source Systems
- Provide SQL-Based Access to Anything
- Federated Queries
- Semantic Layer for a Virtual Data Warehouse
- Data Lake Query Engine
- SQL Conversions and ETL
- Better Insights Due to Faster Response Times
- Big Data, Machine Learning, and Artificial Intelligence
- Other Use Cases
- Trino Resources
- A Brief History of Trino
- Conclusion
- 2. Installing and Configuring Trino
- 3. Using Trino
- II. Diving Deeper into Trino
- 4. Trino Architecture
- 5. Production-Ready Deployment
- 6. Connectors
- 7. Advanced Connector Examples
-
8. Using SQL in Trino
- Trino Statements
- Trino System Tables
- Catalogs
- Schemas
- Information Schema
- Tables
- Views
- Session Information and Configuration
- Data Types
- SELECT Statement Basics
- WHERE Clause
- GROUP BY and HAVING Clauses
- ORDER BY and LIMIT Clauses
- JOIN Statements
- UNION, INTERSECT, and EXCEPT Clauses
- Grouping Operations
- WITH Clause
- Subqueries
- Deleting Data from a Table
- Conclusion
-
9. Advanced SQL
- Functions and Operators Introduction
- Scalar Functions and Operators
- Boolean Operators
- Logical Operators
- Range Selection with the BETWEEN Statement
- Value Detection with IS (NOT) NULL
- Mathematical Functions and Operators
- Trigonometric Functions
- Constant and Random Functions
- String Functions and Operators
- Strings and Maps
- Unicode
- Regular Expressions
- Unnesting Complex Data Types
- JSON Functions
- Date and Time Functions and Operators
- Histograms
- Aggregate Functions
- Window Functions
- Lambda Expressions
- Geospatial Functions
- Prepared Statements
- Conclusion
- III. Trino in Real-World Uses
- 10. Security
- 11. Integrating Trino with Other Tools
- 12. Trino in Production
- 13. Real-World Examples
- 14. Conclusion
- Index
Product information
- Title: Trino: The Definitive Guide
- Author(s):
- Release date: April 2021
- Publisher(s): O'Reilly Media, Inc.
- ISBN: 9781098107710
You might also like
book
Robust Python
Does it seem like your Python projects are getting bigger and bigger? Are you feeling the …
book
Foundations of Scalable Systems
In many systems, scalability becomes the primary driver as the user base grows. Attractive features and …
book
Flow Architectures
Software development today is embracing events and streaming data, which optimizes not only how technology interacts …
book
Quick Start Guide to Large Language Models: Strategies and Best Practices for using ChatGPT and Other LLMs
The advancement of Large Language Models (LLMs) has revolutionized the field of Natural Language Processing in …