Overview
Iceberg data lakehouse architecture leverages the widely accepted Apache Iceberg open table format to deliver superior features through enhanced metadata handling. But understanding Iceberg conceptually is only the beginning. To benefit from its architecture, you need to know how it works, how to apply it to real tasks, and how to optimize it effectively.
In this practical guide, Lester Martin provides the conceptual understanding, best practices, and appropriate tools required to optimize your own Iceberg architecture. While specifically addressing Apache Iceberg, the author contrasts other table formats, including Apache Hive and Delta Lake, and explores related technologies such as Apache Spark. You'll learn how to optimize Iceberg with Trino by deploying a Starburst Icehouse architecture, and how Starburst's value-added features and frameworks support closely coupled functionality such as data ingestion.
- Understand what Apache Iceberg is and how it works architecturally
- Use Iceberg to perform tasks that are difficult in other architectures
- Optimize Iceberg by leveraging its architectural features
- Understand why Trino works alongside Iceberg to deliver superior performance
- Perform table structure and data platform optimizations using Starburst Galaxy
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Read now
Unlock full access