April 2026
Beginner to intermediate
408 pages
13h 26m
English
Apache Iceberg is a community-driven table format that defines how large analytical datasets are organized, versioned, and accessed on a data lake. It doesn’t change how data is stored at the file level. Instead, it adds a standard metadata layer on top of files, typically stored in Apache Parquet, which lets collections of files be treated as coherent, relational tables while remaining on low-cost object storage. This chapter will explore the architecture and value of Apache Iceberg as an open table format for data lakehouses.
Read now
Unlock full access