CHAPTER 6Data Warehouses, Data Lakes, and Data Lakehouses
In previous chapters, we explored the fundamentals of databases, exploring how databases are structured, managed, and optimized for business operations. Databases are the backbone of data storage and retrieval for many organizations. However, traditional database systems have some limitations. This chapter will introduce you to other database solutions designed to tackle unique challenges faced by organizations.
IN THIS CHAPTER, YOU WILL LEARN ABOUT THE FOLLOWING:
- What are data warehouses and how do they work?
- Extract, transform, and load processes
- Star and Snowflake modeling techniques
- Online Transaction Processing (OLTP) and Online Analytical Processing (OLAP) systems
- Data marts and their benefits
- Storing raw data with data lakes
- The data lakehouse architecture
- Choosing between a database, a data warehouse, and a data lake
- Exploring real-world use cases
By the end of this chapter, you will have a solid foundation for understanding these key data storage paradigms, enabling you to make informed decisions while integrating them into modern data workflows. Throughout this chapter, we’ll use Dough & Delight, a fictional bakery, as our retail store example. All data models and scenarios will reference this business to provide consistency and real-world context.
Data Warehouses
Our retail store, Dough & Delight, has a relational database with separate tables for tracking customer orders, pastries, prices, and transactions. ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Read now
Unlock full access