July 2022
Intermediate to advanced
304 pages
9h 15m
English
While parts 1 and 2 were concerned with squeezing and sampling data to make it fit into RAM, we can now finally breathe a sigh of relief—our data, all of it, is comfortably resting on disk. In the three chapters in part 3, we will learn how to effectively design algorithms and data structures for large datasets sitting on disk. This will include understanding how retrieval, insertion, and deletion work in different kinds of databases and how to efficiently sort large files on disk. We will also delve into differences in the design of indices between read-optimized and write-optimized databases. The first step in doing all of this will be understanding how the I/O cost (i.e., ...
Read now
Unlock full access