Chapter 5. Query Execution and Scoring
By now, we have established a fundamental grasp of how search systems encode and organize data. We’ve seen how raw input, whether textual, numeric, geospatial, or vector, is transformed into indexed representations across immutable segment files. We’ve examined how updates and deletes are implemented using tombstones and versioning, and how the immutable design enables concurrency, consistency, and performance. But indexing is only half the battle. In this chapter, we turn our attention to the other half: the mechanics of querying.
Efficiently executing a query across large-scale, append-only indexes is a non-trivial task. The system must not only find matching documents but do so in a way that is latency-bounded, resource-aware, and accurate enough to support downstream applications such as ranking, recommendations, or retrieval-augmented generation. This requires a sophisticated execution pipeline that includes planning, coordination, filtering, scoring, ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Read now
Unlock full access