Skip to Content
Introduction to Search with Sphinx
book

Introduction to Search with Sphinx

by Andrew Aksyonoff
April 2011
Beginner
148 pages
4h 15m
English
O'Reilly Media, Inc.
Content preview from Introduction to Search with Sphinx

Chapter 5. Managing Indexes

As we briefly mentioned in the preceding chapter, Sphinx lets you search through multiple indexes at the same time. There are usually two reasons for devoting multiple indexes to the same application area: the main+delta strategy that greatly reduces the delay in keeping an index up-to-date, and parallelizing queries across indexes to reduce the delay in responding to queries. All serious production sites use multiple indexes, so you’ll find this chapter to be a natural sequel to the preceding one. The strategy leads to complexities that I’ll cover in this chapter. But I’ll occasionally diverge from the “general overview” approach of previous chapters and focus more on specific features, all of the nitty and even some of the gritty details of engine internals, and concrete use cases and dos and don’ts.

The “Divide and Conquer” Concept

Plain disk indexes need to be fully rebuilt from scratch every time you need to update the text data they contain. This can lead to delays of minutes or even hours before new and updated rows appear in response to queries—and that’s not even considering the waste of CPU cycles and networking.

Many people, including myself, lack the patience for this. Should you stand for this in your very own applications? It depends on the numbers, and concrete figures are easy to approximate with a bit of simple back-of-the-envelope math.

On modern commodity gear (which, at the time of this writing, means multicore CPUs clocked at 3.0 GHz, ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

Elasticsearch 7 Quick Start Guide

Elasticsearch 7 Quick Start Guide

Anurag Srivastava, Douglas Miller
Mastering Apache Solr 7.x

Mastering Apache Solr 7.x

Sandeep Nair, Chintan Mehta, Dharmesh Vasoya
Learning Elasticsearch

Learning Elasticsearch

Abhishek Andhavarapu

Publisher Resources

ISBN: 9780596809546