Chapter 2. Understanding Analyzers, Tokenizers, and Filters

In the previous chapter, we read how to install and run Solr on various operating systems and covered its architecture. We talked briefly about the basic building blocks of Solr, such as Solr config files.

In this chapter, we will cover the following core components of the Solr configuration:

  • Analyzers
  • Tokenizers
  • Filters

Introducing analyzers

To make us able to search effectively and efficiently, Solr splits text into tokens during indexing as well as during search (query time). Solr does all of this with the help of its three main components: analyzers, tokenizers, and filters. Analyzers are used during both indexing and searching. An analyzer examines the text of fields and the generated ...

Get Apache Solr for Indexing Data now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.