Chapter 2. Understanding Analyzers, Tokenizers, and Filters

In the previous chapter, we saw how to install and run Solr on various operating systems and covered its architecture. We also looked briefly at Solr's basic building blocks, such as its configuration files.

In this chapter, we will cover the following core components of the Solr configuration:

  • Analyzers
  • Tokenizers
  • Filters

Introducing analyzers

To support effective and efficient search, Solr splits text into tokens both at index time and at query time. It does this with the help of three main components: analyzers, tokenizers, and filters. Analyzers are used during both indexing and searching. An analyzer examines the text of fields and the generated ...
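The relationship between the three components can be illustrated with a small conceptual sketch in Python. This is not Solr's API or implementation; the names `Analyzer`, `whitespace_tokenizer`, `lowercase_filter`, `stop_filter`, and the stop-word list are all hypothetical, chosen only to show a tokenizer splitting text into tokens, filters transforming that token stream, and one analyzer chain being applied to both indexed text and query text.

```python
from typing import Callable, Iterable, List

# Hypothetical stop-word list, for illustration only.
STOP_WORDS = frozenset({"a", "an", "the", "of"})


def whitespace_tokenizer(text: str) -> List[str]:
    """Tokenizer: split raw field text into tokens on whitespace."""
    return text.split()


def lowercase_filter(tokens: Iterable[str]) -> Iterable[str]:
    """Filter: normalize every token to lowercase."""
    return (token.lower() for token in tokens)


def stop_filter(tokens: Iterable[str]) -> Iterable[str]:
    """Filter: drop tokens that appear in the stop-word list."""
    return (token for token in tokens if token not in STOP_WORDS)


class Analyzer:
    """Analyzer: one tokenizer followed by an ordered chain of filters."""

    def __init__(
        self,
        tokenizer: Callable[[str], List[str]],
        filters: List[Callable[[Iterable[str]], Iterable[str]]],
    ) -> None:
        self.tokenizer = tokenizer
        self.filters = list(filters)

    def analyze(self, text: str) -> List[str]:
        tokens: Iterable[str] = self.tokenizer(text)
        for token_filter in self.filters:
            # Each filter consumes the previous stage's token stream.
            tokens = token_filter(tokens)
        return list(tokens)


analyzer = Analyzer(whitespace_tokenizer, [lowercase_filter, stop_filter])

# The same chain runs on document text (index time) and on the
# user's search text (query time), so both sides produce comparable tokens.
indexed_tokens = analyzer.analyze("The Art of Indexing")
query_tokens = analyzer.analyze("indexing")

print(indexed_tokens)  # ['art', 'indexing']
print(query_tokens)    # ['indexing']
```

Because both sides pass through the same chain in this sketch, the query token `indexing` matches the indexed token even though the original document text was capitalized.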
