Chapter 5. Analyzing your data

This chapter covers

  • Analyzing your document’s text with Elasticsearch
  • Using the analysis API
  • Tokenization
  • Character filters
  • Token filters
  • Stemming
  • Analyzers included with Elasticsearch

So far we’ve covered indexing and searching your data, but what actually happens to the text in a document when you send it to Elasticsearch? How can Elasticsearch find specific words within sentences, even when the case differs? For example, when a user searches for “nosql,” you’d generally like a document containing the sentence “share your experience with NoSql & big data technologies” to match, because it contains the word NoSql. You can use the information you learned in the previous chapter ...
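To make the “nosql” versus “NoSql” example concrete, here is a minimal Python sketch of the idea behind analysis: break text into tokens, lowercase them, and apply the same steps to the query. It is a toy illustration, not Elasticsearch’s implementation; the function names `analyze`, `build_index`, and `search` are hypothetical, and the regular expression only approximates a real tokenizer.

```python
import re


def analyze(text):
    """Toy analysis chain: split on non-alphanumeric characters, then lowercase.

    This only approximates what a real analyzer does (assumption for
    illustration); punctuation such as "&" is simply dropped.
    """
    tokens = re.findall(r"[A-Za-z0-9]+", text)
    return [token.lower() for token in tokens]


def build_index(docs):
    """Map each analyzed term to the set of document IDs that contain it."""
    index = {}
    for doc_id, text in docs.items():
        for term in analyze(text):
            index.setdefault(term, set()).add(doc_id)
    return index


def search(index, query):
    """Return IDs of documents containing every analyzed query term."""
    terms = analyze(query)
    if not terms:
        return set()
    postings = [index.get(term, set()) for term in terms]
    return set.intersection(*postings)


docs = {1: "share your experience with NoSql & big data technologies"}
index = build_index(docs)

print(analyze(docs[1]))
print(search(index, "nosql"))
```

Because the document text and the query pass through the same `analyze` function, the query “nosql” and the indexed word “NoSql” both become the term `nosql`, which is why the document matches regardless of case.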
