Indexing multiple languages (Advanced)

We have seen in previous examples that Solr can handle issues that arise with languages other than English, for example by enabling accent-insensitive searches for words that contain accents and complex characters.

In this example, we will look at the deeper language support that Solr provides: automatically detecting the language used, applying language-specific text processing, and hiding the implementation details from users and client applications.

As Solr is quite flexible with its language support, let's consider and implement one scenario:

  • An e-mail may arrive in one of two languages: English or Russian
  • The language-specific content will be in the subject and message fields with both fields assumed ...
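
To make the scenario concrete, here is a minimal Python sketch of the idea of detecting a language and routing the subject and message fields to language-specific fields. It is only an illustration and not how Solr does it: Solr performs this work inside its own indexing pipeline, and the character-counting heuristic, the function names, the language field, and the _en/_ru field suffixes are all assumptions made for this example.

```python
import unicodedata

# Fields that carry language-specific text in the e-mail scenario.
LANGUAGE_FIELDS = ("subject", "message")


def detect_language(text):
    """Crude stand-in detector: 'ru' if Cyrillic letters outnumber Latin ones, else 'en'."""
    cyrillic = latin = 0
    for ch in text:
        if not ch.isalpha():
            continue
        name = unicodedata.name(ch, "")
        if name.startswith("CYRILLIC"):
            cyrillic += 1
        elif name.startswith("LATIN"):
            latin += 1
    return "ru" if cyrillic > latin else "en"


def route_fields(doc):
    """Return a copy of doc with subject/message renamed to per-language fields.

    The suffix convention (subject_en, subject_ru, ...) is hypothetical.
    """
    combined = " ".join(doc.get(field, "") for field in LANGUAGE_FIELDS)
    lang = detect_language(combined)
    routed = {k: v for k, v in doc.items() if k not in LANGUAGE_FIELDS}
    routed["language"] = lang
    for field in LANGUAGE_FIELDS:
        if field in doc:
            routed[f"{field}_{lang}"] = doc[field]
    return routed


if __name__ == "__main__":
    print(route_fields({"id": "1", "subject": "Привет", "message": "Как дела?"}))
    print(route_fields({"id": "2", "subject": "Hello", "message": "How are you?"}))
```

Because the client submits plain subject and message fields and the routing happens in one place, the per-language handling stays hidden from users and client applications, which is the design goal described above.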
