O'Reilly logo

Apache Solr 3 Enterprise Search Server by Eric Pugh, David Smiley

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Chapter 2. Schema and Text Analysis

The foundation of Solr is based on Lucene's index—the subject of this chapter. You will learn about:

  • Schema design decisions in which you map your source data to Lucene's limited structure. In this book we'll consider the data from MusicBrainz.org.
  • The structure of the schema.xml file where the schema definition is defined. Within this file are both the definition of field types and the fields of those types that store your data.
  • Text analysis—the configuration of how text is processed (tokenized and so on) for indexing. This configuration affects whether or not a particular search is going to match a particular document.

The following diagram shows the big picture of how various aspects of working with Solr are ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required