Skip to Content
eXist
book

eXist

by Erik Siegel, Adam Retter
December 2014
Beginner
584 pages
15h 13m
English
O'Reilly Media, Inc.
Content preview from eXist

Chapter 12. Text Indexing and Lookup

Beside the “basic” indexing capabilities, as handled in Chapter 11, eXist also has a full-text index based on the Apache Lucene text search-engine library. Lucene allows eXist to offer search capabilities like words near each other, words like other words, Boolean operators, and more. Full-text indexes allow you to do much more with your content than you can do using straight XPath expressions.

If your application needs search based on human input, such as searching documentation or the like, full-text indexes can really help. But things get even better: on top of the full-text index searches, eXist offers “keyword in context,” or KWIC, functionality. This makes it extremely easy to display the results of your searches in context, showing the search results within the surrounding text. KWIC is handled in Using Keywords in Context.

Full-Text Index and KWIC Example

The examples for this book contain a simple full-text search example. This example searches, using the full-text index, over some ancient Encyclopedia Britannica entries. Important components of the example are:

  • The index definition in /db/system/config/db/apps/exist-book/indexing/data/collection.xconf defines a full-text index on tei:p elements:

    <collection xmlns="http://exist-db.org/collection-config/1.0">
      <index xmlns:tei="http://www.tei-c.org/ns/1.0">
        
        <!-- other indexes -->
        
        <lucene>
          <text qname="tei:p"/>
        </lucene>
      </index>
    </collection>
  • An extremely simple HTML form that allows you to ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

Communicate with Teams More Effectively

Communicate with Teams More Effectively

Charles Humble
What Successful Project Managers Do

What Successful Project Managers Do

W. Scott Cameron, Jeffrey S. Russell, Edward J. Hoffman, Alexander Laufer

Publisher Resources

ISBN: 9781449337094Errata Page