Press Room

Press Release: April 14, 2008

Ferret--New from O'Reilly: Indexed Searching for Ruby Applications

Domestic ferrets are most commonly known for being quick, curious, and flexible, so it's no surprise that the new search library for Ruby is known for these same qualities. A high-performance, full-featured text search engine library written specifically for Ruby, Ferret can quickly locate data on hard drives and servers.

But just like domestic ferrets, this library takes a bit of training before it's "house-broken." With the new release of Ferret (O'Reilly Media, $24.99), Ruby enthusiasts have no reason to worry. Written by David Balmain, the primary developer of this open source library, the book shows Ruby programmers how to use one of the fastest and most flexible search libraries available for locating data on hard drives and servers.

Ferret teaches you how to index different document types such as PDF, Microsoft Word, and HTML, as well as how to deal with foreign languages and different character encodings. This guide also describes the Ferret Query Language in detail along with the object-oriented approach to building queries.

Topics discussed in Ferret include:
  • The Index class: an easy-to-use interface to the rest of the Ferret API
  • Using Ferret out of the box: how to set up an index, set up fields for indexing different data types, and set up a field for sorting
  • Going behind the scenes: learn about the indexing process, performance tuning, optimizing the index, Ferret's index locking mechanism, and concurrency issues
  • Ferret's search API: find out how to use the search classes, build queries, and filter, sort, and highlight search results
  • Ferret's analysis API: use the standard Token, TokenStream, and Analyzer classes, and learn to build a custom analyzer
  • Building a search application: index multiple document types, make indexing and search improvements, and put together a powerful PC search application

This book includes plenty of Ruby code samples, which are downloadable from the companion web site. With Ferret, you can now build a search application that helps you find anything on your computer, with lightning-fast indexing and split-second query results.

For a review copy or more information, please email us and include your delivery address and contact information.

David Balmain is a freelance software developer and the primary developer of the open source search library Ferret. He gained an interest in high-performance text processing at university, where he earned a BSc specializing in natural language processing. Recently he has taken an interest in web application development and become enamored with the scripting language Ruby.

Additional Resources:
For more information about this book, including table of contents, index, author bios, and cover graphic, see the catalog page for Ferret

David Balmain
ISBN: 9780596519407, 110 pages, Book Price: $24.99 USD, £15.50 GBP
PDF Price: $9.99 USD

About O'Reilly

For almost 40 years, O’Reilly Media has provided technology and business training, knowledge, and insight to help companies succeed. Our unique network of experts and innovators shares its knowledge and expertise through the company’s Safari training and learning platform and at O’Reilly conferences. As a SaaS learning platform, Safari delivers highly topical and comprehensive technology and business learning solutions to millions of users across enterprise, consumer, and university channels. For more information visit

Press Contacts

Media Relations – Corporate & North America

Fama PR

Media Relations – Japan

Fumi Yamakawa
+81 3-3356-5227

Media Relations – United Kingdom

Helen Codling
+44 (0) 1252 721284