Administrating Solr

Book Description

Master the use of Drupal and associated scripts to administrate, monitor, and optimize Solr

  • Learn how to work with monitoring tools like OpsView, New Relic, and SPM

  • Utilize Solr scripts and Collection Distribution scripts to manage Solr

  • Employ search features like querying, categorizing, location-based search, and distributed search

In Detail

Implementing a search engine on a web product is practically mandatory these days. Apache Solr is a robust search engine, but deploying it and then forgetting about it is not a good idea, especially when you have to fight for the search ranking of your web product. In that case, you need to keep monitoring, administrating, and optimizing your Solr installation to retain your ranking.

"Administrating Solr" is a practical, hands-on guide. It provides a number of clear, step-by-step exercises and some advanced concepts that will help you administrate, monitor, and optimize Solr using Drupal and associated scripts. It also gives you a solid grounding in how to use Apache Solr with Drupal.

"Administrating Solr" starts with an overview of Apache Solr and the installation process to get you familiar with Solr. It then gradually moves on to the inner workings that make Solr flexible enough to return appropriate search results in different scenarios. The book takes you through clear and practical concepts that will help you monitor, administrate, and optimize Solr using both scripts and tools. It also teaches you ways to query Solr and methods to keep your installation healthy and well maintained. With this book, you will learn how to effectively implement and optimize Solr using Drupal.

Table of Contents

  1. Administrating Solr
    1. Table of Contents
    2. Administrating Solr
    3. Credits
    4. About the Author
    5. About the Reviewers
    6. www.PacktPub.com
      1. Support files, eBooks, discount offers and more
        1. Why Subscribe?
        2. Free Access for Packt account holders
    7. Preface
      1. What this book covers
      2. What you need for this book
      3. Who this book is for
      4. Conventions
      5. Reader feedback
      6. Customer support
        1. Downloading the example code
        2. Errata
        3. Piracy
        4. Questions
    8. 1. Searching Data
      1. Installation
      2. Request/response handling
      3. Querying
        1. Querying based on a particular field value
      4. Searching for a phrase
      5. Boosting phrases over words
      6. Prioritizing your document in search results
      7. Query nesting
      8. Faceted search
        1. Search based on the same value range
        2. Filter your facet results
        3. Autosuggest feature using faceting
      9. Geospatial search
        1. Storing geographical points in the index
        2. Sort results by a distance from a point
      10. Distributed search
      11. Summary
    9. 2. Monitoring Solr
      1. Monitoring metrics
        1. Stats.jsp
        2. JMX MBeans
        3. Solr with JMX
      2. Using password-based authentication
        1. Single-user environment
        2. Multiuser environment
      3. Agent-less health checks
        1. Ping status
      4. Agent-based health checks
      5. Monitoring tools
        1. Opsview
          1. Solr-specific plugin
          2. Service check set up
          3. Host configuration
          4. Viewing output
        2. New Relic for PHP
          1. Features
          2. System requirement
        3. SPM
          1. What are SPM monitors?
          2. Features
      6. Summary
    10. 3. Managing Solr
      1. Basic Solr scripts
        1. User scripts.conf
        2. Init script
      2. Backup your Solr
        1. Using the Solr replication handler
        2. Backup scripts from earlier Solr releases
      3. Configuring logging
        1. Temporary logging settings
        2. Permanent logging settings
          1. Tomcat logging settings
          2. Jetty logging settings
      4. Collection distribution scripts
        1. Scripts configuration
        2. Rsyncd set up
      5. Summary
    11. 4. Optimizing Solr Tools and Scripts
      1. Business rules
        1. Drools
          1. Drools language features
          2. Drools rule format
          3. Drools rule language (DRL)
          4. Domain-specific language
          5. Guided rule editor
          6. Other Drools features
        2. XML rule language
        3. Why rules?
        4. The rule engine workflow
        5. Benefits of using rules
      2. Language Detection
        1. Configuring language detection
        2. Tika language detection
        3. LangDetect language detection
      3. OpenNLP (Natural Language Processing)
        1. Sentence detection and splitting
        2. Tokenization
        3. Part-Of-Speech tagging
        4. Named entity recognition
      4. Case study – Apache Solr with Drupal 7
        1. Autocomplete
        2. Faceted search
        3. Facet slider
        4. Boosting search relevancy
        5. Prioritizing search results
        6. Customizing the search result display
      5. Summary
    12. A. Solr Resources
    13. Index