Book description
Build an enterprise search engine using Apache Solr: index and search documents; ingest data from varied sources; apply various text processing techniques; utilize different search capabilities; and customize Solr to retrieve the desired results. Apache Solr: A Practical Approach to Enterprise Search explains each essential concept--backed by practical and industry examples--to help you attain expert-level knowledge.
The book, which assumes a basic knowledge of Java, starts with an introduction to Solr, followed by steps to setting it up, indexing your first set of documents, and searching them. It then introduces you to information retrieval and its implementation in Apache Solr; this will help you understand your search problem, decide the approach to build an effective solution, and use various metrics to evaluate the results.
The book next covers the schema design and techniques to build a text analysis chain for cleansing, normalizing and enriching your documents and addressing different types of search queries. It describes various popular matching techniques which are generally applied to improve the precision and recall of searches.
You will learn the end-to-end process of data ingestion from varied sources, metadata extraction, pre-processing and transformation of content, various search components, query parsers and other advanced search capabilities.
After covering out-of-the-box features, Solr expert Dikshant Shahi dives into ways you can customize Solr for your business and its specific requirements, along with ways to plug in your own components. Most important, you will learn about implementations for Solr scoring, factors affecting the document score, and tuning the score for the application at hand. The book explains why textual scoring is not sufficient for practical ranking of documents and ways to integrate real-world factors for contributing to the document ranking.
You'll see how to influence user experience by providing suggestions and recommendations. You'll also see integration of Solr with important related technologies such as OpenNLP and Tika. Additionally, you will learn about scaling Solr using SolrCloud.
This book concludes with coverage of semantic search capabilities, which is crucial for taking the search experience to the next level. By the end of Apache Solr, you will be proficient in designing and developing your search engine.
Table of contents
- Cover
- Title
- Copyright
- Dedication
- Contents at a Glance
- Contents
- About the Author
- About the Technical Reviewer
- Acknowledgments
- Introduction
- Chapter 1: Apache Solr: An Introduction
- Chapter 2: Solr Setup and Administration
- Chapter 3: Information Retrieval
- Chapter 4: Schema Design and Text Analysis
- Chapter 5: Indexing Data
- Chapter 6: Searching Data
- Chapter 7: Searching Data: Part 2
- Chapter 8: Solr Scoring
- Chapter 9: Additional Features
-
Chapter 10: Traditional Scaling and SolrCloud
- Stand-Alone Mode
- Sharding
- Master-Slave Architecture
- Shards with Master-Slave
-
SolrCloud
- Understanding the Terminology
- Starting SolrCloud
- Restarting a Node
- Creating a Collection
- Uploading to ZooKeeper
- Deleting a Collection
- Indexing a Document
- Load Balancing
- Document Routing
- Working with a Transaction Log
- Performing a Shard Health Check
- Querying Results
- Performing a Recovery
- Shard Splitting
- Adding a Replica
- ZooKeeper
-
Frequently Asked Questions
- Why is the size of my data/tlog directory growing drastically? How can I handle that?
- Can I totally disable transaction logs? What would be the impact?
- I have recently migrated from traditional architecture to SolrCloud. Is there anything that I should be careful of and not do in SolrCloud?
- I am migrating to SolrCloud, but it fails to upload the configurations to ZooKeeper. What could be the reason?
- Summary
- Chapter 11: Semantic Search
- Index
Product information
- Title: Apache Solr: A Practical Approach to Enterprise Search
- Author(s):
- Release date: December 2015
- Publisher(s): Apress
- ISBN: 9781484210703
You might also like
book
Apache Solr Search Patterns
Leverage the power of Apache Solr to power up your business by navigating your users to …
book
Mastering Apache Solr 7.x
Accelerate your enterprise search engine and bring relevancy in your search analytics About This Book A …
book
Apache Solr for Indexing Data
Enhance your Solr indexing experience with advanced techniques and the built-in functionalities available in Apache Solr …
book
Scaling Apache Solr
Optimize your searches using high-performance enterprise search repositories with Apache Solr In Detail This book is …