Book description
Further your knowledge of the Elasticsearch server by learning more about its internals, querying, and data handling
In Detail
Elasticsearch is a modern, fast, distributed, scalable, fault tolerant, and open source search and analytics engine. Elasticsearch leverages the capabilities of Apache Lucene, providing a new level of control over how you can index and search even huge sets of data.
This book covers intermediate and advanced functionalities of Elasticsearch and walks you through its internals including caches, the Apache Lucene library, and its monitoring capabilities. You'll learn about practical usage of Elasticsearch configuration parameters and how to use the monitoring API.
With this book, you'll delve into Elasticsearch's query rewrite, query template, bulk operation, document grouping, and function score queries. You will also learn how to improve user search experience, index distribution, segment statistics, and merging. By the end of the book, you will be able to enhance Elasticsearch's performance and create your own Elasticsearch plugins.
What You Will Learn
- Understand Apache Lucene and Elasticsearch's design and architecture
- Use and configure different scoring models to alter the default scoring mechanism
- Choose the appropriate amount of shards and replicas for your deployment
- Improve user search experience by utilizing Elasticsearch functionality
- Control segment merging and learn why Elasticsearch uses merging
- Develop custom Elasticsearch plugins and cover detailed examples of how to extend Elasticsearch by writing your own plugins
- Apply your knowledge to create scalable, efficient, and fault tolerant clusters and monitor your cluster by using and understanding the Elasticsearch API
Table of contents
-
Mastering Elasticsearch Second Edition
- Table of Contents
- Mastering Elasticsearch Second Edition
- Credits
- About the Author
- Acknowledgments
- About the Author
- Acknowledgments
- About the Reviewers
- www.PacktPub.com
- Preface
- 1. Introduction to Elasticsearch
-
2. Power User Query DSL
- Default Apache Lucene scoring explained
- Query rewrite explained
- Query templates
- Handling filters and why it matters
-
Choosing the right query for the job
- Query categorization
- The use cases
- Summary
-
3. Not Only Full Text Search
- Query rescoring
- Controlling multimatching
- Significant terms aggregation
- Documents grouping
- Relations between documents
- Scripting changes between Elasticsearch versions
- Summary
-
4. Improving the User Search Experience
- Correcting user spelling mistakes
- Improving the query relevance
- Summary
- 5. The Index Distribution Architecture
-
6. Low-level Index Control
- Altering Apache Lucene scoring
- Choosing the right directory implementation – the store module
- NRT, flush, refresh, and transaction log
- Segment merging under control
- When it is too much for I/O – throttling explained
-
Understanding Elasticsearch caching
- The filter cache
- The field data cache
- The shard query cache
- Using circuit breakers
- Clearing the caches
- Index, indices, and all caches clearing
- Summary
-
7. Elasticsearch Administration
- Discovery and recovery modules
- The human-friendly status API – using the Cat API
- Backing up
- Federated search
- Summary
-
8. Improving Performance
- Using doc values to optimize your queries
- Knowing about garbage collector
- Benchmarking queries
- Very hot threads
-
Scaling Elasticsearch
- Vertical scaling
- Horizontal scaling
- Using Elasticsearch for high load scenarios
- Summary
-
9. Developing Elasticsearch Plugins
- Creating the Apache Maven project structure
- Understanding the basics
- Creating custom REST action
-
Creating the custom analysis plugin
-
Implementation details
- Implementing TokenFilter
- Implementing the TokenFilter factory
- Implementing the class custom analyzer
- Implementing the analyzer provider
- Implementing the analysis binder
- Implementing the analyzer indices component
- Implementing the analyzer module
- Implementing the analyzer plugin
- Informing Elasticsearch about our custom analyzer
- Testing our custom analysis plugin
-
Implementation details
- Summary
- Index
Product information
- Title: Mastering Elasticsearch - Second Edition
- Author(s):
- Release date: February 2015
- Publisher(s): Packt Publishing
- ISBN: 9781783553792
You might also like
book
ElasticSearch Cookbook - Second Edition
Over 130 advanced recipes to search, analyze, deploy, manage, and monitor data effectively with ElasticSearch In …
book
Elasticsearch Essentials
Harness the power of ElasticSearch to build and manage scalable search and analytics solutions with this …
book
Elasticsearch Server - Third Edition
Leverage Elasticsearch to create a robust, fast, and flexible search solution with ease About This Book …
book
Learning Elasticsearch
Store, search, and analyze your data with ease using Elasticsearch 5.x About This Book Get to …