December 2023
Intermediate to advanced
592 pages
15h 57m
English
Data that makes its way into Elasticsearch is not always clean. Usually, data requires transformation, enrichment, or formatting. There are options for cleaning data before bringing it into Elasticsearch for ingestion, such as writing custom transformers or using ETL (extract, transform, load) tools. Elasticsearch allows these capabilities via ingest pipelines that provide first-class support for manipulating data—we can split, remove, modify, and enhance data before it is ingested.
Data to be indexed into Elasticsearch may need to undergo transformation and manipulation. Consider an example of loading millions of legal documents represented as PDF files into Elasticsearch for searching. Although bulk ...
Read now
Unlock full access