Chapter 3. Analyzing Data with Apache Solr

Many organizations suffer when dealing with huge amounts of data generated in different formats, due to incremental IT enablement of their business processes. Dealing with vast varieties of data becomes a challenge for any enterprise search engine. This data may reside in a database, or would be streamed over HTTP protocol. To address these problems, many companies provided tools to bring in data from various sources into one form. These were Extract Transfer Load (ETL) tools mainly used for business intelligence (BI) and analytics solutions. Luckily, Apache Solr provides different ways of dealing with different data types, when it comes down to information collection. We have already read about indexing ...

Get Scaling Apache Solr now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.