Apache Solr and Cassandra

Cassandra is one of the most widely used distributed, fault-tolerant NoSQL databases. It is designed to handle Big Data workloads across multiple nodes without a single point of failure. Planet Cassandra has published some interesting performance benchmarks (http://planetcassandra.org/nosql-performance-benchmarks/), which place Apache Cassandra among the fastest NoSQL databases on measures such as throughput and load. Apache Cassandra allows schemaless storage of user information in a structure called a column family. For example, look at the data model for sales information, which is shown as follows:

When this model is transformed for the Cassandra store, it becomes ...
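As a generic illustration of the column-family idea (not the sales model referred to above), a column family can be thought of as a map from row keys to sparse sets of named columns, where each row may carry different columns. The sketch below uses plain Python dictionaries. The row keys, column names, and the `put` and `get_row` helpers are all hypothetical and are not part of any Cassandra API.

```python
# Hypothetical in-memory stand-in for a single column family:
# row key -> {column name -> value}. This is not a Cassandra API.
sales = {}

def put(row_key, column, value):
    """Store one column under a row key; no schema is declared up front."""
    sales.setdefault(row_key, {})[column] = value

def get_row(row_key):
    """Return a row's columns ordered by column name (empty if the row is absent)."""
    return dict(sorted(sales.get(row_key, {}).items()))

# Two rows in the same column family with different sets of columns.
put("order-1", "customer", "alice")
put("order-1", "amount", 120.5)
put("order-2", "customer", "bob")
put("order-2", "coupon", "SPRING")

print(get_row("order-1"))
print(get_row("order-2"))
```

The point of the sketch is only that rows in one column family need not share the same columns; a real Cassandra deployment adds partitioning, replication, and on-disk storage that this toy structure ignores.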
