Apache Solr and Cassandra

Cassandra is one of the most widely used and distributed, fault-tolerant NOSQL database. Cassandra is designed to handle Big Data workloads across multiple nodes without a single point of failure. There are some interesting performance benchmarks published at planet Cassandra (http://planetcassandra.org/nosql-performance-benchmarks/), which places Apache Cassandra as one of the fastest NOSQL database among its competitors in terms of throughput, load, and so on. Apache Cassandra allows schemaless storage of user information in its store called column families pattern. For example, look at the data model for sales information, which is shown as follows:

When this model is transformed for the Cassandra store, it becomes ...

Get Scaling Apache Solr now with O’Reilly online learning.

O’Reilly members experience live online training, plus books, videos, and digital content from 200+ publishers.