Learning Real-time Processing with Spark Streaming by Sumit Gupta

Integration with Cassandra

Apache Cassandra (http://cassandra.apache.org/) is a massively distributed database designed to handle large volumes of data across multiple data centers. It is a linearly scalable, open source NoSQL (non-relational) database that offers high availability and is easy to operate. It runs on commodity hardware or cloud infrastructure, which keeps costs low, and it has proven fault tolerance.

Integrating a NoSQL database such as Cassandra with Spark Streaming serves two purposes. Downstream systems such as web applications, portals, or mobile apps can consume the processed data whenever and however they need it, and the same database can act as a data source for Spark for further deep analytics.

DataStax, www.datastax.com ...
