on-demand course

Building Better Distributed Data Pipelines

with Patrick McFadin

November 2017

Intermediate

53m

English

Closed Captioning available in German, English, Spanish, French, Japanese, Korean, Portuguese (Portugal, Brazil), Chinese (Simplified), Chinese (Traditional)

Overview

Patrick McFadin explains the basics of how to build more efficient data pipelines, using Apache Kafka to organize, Apache Cassandra to store, and Apache Spark to analyze. Patrick offers an overview of how Cassandra works and why it can be a perfect fit for data-driven projects. Patrick then demonstrates that with the addition of Spark and Kafka, you can maintain a highly distributed, fault-tolerant, and scaling solution. You’ll leave with a comprehensive view of the many options to make considered choices in your data pipeline projects.

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Start your free trial

Berglund and McCullough on Mastering Cassandra for Architects

Publisher Resources

ISBN: 9781492031000

Building Better Distributed Data Pipelines

with Patrick McFadin

Overview

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

You might also like

Berglund and McCullough on Mastering Cassandra for Architects

Deploying TensorFlow Models to a Web Application: Using Flask API, TensorFlowJS, and TensorFlow Serving

Large-scale Real-time Stream Processing and Analytics

The Java Sessions: The Best of OSCON 2011

Publisher Resources

Overview

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,and much more.

You might also like

Berglund and McCullough on Mastering Cassandra for Architects

Deploying TensorFlow Models to a Web Application: Using Flask API, TensorFlowJS, and TensorFlow Serving

Large-scale Real-time Stream Processing and Analytics

The Java Sessions: The Best of OSCON 2011

Publisher Resources

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.