Skip to Content
Scalable Big Data Architecture: A Practitioner’s Guide to Choosing Relevant Big Data Architecture
book

Scalable Big Data Architecture: A Practitioner’s Guide to Choosing Relevant Big Data Architecture

by Bahaaldine Azarmi
January 2016
Intermediate to advanced
160 pages
3h 35m
English
Apress
Content preview from Scalable Big Data Architecture: A Practitioner’s Guide to Choosing Relevant Big Data Architecture

CHAPTER 4

image

Streaming Data

In the previous chapter, we focused on a long-term processing job, which runs in a Hadoop cluster and leverages YARN or Hive. In this chapter, I would like to introduce you to what I call the 2014 way of processing the data: streaming data. Indeed, more and more data processing infrastructures are relying on streaming or logging architecture that ingest the data, make some transformation, and then transport the data to a data persistency layer.

This chapter will focus on three key technologies: Kafka, Spark, and the ELK stack from Elastic. We will work on combining them to implement different kind of logging architecture ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Designing Big Data Platforms

Designing Big Data Platforms

Yusuf Aytas

Publisher Resources

ISBN: 9781484213261Purchase book