This webcast talk will discuss how logs and stream-processing can form a backbone for data flow, ETL, and real-time data processing. It will describe the challenges and lessons learned as LinkedIn built out its real-time data subscription and processing infrastructure. It will also discuss the role of real-time processing and its relationship to offline processing frameworks such as MapReduce.
Table of contents
- Title: I ❤ Logs: Apache Kafka and Real-time Data Integration
- Release date: June 2014
- Publisher(s): O'Reilly Media, Inc.
- ISBN: 978149190830
You might also like
Introduction to Apache Kafka
Currently one of the hottest projects across the Hadoop ecosystem, Apache Kafka is a distributed, real-time …
O'Reilly Strata Data Conference 2019 - New York, New York
The 2019 Strata Data Conference NYC, the biggest Big Data conference in the world, was a …
Data Science from Scratch, 2nd Edition
To really learn data science, you should not only master the tools—data science libraries, frameworks, modules, …
Strata Data Conference - New York, NY 2018
The chief data officer for Goldman Sachs, a cofounder of the blockchain computing platform Ethereum, Google …