This webcast talk will discuss how logs and stream-processing can form a backbone for data flow, ETL, and real-time data processing. It will describe the challenges and lessons learned as LinkedIn built out its real-time data subscription and processing infrastructure. It will also discuss the role of real-time processing and its relationship to offline processing frameworks such as MapReduce.
- Title: I ❤ Logs: Apache Kafka and Real-time Data Integration
- Release date: June 2014
- Publisher(s): O'Reilly Media, Inc.
- ISBN: 978149190830
You might also like
Introduction to Apache Kafka
Currently one of the hottest projects across the Hadoop ecosystem, Apache Kafka is a distributed, real-time …
O'Reilly Strata Data Conference 2019 - New York, New York
The 2019 Strata Data Conference NYC, the biggest Big Data conference in the world, was a …
Streaming Data: Understanding the real-time pipeline
Summary Streaming Data introduces the concepts and requirements of streaming and real-time data systems. The book …
Designing Data-Intensive Applications
Data is at the center of many challenges in system design today. Difficult issues need to …