This webcast talk will discuss how logs and stream-processing can form a backbone for data flow, ETL, and real-time data processing. It will describe the challenges and lessons learned as LinkedIn built out its real-time data subscription and processing infrastructure. It will also discuss the role of real-time processing and its relationship to offline processing frameworks such as MapReduce.
- Title: I ❤ Logs: Apache Kafka and Real-time Data Integration
- Release date: June 2014
- Publisher(s): O'Reilly Media, Inc.
- ISBN: 978149190830
You might also like
Apache Hadoop 3 Quick Start Guide
A fast paced guide that will help you learn about Apache Hadoop 3 and its ecosystem …
Analyzing Big Data with Hadoop, AWS, and EMR
Hadoop is today's most pervasive technology used in Big Data for distributing the processing of massive …
Joins in Kafka Streams
Start with KStream, KTable, and GlobalKTable, then learn about joins, how to implement them, and when …
Hadoop MapReduce v2 Cookbook - Second Edition
Explore the Hadoop MapReduce v2 ecosystem to gain insights from very large datasets In Detail Starting …