Learning Apache Apex
by Ananth Gundabattula, Thomas Weise, Munagala V. Ramanath, David Yan, Kenneth Knowles
Apache Kafka
In its own words, Apache Kafka (http://kafka.apache.org/) "is used for building real-time data pipelines and streaming apps. It is horizontally scalable, fault-tolerant, wicked fast, and runs in production in thousands of companies".
Apache Kafka is a distributed, horizontally scalable, fault tolerant and high-throughput pub-sub messaging system. In contrast to similar messaging systems, Kafka was not only designed as a transport mechanism, but also functions as a durable data store that can hold huge amounts of data stored in logs on disk on commodity hardware. It provides access to consumers of the data that is quite similar to a file API. We may even think of Kafka as providing remote access to potentially very large log files. ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Read now
Unlock full access