March 2020
Intermediate to advanced
406 pages
8h 39m
English
Apache Kafka is referred to as a distributed streaming system, which is just another way of saying a distributed job queue. Kafka, which is written in Java, uses the idea of a publish/subscribe model for message queueing. It's often used for writing real-time streaming data pipelines.
We'll assume you have a Kafka instance set up already. If you don't, you can use the following bash script to get a quick Kafka instance:
#!/bin/bashrm -rf kafka_2.12-2.3.0wget -c http://apache.cs.utah.edu/kafka/2.3.0/kafka_2.12-2.3.0.tgztar xvf kafka_2.12-2.3.0.tgz./kafka_2.12-2.3.0/bin/zookeeper-server-start.sh kafka_2.12-2.3.0/config/zookeeper.properties &./kafka_2.12-2.3.0/bin/kafka-server-start.sh kafka_2.12-2.3.0/config/server.propertieswait
We can ...
Read now
Unlock full access