The integration of Storm with YARN
Storm is a distributed computational and processing framework, which was developed to process streaming data in real time. It has been released as open source. It is useful for continuous monitoring of processes and running machine learning algorithms. Storm can process millions of records per second on a single node and is widely used for low-latency processing.
Storm has two main services: Nimbus
(master) and Supervisor
(slave). Storm requires Zookeeper component for co-ordination between the Nimbus
and Supervisor
services. The Storm bundle contains a storm.yaml
configuration file. The file contains information related to the Nimbus
server and the Zookeeper quorum. Similar to Spark, this book does not focus ...
Get Learning YARN now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.