The integration of Storm with YARN

Storm is a distributed computation framework developed to process streaming data in real time, and it has been released as open source. It is useful for continuous monitoring of processes and for running machine learning algorithms. Benchmarks have clocked Storm at over a million records per second per node, and it is widely used for low-latency processing.

Storm has two main services: Nimbus (master) and Supervisor (slave). Storm requires ZooKeeper for coordination between the Nimbus and Supervisor services. The Storm bundle contains a storm.yaml configuration file, which holds information about the Nimbus server and the ZooKeeper quorum. As with Spark, this book does not focus ...
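
As a rough illustration of the kind of entries storm.yaml holds, a minimal sketch might look like the following. The hostnames and the local directory are hypothetical placeholders, not values from this book. The nimbus.host key is the one used by Storm 0.9.x releases; Storm 1.0 and later use nimbus.seeds (a list of hosts) instead.

```yaml
# Minimal storm.yaml sketch. All hostnames and paths are hypothetical.

# ZooKeeper quorum used for coordination between Nimbus and the Supervisors
storm.zookeeper.servers:
  - "zk1.example.com"
  - "zk2.example.com"
  - "zk3.example.com"

# Nimbus (master) server; on Storm 1.0+ use nimbus.seeds: ["nimbus.example.com"]
nimbus.host: "nimbus.example.com"

# Local working directory for Storm daemons (assumed path)
storm.local.dir: "/var/storm"

# One worker slot per port on each Supervisor node
supervisor.slots.ports:
  - 6700
  - 6701
```

Each Supervisor reads the same ZooKeeper and Nimbus settings, so the file is typically kept identical across the nodes of a cluster.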
