May 2017
Intermediate to advanced
348 pages
7h 8m
English
In Hadoop, centralized cache management is an explicit mechanism for caching the most frequently used files. Users can configure the path to be cached by HDFS, which prevents them from being evicted from memory. Namenode is responsible for coordinating all the Datanode caches in the cluster and periodically receives a cache report.
For this recipe, you will again need a running cluster with at least the HDFS daemons running in the cluster.
master1.cyrus.com master node and switch to user hadoop.
$ hdfs cacheadmin -addPool sales
Read now
Unlock full access