October 2018
Beginner
220 pages
5h 33m
English
So far, we have seen how one can run the MapReduce program directly on the server. It is possible to send the program to a remote Hadoop cluster for running it. All you need to ensure is that you have set the resource manager address, fs.defaultFS, library files, and mapreduce.framework.name correctly before running the actual job. So, your program snippet would look something like this:
Configuration conf = new Configuration();conf.set("yarn.resourcemanager.address", "<your-hostname>:<port>"); conf.set("mapreduce.framework.name", "mapreduce"); conf.set("fs.defaultFS", "hdfs://<your-hostname>/"); conf.set("yarn.application.classpath", "<client-jar-libraries");conf.set("HADOOP_USER_NAME","<pass-username>");conf.set("mapreduce.job.jar","myjobfile.jar"); ...Read now
Unlock full access