Connecting to YARN
If you run YARN (aka MapReduce 2.0, or MRv2) instead of the first generation of MapReduce framework on your Hadoop cluster, the following steps are required to configure the CloverDX Hadoop connection:
Write an arbitrary value into the MapReduce JobTracker Host field. This value will not be used, but will ensure that MapReduce job execution is enabled for this Hadoop connection.
Add this key-value pair to Hadoop Parameters:
In the Hadoop Parameters, add the key
yarn.resourcemanager.addresswith a value in form of a colon separated hostname and port of your YARN ResourceManager, e.g.
You will probably have to specify the
yarn.application.classpath parameter too, if the default value from
yarn-default.xml isn’t working.
In this case, you would probably find some
java.lang.NoClassDefFoundError in the log of the failed YARN application container.