The NodeManager logs indicate a failure similar to the one below:
2014-02-26 15:31:55,178 INFO org.apache.hadoop.yarn.server.nodemanager.ContainerExecutor: setsid exited with exit code 0
2014-02-26 15:31:55,182 WARN org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor: Exit code from container is : 24
2014-02-26 15:31:55,183 INFO org.apache.hadoop.yarn.server.nodemanager.ContainerExecutor: configuration tokenization failed
2014-02-26 15:31:55,183 FATAL org.apache.hadoop.yarn.server.nodemanager.NodeManager: Error starting NodeManager
org.apache.hadoop.yarn.YarnException: Failed to initialize container executor
    at org.apache.hadoop.yarn.server.nodemanager.NodeManager.init(NodeManager.java:144)
    at org.apache.hadoop.yarn.server.nodemanager.NodeManager.initAndStartNodeManager(NodeManager.java:321)
    at org.apache.hadoop.yarn.server.nodemanager.NodeManager.main(NodeManager.java:359)
Caused by: java.io.IOException: Linux container executor not configured properly (error=24)
    at org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor.init(LinuxContainerExecutor.java:135)
    at org.apache.hadoop.yarn.server.nodemanager.NodeManager.init(NodeManager.java:142)
    ... 2 more
Caused by: org.apache.hadoop.util.Shell$ExitCodeException: Can't get configured value for yarn.nodemanager.linux-container-executor.group.
    at org.apache.hadoop.util.Shell.runCommand(Shell.java:202)
    at org.apache.hadoop.util.Shell.run(Shell.java:129)
    at org.apache.hadoop.util.Shell$ShellCommandExecutor.execute(Shell.java:322)
    at org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor.init(LinuxContainerExecutor.java:130)
    ... 3 more

2014/02/28 02:37:22 INFO mapreduce.Job: Job job_1393582635312_0006 failed with state FAILED due to: Application application_1393582635312_0006 failed 1 times due to AM Container for appattempt_1393582635312_0006_000001 exited with exitCode: -1000 due to: java.io.IOException: App initialization failed (139) with output:
    at org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor.startLocalizer(LinuxContainerExecutor.java:191)
    at org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService$LocalizerRunner.run(ResourceLocalizationService.java:860)
Caused by: org.apache.hadoop.util.Shell$ExitCodeException:
    at org.apache.hadoop.util.Shell.runCommand(Shell.java:202)
    at org.apache.hadoop.util.Shell.run(Shell.java:129)
    at org.apache.hadoop.util.Shell$ShellCommandExecutor.execute(Shell.java:322)
    at org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor.startLocalizer(LinuxContainerExecutor.java:183)
    ... 1 more
.Failing this attempt.. Failing the application.
Testing the container-executor.cfg file either does nothing or returns the following:
# cd /usr/lib/gphd/hadoop-yarn/bin
# ./container-executor --checksetup
configuration tokenization failed
Can't get configured value for yarn.nodemanager.linux-container-executor.group.
If the NodeManager fails to start after a secure cluster has been configured, check for the symptoms discussed above in the node's /var/log/gphd/hadoop-yarn/yarn-yarn-nodemanager-*.log file.
This error can occur if container-executor.cfg does not have a banned.users entry or has an empty "banned.users=" entry.
# cd /etc/gphd/hadoop/conf
# cat container-executor.cfg
#configured value of yarn.nodemanager.linux-container-executor.group
yarn.nodemanager.linux-container-executor.group=yarn
#comma separated list of users who can not run applications
#Prevent other super-users
min.user.id=400

# cd /etc/gphd/hadoop/conf
# cat container-executor.cfg
#configured value of yarn.nodemanager.linux-container-executor.group
yarn.nodemanager.linux-container-executor.group=yarn
#comma separated list of users who can not run applications
banned.users=
#Prevent other super-users
min.user.id=400
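The two conditions described above (a missing banned.users entry and an empty one) can be checked with a small shell function. This is only a sketch: check_banned_users is a hypothetical helper name, not part of Hadoop, and it assumes the file holds at most one banned.users line and that optional whitespace around "=" should be tolerated. The default path is the one used in this article.

```shell
# Hypothetical helper (not part of Hadoop): report whether banned.users in a
# container-executor.cfg is "missing", "empty", or "set".
check_banned_users() {
    # Default to the config path used in this article.
    cfg="${1:-/etc/gphd/hadoop/conf/container-executor.cfg}"

    # Find the first non-comment banned.users line, if there is one.
    # Assumption: whitespace around the key and "=" is tolerated here.
    line=$(grep -E '^[[:space:]]*banned\.users[[:space:]]*=' "$cfg" | head -n 1)
    if [ -z "$line" ]; then
        echo "missing"
        return 1
    fi

    # Drop everything up to the first "=" and any whitespace, then see
    # whether a value is left.
    value=$(printf '%s' "$line" | sed -e 's/^[^=]*=//' -e 's/[[:space:]]//g')
    if [ -z "$value" ]; then
        echo "empty"
        return 1
    fi

    echo "set"
}
```

Running check_banned_users with no argument inspects /etc/gphd/hadoop/conf/container-executor.cfg; pass another path to check a different copy. Against the two example files above it would print "missing" and "empty" respectively, and it returns a non-zero status in both cases.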