HAWQ fails to start with the error "Failed to start Master instance in admin mode"

Article ID: 295098

Products

Services Suite

Issue/Introduction

Symptoms:
When starting HAWQ, the following may be seen:
[gpadmin@cl1554 init.d]$ service hawq start 
20161012:05:53:53:609170 gpstart:cl1554:gpadmin-[INFO]:-Starting gpstart with args: -a 
20161012:05:53:53:609170 gpstart:cl1554:gpadmin-[INFO]:-Gathering information and validating the environment... 
20161012:05:53:53:609170 gpstart:cl1554:gpadmin-[INFO]:-Greenplum Binary Version: 'postgres (HAWQ) 4.2.0 build 1' 
20161012:05:53:54:609170 gpstart:cl1554:gpadmin-[INFO]:-Greenplum Catalog Version: '201402180' 
20161012:05:53:54:609170 gpstart:cl1554:gpadmin-[INFO]:-Starting Master instance in admin mode 
20161012:05:58:55:609170 gpstart:cl1554:gpadmin-[CRITICAL]:-Failed to start Master instance in admin mode 
20161012:05:58:55:609170 gpstart:cl1554:gpadmin-[CRITICAL]:-Error occurred: non-zero rc: 1 
Command was: 'env GPSESSID=0000000000 GPERA=None $GPHOME/bin/pg_ctl -D /apps/hdfs/data/nn/hawqmaster/gpseg-1 -l /apps/hdfs/data/nn/hawqmaster/gpseg-1/pg_log/startup.log -w -t 600 -o " -p 5432 -b 1 -z 64 --silent-mode=true -i -M master -C -1 -x 66 -c gp_role=utility " start'
rc=1, stdout='waiting for server to start............................................................................................................................... 
........................................................................................................................................................................ 
.........could not start server 
', stderr='pg_ctl: PID file "/apps/hdfs/data/nn/hawqmaster/gpseg-1/postmaster.pid" does not exist

The startup.log file for the master shows the following:
2016-10-12 09:53:55.519809 GMT,,,p609185,th1707034656,,,,0,,,seg-1,,,,,"FATAL","55000","data directory ""/apps/hdfs/data/nn/hawqmaster/gpseg-1"" has group or world access","Permissions should be u=rwx (0700).",,,,,,,"checkDataDir","postmaster.c",1789,
2016-10-12 09:59:58.119876 GMT,,,p609329,th-1399326688,,,,0,,,seg-1,,,,,"FATAL","55000","data directory ""/apps/hdfs/data/nn/hawqmaster/gpseg-1"" has group or world access","Permissions should be u=rwx (0700).",,,,,,,"checkDataDir","postmaster.c",1789,
2016-10-12 10:05:33.659481 GMT,,,p609663,th1654880288,,,,0,,,seg-1,,,,,"FATAL","55000","data directory ""/apps/hdfs/data/nn/hawqmaster/gpseg-1"" has group or world access","Permissions should be u=rwx (0700).",,,,,,,"checkDataDir","postmaster.c",1789,

Environment


Cause

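The master data directory has group or world access. The postmaster requires the data directory to be accessible by its owner only (u=rwx, 0700) and refuses to start otherwise, as shown by the FATAL messages in startup.log.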
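
To confirm the cause before changing anything, the directory can be checked for group or world permission bits. The following is a minimal sketch, not part of HAWQ: the function name has_group_or_world_access is hypothetical, and it assumes GNU find (for the -perm /077 and -maxdepth options).

```shell
# Hypothetical helper (not a HAWQ utility): succeeds if the given directory
# has any group or world permission bit set. Assumes GNU find.
has_group_or_world_access() {
    # -maxdepth 0 tests only the directory itself, not its contents.
    # -perm /077 matches when any of the group/other bits is set.
    [ -n "$(find "$1" -maxdepth 0 -perm /077 2>/dev/null)" ]
}

# Example, using the master directory from the log above:
# if has_group_or_world_access /apps/hdfs/data/nn/hawqmaster/gpseg-1; then
#     echo "master directory is too permissive"
# fi
```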

Resolution

Correct the permissions on the master directory so that only the owner has access (0700). In the example above, the master directory is "/apps/hdfs/data/nn/hawqmaster/gpseg-1".

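The correct permissions for the master directory are shown below (output from a different host, where the master directory is under /data/hawq/master/):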

[root@amb171hawq ~]# ls -ltr /data/hawq/master/
total 8
drwx------. 17 gpadmin gpadmin 4096 May 21 12:36 gpseg-1
[root@amb171hawq ~]#
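
The permission change itself can be sketched as follows. This is an illustrative helper rather than a documented HAWQ procedure: the function name fix_master_dir_perms is hypothetical, the path in the usage comment is the one from the log above, and the command is assumed to be run as the directory owner or as root.

```shell
# Hypothetical helper (not a HAWQ utility): restrict a master data directory
# to owner-only access, then print the resulting mode for verification.
fix_master_dir_perms() {
    dir="$1"
    if [ ! -d "$dir" ]; then
        echo "not a directory: $dir" >&2
        return 1
    fi
    # The postmaster requires u=rwx (0700) on the data directory itself.
    chmod 700 "$dir" || return 1
    # The mode column should now read drwx------
    ls -ld "$dir"
}

# Example, using the master directory from the log above:
# fix_master_dir_perms /apps/hdfs/data/nn/hawqmaster/gpseg-1
# service hawq start
```

Only the top-level data directory is changed; the FATAL message in startup.log refers to that directory and not to its contents.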