1. Check the
/var/log/secure logs on the segment hosts for possible causes for the authentication failure.
In this particular case, we were getting the following errors:
Oct 9 10:36:10 datanode01 sshd[49954]: pam_sss(sshd:account): Access denied for user gpadmin: 4 (System error)
Oct 9 10:36:10 datanode01 sshd[50427]: Received disconnect from 10.1.1.1 port 55132:11: disconnected by user
Oct 9 10:36:10 datanode01 sshd[50427]: Disconnected from 10.1.1.1 port 55132
Oct 9 10:36:10 datanode01 sshd[49954]: fatal: Access denied for user gpadmin by PAM account configuration [preauth]
Oct 9 10:36:10 datanode01 sshd[49900]: pam_unix(sshd:session): session closed for user gpadmin
2. The issue was being reported by sss daemon when performing PAM authentication.
3. Disabling PAM authentication would fix the issue, but if PAM authentication cannot be disabled, the following parameter can be specified in
/etc/sssd/sssd.conf:
selinux_provider=none
4. This is due to a bug reported for some Linux builds for heavily loaded clients, even if a client has selinux disabled, it was found that ssh access is still randomly denied because of selinux failures. You need to explicitly add
selinux_provider=none to
sssd.conf to avoid seeing these.
That is why the failure occurred during gpbackup execution. This failure occurs when multiple ssh commands are run at the same time but not when manually running the command.
Note: This requirement for configuring the system for GPDB installation will be added to the Greenplum documentation.