This issue is caused by the
sshd configuration file that has
AuthorizedKeysFile configured:
cat /etc/ssh/sshd_config
(...)
AuthorizedKeysFile .ssh/authorized_keys .ssh/authorized_keys2 /etc/ssh/authorized_keys.d/%u
HostKey /etc/ssh/ssh_host_rsa_key
HostKey /etc/ssh/ssh_host_ed25519_key
(...)
gpstop starts up to 64 connections to all segments to communicate with them by default.
ssh daemon is not able to load many authorization keys at the same time because they are not served fast enough. As a result. some of the connections are failing with
Connection closed by remote host .
Workaround
Decrease the number of connections used by the greenplum utilities by using the
-B option that controls the amount of parallel processes. In the most extreme situation, you can use
gpstop -u -B 1 to communicate with only one segment at that time but
gpstop/gpstart will take more time. In this case it was enough to use
gpstop -u -B 12
Permanent solution
Authorization OS admin has to work on the RSA key server to be able to serve the keys faster on parallel request.