Use either of these commands to list and identify the existing rsync processes on each node in the database cluster.
- ps -ef | grep rsync
- lsof -i :50000 -S
Do we see rsync processes that have a child, but not a parent, PID listed? If yes continue. If no there may be other causes yet to be identified. Please open a support case for additional investigation.
If we see rsync processes that have no Parent ID but do have a Child PID use the 'kill -9 <PID>' command, entering the child PID of the zombie rsync process to shut it down. Run the above ps or lsof commands to confirm it's been stopped and removed from the results.
Select cases have shown rsync processes with both a Parent PID and Child PID, as well as showing parameters tying it to the /opt/vertica/bin directory. There should be no rsync processes left running if there is no database backup actively running. If we see this and see that:
- The Parent PID remains and wasn't removed but is invalid. Does the parent PID exist against another process?
- There is no active database backup running.
It is appropriate to kill those processes as well using the 'kill -9 <PID>' command.