Symptoms:
- Creating a native Kubernetes cluster with an NFS node fails in VMware Cloud Director Container Service Extension (CSE) 3.X.
- The cluster deployment fails with errors related to the NFS node.
- The cse-server-debug.log file on the CSE server contains errors similar to the following:
| cluster_service_2_x:2877 - _execute_script_in_nodes | DEBUG :: about to execute script on nfsd-xxxx (vm='vim.VirtualMachine:vm-123'), wait=True
| cluster_service_2_x:2823 - _wait_for_guest_execution_callback | DEBUG :: waiting for process 1699 on vm 'vim.VirtualMachine:vm-123' to finish (1)
| cluster_service_2_x:2823 - _wait_for_guest_execution_callback | DEBUG :: exception, will retry in a few seconds, vm 'vim.VirtualMachine:vm-123'
| cluster_service_2_x:2825 - _wait_for_guest_execution_callback | ERROR :: exception: (vim.fault.GuestOperationsUnavailable) {
   dynamicType = <unset>,
   dynamicProperty = (vmodl.DynamicProperty) [],
   msg = 'The guest operations agent could not be contacted.',
   faultCause = <unset>,
   faultMessage = (vmodl.LocalizableMessage) []
}
| cluster_service_2_x:2906 - _execute_script_in_nodes | ERROR :: Error executing script in node nfsd-xxxx: process not found (pid=1699) (vm='vim.VirtualMachine:vm-123')
Traceback (most recent call last):
  File "/root/.local/lib/python3.7/site-packages/container_service_extension/rde/backend/cluster_service_2_x.py", line 2889, in _execute_script_in_nodes
    callback=_wait_for_guest_execution_callback)
  File "/usr/local/lib/python3.7/site-packages/vsphere_guest_run/vsphere.py", line 216, in execute_script_in_guest
    callback=callback)
  File "/usr/local/lib/python3.7/site-packages/vsphere_guest_run/vsphere.py", line 123, in execute_program_in_guest
    raise e
  File "/usr/local/lib/python3.7/site-packages/vsphere_guest_run/vsphere.py", line 89, in execute_program_in_guest
    (pid, vm))
Exception: process not found (pid=1699) (vm='vim.VirtualMachine:vm-123')
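
To check whether a given cse-server-debug.log matches this symptom, the log can be scanned for the fault and the script failure that follows it. The following is a minimal sketch and not part of CSE. The function name find_guest_ops_failures and the regular expression are assumptions derived only from the excerpt above, so the pattern may need adjusting if the log format differs in other CSE versions.

```python
import re
import sys

# Assumption: the failing-node line has the same shape as in the excerpt above.
NODE_ERROR = re.compile(
    r"ERROR :: Error executing script in node (?P<node>\S+): "
    r"process not found \(pid=(?P<pid>\d+)\) \(vm='(?P<vm>[^']+)'\)"
)
GUEST_OPS_FAULT = "vim.fault.GuestOperationsUnavailable"


def find_guest_ops_failures(lines):
    """Return (node, pid, vm) for each script failure that follows a
    GuestOperationsUnavailable fault in the given log lines."""
    failures = []
    fault_seen = False
    for line in lines:
        if GUEST_OPS_FAULT in line:
            # Remember the fault; the node error is logged a few lines later.
            fault_seen = True
            continue
        match = NODE_ERROR.search(line)
        if match and fault_seen:
            failures.append((match["node"], int(match["pid"]), match["vm"]))
            fault_seen = False
    return failures


if __name__ == "__main__" and len(sys.argv) > 1:
    # Usage: python scan_cse_log.py <path to cse-server-debug.log>
    with open(sys.argv[1], encoding="utf-8", errors="replace") as log_file:
        for node, pid, vm in find_guest_ops_failures(log_file):
            print(f"node={node} pid={pid} vm={vm}")
```

The script name scan_cse_log.py in the usage comment is hypothetical. An empty result means only that this particular pattern was not found. It does not rule out other causes of an NFS node failure.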