When upgrading VMware Cloud Foundation Automation (VCFA) from version 9.0.0 to 9.0.1, the process may fail with a timeout after two hours. This issue occurs because the vco-app pods in the Kubernetes cluster are stuck in a CrashLoopBackoff status. Specifically, the install-rpms init container fails to complete. You can verify this by checking the logs of the failing container, which will show an error indicating that there is no space left on the device during RPM installation.
vco-app)The persistent volume (PVC) associated with the vco-app pods runs out of available space. This prevents the install-rpms init container from installing the necessary updates required for the 9.0.1 version.
To resolve this issue, you must clean the persistent volumes for each of the three vco-app pods.
kubectl get pods vco-app-0 -n prelude -oyaml | yq '.metadata.uid'
kubectl get pods -n prelude -owide | grep vco-app-0
kubectl get nodes -owide
ssh vmware-system-user@<node_ip_address> sudo su -
mount | grep <pod_uid> | grep pvc
usr/lib/vco directory: cd /var/lib/kubelet/pods/<pod_uid>/volumes/kubernetes.io~csi/<pvc_id>/mount cd usr/lib/vco rm -Rf *
vco-app-1 and vco-app-2. vco-app pods are in a Running state.