Attempting to upgrade a Tanzu Kubernetes Cluster (TKC) to Tanzu Kubernetes Release (TKR) version 1.28.8 resulted in failure. Although the cluster was edited successfully to reference the new version, neither control plane nor worker nodes rolled out. This caused the upgrade process to stall.
The following log message from the tkr-status-resolver pod on the supervisor confirmed that the issue was related to OSImage resolution:
could not resolve TKR/OSImage for controlPlane, machineDeployments: (workers), query: (controlPlane: (k8sVersionPrefix: ‘v1.28.8+vmware.1-fips.1-tkg.
Tanzu Kubernetes Runtime
TKR 1.28.8 introduced a new structure allowing a single TKR to reference multiple OSImages (e.g., Photon and Ubuntu). In this particular case, the TKR object for version 1.28.8 was missing a reference to the Photon OSImage.
As a result, the cluster—expecting to use a Photon-based OSImage (the default)—could not resolve a suitable image to create new nodes. The absence of this reference blocked the rollout.
To resolve the issue:
kubectl get tkr | grep <tkr version>kubectl delete tkr <tkr version>kubectl get tkr | grep <tkr version>kubectl delete pod vmware-system-tkg-controller-manager-<guid> -n <namespace>kubectl get osimage | grep tkr version