When deploying a TanzuKubernetesCluster (TKC) with a GPU-enabled VM Class on Supervisor 8, the cluster may remain stuck in a Creating/Provisioning state. GPU worker VMs power on but initially show GuestBootstrap = Unknown (NoBootstrapStatus) and later report as unhealthy with NodeHealthy=False and the message Node condition MemoryPressure is True. Non-GPU TKCs deploy successfully, but GPU-enabled TKCs fail to become Ready.
GPU workers are not supported on Photon-based OS images. When a GPU-enabled VM Class is used with Photon OS, bootstrap may partially complete but the node reports MemoryPressure and is marked unhealthy. This prevents the Machines from becoming Ready and blocks the TKC from transitioning to a Ready state. Ubuntu is the supported operating system for GPU-enabled TKCs.
Deploy GPU-enabled TKCs using Ubuntu instead of Photon.
kubectl edit tkc <tkc> -n <namespace>run.tanzu.vmware.com/os-image: ubuntu