VKS cluster worker nodes fail to move to ESXi host after exiting maintenance mode
search cancel

VKS cluster worker nodes fail to move to ESXi host after exiting maintenance mode

book

Article ID: 428336

calendar_today

Updated On:

Products

VMware vSphere Kubernetes Service

Issue/Introduction

Removing a host from maintenance mode on a Supervisor Cluster does not move VKS cluster worker nodes to it.

Symptoms:

  • Workload Management > Supervisor > Monitor page reports the below error for the ESXi host that was removed from maintenance mode:
    Failed to find any accessible datastores for storage policy #######-########-####-############ datastore URIs: [ds:///vmfs/volumes/#######-########-####-############/] 

  • VKS worker nodes remain on existing hosts and DRS does not migrate them to the new host.

  • SSH session on the ESXI host shows the datastore UUID mentioned in the error is not visible in /vmfs/volumes/ directory.  

Environment

VMware vSphere with Tanzu
VMware vSphere Kubernetes Service

Cause

The storage policy used by the Supervisor Cluster requires the shared datastore to be mounted and accessible on every host within the cluster.
If a host is added or removed from maintenance mode without the datastore being correctly mounted, VKS worker nodes cannot be scheduled on that host due to storage inaccessibility.

Resolution

Perform a storage rescan to ensure the required datastore is mounted on the affected host.

  1. Select the affected ESXi host in the Inventory.

  2. Navigate to Configure > Storage > Storage Adapters.

  3. Click Refresh, then click Rescan Storage.

  4. Verify the mount state via SSH on the ESXi host :

    ls -l /vmfs/volumes | grep <DATASTORE_NAME>
    

Once the datastore is accessible, the host status should transition to Ready without warnings.


If the datastore remains inaccessible or the symlink is missing in the /vmfs/volumes directory, please raise a Support Request. 

Additional Information

Datastore name displays in red on the ESXi host CLI (323016)

Performing a rescan of the storage on an ESXi host (308546)

vSphere with Tanzu PVC creation failure due to "No candidates datastores found" (323448)


Japanese version: VKS クラスタ ワーカー ノードがメンテナンス モードを終了した後に ESXi ホストに移動できない。