VMware Identity Broker is missing one of the three nodes and VCF Services Platform is unable to auto-redeploy the missing node
search cancel

VMware Identity Broker is missing one of the three nodes and VCF Services Platform is unable to auto-redeploy the missing node

book

Article ID: 436704

calendar_today

Updated On:

Products

VCF Operations

Issue/Introduction

One of the three VMware Identity Broker (vIDB) nodes got deleted due to an outage and VMware Services Platform (VMSP) should auto re-deploy the lost node but the same is not happening. Services Platform (VMSP) is unable to auto-redeploy the missing node.

Output of kubectl get pod -n vidb-external shows the pod status as Pending:

root@[<VMSP_Node>] # kubectl get pod -n vidb-external
NAMEREADYSTATUSRESTARTSAGE
vidb-postgres-instance-02/2Running14 (5d ago)51d
vidb-postgres-instance-10/2Pending017s
vidb-postgres-instance-22/2Running12 (4d10h ago)32d
vidb-service-pod10/1Pending030d
vidb-service-pod21/1Running12 (4d10h ago)32d
vidb-service-pod31/1Running20 (16h ago)37d

 

Output of kubectl get nodes shows only two nodes:

root@[<VMSP_Node>] # kubectl get nodes
NAMESTATUSROLESAGEVERSION
<VMSP_Node1>Readycontrol-plane51dv1.32.0+vmware.1-fips
<VMSP_Node2>Readycontrol-plane51dv1.32.0+vmware.1-fips

 
Output of kubectl get machines,vspheremachines -A shows the node as Provisioning:

root@[<VMSP_Node>] # kubectl get machines,vspheremachines -A
NAMESPACEVERSIONNAMECLUSTERNODENAMEPROVIDERIDPHASEAGE
vmsp-platformv1.32.0machine.cluster.###vcf-mgmt-###<Node1>vsphere://###Running51d
vmsp-platformv1.32.0machine.cluster.###vcf-mgmt-###<Node2>vsphere://###Provisioning9m1sec
vmsp-platformv1.32.0machine.cluster.###vcf-mgmt-###<Node3>vsphere://###Running51d



Output of kubectl logs capv-controller-manager-xxxx -n vmsp-platform | vim - shows VM template is missing from the data store while VMSP platform is trying to clone the VM:

E0401 <TimeStamp> l controller.go:316] "Reconciler error" err="failed to reconcile VM: unable to find template by name \"/<Datastore_Name>/vm/vcf-services-platform-template-9.0.1.0.24940697\": vm '/<Datastore_Name>/vm/vcf-services-platform-template-9.0.1.0.24940697' not found" controller="vspherevm" controllerGroup="infrastructure. cluster.x-k8s.io" controllerKind="VSphereVM" VSphereVM="vmsp-platform/<Node_Name>" namespace="vmsp-platform" name="<Node_Name>" reconcileID="########-####-####-####-###########"

Environment

VCF Operations 9.0.x

Cause

VM template is missing from the datastore, preventing the new node from being provisioned.

Resolution

To resolve this issue, clone the template from the datastore where this template is available to the data store where this is required(location indicated by logs) and the new node will get provisioned.

Additional Information

Restore VCF Identity Broker from a File-Based Backup for VCF Instance Recovery