vCenter Enhanced Linked Mode replication broken after vCenter IP address change.
search cancel

vCenter Enhanced Linked Mode replication broken after vCenter IP address change.

book

Article ID: 406041

calendar_today

Updated On:

Products

VMware vCenter Server VMware Live Recovery

Issue/Introduction

Symptoms:

  • vCenter Server’s IP address was changed, resulting in a disruption of LDAP-based replication between the nodes participating in ELM.
  • This change also impacts the SRM site pairing between the respective vCenters.
    • The /var/log/vmware/sso/vmware-identity-sts.log file on impacted_vcenter indicates repeated authentication failures from the SRM appliance at the remote site. Key log entries point to login failures for the SRM solution user, referencing the service account SRM-51dd790f-####-####-####[email protected].

yyyy-mm-ddThh:mm:ss ERROR sts[68:tomcat-http--31] [CorId=fc63c4fd-3a44-4d09-b7fd-50497738de44] [com.vmware.identity.idm.server.IdentityManager] Failed to checkUserAccountFlags principal [SRM-51dd790f-####-####-####[email protected]] for tenant [vsphere.local]
yyyy-mm-ddThh:mm:ss INFO sts[68:tomcat-http--31] [CorId=fc63c4fd-3a44-4d09-b7fd-50497738de44] [com.vmware.identity.diagnostics.VmEventAppender] EventLog: source=[VMware Identity Server], tenant=[vsphere.local], eventid=[USER_NAME_PWD_AUTH_FAILED], level=[ERROR], category=[VMEVENT_CATEGORY_STS], text=[Failed to authenticate principal [SRM-51dd790f-####-####-####[email protected]]. Login failed], detailText=[Login failed], corelationId=[fc63c4fd-3a44-4d09-b7fd-50497738de44], timestamp=[1753847425240]

Environment

  • VMware vCenter 7.x
  • VMware vCenter 8.x
  • VMware Site Recovery Manager 8.x
  • VMware Live Recovery 9.x

Cause

  • The issue originates after the vCenter IP address is modified. Post-IP change, LDAP port 389 between the vCenters is inaccessible, leading to the breakdown of replication between the Platform Services Controllers or embedded vCenter nodes.
  • The vmdird service, responsible for LDAP-based replication and directory services, was found to be in a non-normal state, preventing outbound replication to other nodes.

Evidence from logs confirms LDAP and replication-related failures:

  • /var/log/vmware/vmdir/vdcrepadmin.log indicate unavailability of partner status:

yyyy-mm-ddThh:mm:ss:t@140239405298816:WARNING: VmDirGetReplicationPartnerStatus, partner (impacted_vcenter) status not available (53)

  • /var/log/vmware/vmdird/vmdird.log confirm missing replication agreements and vmdird not in NORMAL state:

yyyy-mm-ddThh:mm:ss:t@139742532335168:ERROR: VmDirIsHostAPartner: No replication agreement entries found under cn=impacted_vcenter,cn=Servers,cn=default-site,cn=Sites,cn=Configuration,dc=VSPHERE,dc=LOCAL
yyyy-mm-ddThh:mm:ss:t@139742532335168:ERROR: VmDirIsHostAPartner failed. Error(1168)

yyyy-mm-ddThh:mm:ss:t@139868336244288:ERROR: _VmDirSearchPreCondition: Server in not in normal mode, not allowing outward replication.
yyyy-mm-ddThh:mm:ss:t@139868336244288:ERROR: VmDirSendLdapResult: Request (Search), Error (LDAP_UNWILLING_TO_PERFORM(53)), Message (Server in not in normal mode, not allowing outward replication.), (0) socket (ip_address)

  • /var/log/vmware/lookupsvc/lookupserver-default.log captures repeated LDAP server connection failures on port 389:

yyyy-mm-ddThh:mm:ss pool-2-thread-65                                                           ERROR com.vmware.vim.lookup.impl.LdapStorage] LDAP action failed; host=impacted_vcenter, port=389
com.vmware.sso.interop.ldap.ServerDownLdapException: Can't contact LDAP server

These logs confirm that the vCenter’s vmdird service could not establish LDAP connections over port 389 due to network port block or misconfiguration, ultimately leading to replication breakage and SRM authentication failures.

Resolution

  1. Network Remediation:

    • Engage the Network team to ensure TCP port 389 is open bidirectionally between the vCenters/PSC nodes. This port is critical for LDAP communication used in vmdird replication.

    • Refer VMware Ports and Protocols for more details.
  2. Restore vmdird Service to Normal Mode:

    • SSH into the impacted vCenter as root and run the vdcadmintool:
      /usr/lib/vmware-vmdir/bin/vdcadmintool

    • Select option 5: "Set vmdir state to normal"

    • Confirm that the vmdird service enters NORMAL state and replication resumes.

  3. If vmdird state change fails:

If issue persists kindly open a support request with Broadcom Support for further assistance.

Additional Information