vRealize Automation three-node appliance topology does not come online correctly during a disaster recovery failover operation
book
Article ID: 329343
calendar_today
Updated On:
Products
VMware Aria Suite
Issue/Introduction
Symptoms: During a disaster recovery failover operation of vRealize Automation with Site Recovery Manager, the three-node vRealize Automation cluster does not come online.
Environment
VMware Validated Design for Software-Defined Data Center (SDDC) VMware Validated Design for Software-Defined Data Center (SDDC) 5.0.x VMware Validated Design for Software-Defined Data Center (SDDC) 4.3.x VMware Validated Design for Software-Defined Data Center (SDDC) 5.1.x
Cause
As of VMware Validated Design for Software-Defined Data Center 4.3, vRealize Automation uses a three-node topology for the virtual appliances. The primary node should be the first node that is powered on for the vRealize Automation appliances. It is not always possible to determine which appliance is running as the primary node as the primary role may have failed over to another appliance before the disaster recovery operation.
Resolution
Modify the Cloud Management Recovery Plan to ensure the 01a node (e.g. vra01svr01a) is powered on first.
In the vSphere Client or the vSphere Web Client, click Site Recovery > Open Site Recovery.
On the Site Recovery home tab, select a site pair and click View Details.
Select the Recovery Plans tab, right-click a recovery plan, and select Edit Plan.
Move vra01svr01a into the priority 1 group with the vRealize Automation IaaS SQL Server.
Insert a break point (User prompt) in the recovery plan after priority 1 VMs
Insert a break point (User prompt) in the recovery plan after priority 2 VMs
The recovery plan will pause after the priority 1 group has started
Prompt user to do the following:
Log into https://vra01svr01a.rainpole.local:5480.
Navigate to the Cluster Tab. (in vRA version 7.4 this setting is under vRA settings -> Database)
Review the Type & State of vra01svr01a.
If Type = PRIMARY
Change the Database Mode to Async.
Resume the recovery plan powering on the priority 2 group.*
Review the Type & State of vra01svr01b & vra01svr01c once powered on.
If State = NA/NA or Up/NA
Click Reset on vra01svr01b followed by Reset on vra01svr01c.
Change the Database Mode to Sync.
Resume the remainder of the Recovery Plan.
If Type != PRIMARY
Click Promote and wait until promotion completes.
Change the Database Mode to Async.
Resume the recovery plan powering on the priority 2 group. *
Review Type & State of vra01svr01b & vra01svr01c once powered on.
If State = NA/NA or Up/NA
Click Reset on vra01svr01b followed by Reset on vra01svr01c.
Change the Database Mode to Sync.
Resume the remainder of the Recovery Plan.
* Note: The Recovery plan will pause after the priority 2 group has started