HA Config for Hub to Hub Failover including probe and queues operation

search cancel

HA Config for Hub to Hub Failover including probe and queues operation

book

Article ID: 368185

calendar_today

Updated On: 05-29-2024

Products

CA Unified Infrastructure Management On-Premise (Nimsoft / UIM) CA Unified Infrastructure Management SaaS (Nimsoft / UIM) DX Unified Infrastructure Management (Nimsoft / UIM)

Issue/Introduction

Customers often ask for guidance on how to configure hubs for failover so remote/secondary hubs can act as failover hubs where remote (agentless) probes have been deployed. This currently only applies to an environment that doesn't use/rely upon MCS for probe/profile deployment.

The HA probe can be used for infrastructure components' high availability, but it does not support monitoring probes with MCS-based configurations to failover.

This feature/functionality may be considered in a future release.

Environment

DX UIM 20.4 or higher

Cause

Guidance

Resolution

Sample HA Config for Hub<->Hub Failover including probe and queues operation

Probes installed and running on Secondary in normal mode of operation.

HA Probe installed and configured on <ABC>_Secondary watching <ABC>_Primary.

Queues and probes listed below from the HA.cfg on a Secondary hub:

<queue_up>

queue_0 = action_manager

queue_1 = alarm_manager

queue_2 = data_engine

queue_3 = ems

queue_4 = event_manager

queue_5 = HAProxy01_AlarmDiscovery

queue_6 = HAProxy01_QOS

queue_7 = HAProxy02_AlarmDiscovery

queue_8 = HAProxy02_QOS

queue_9 = legacy_alarm_manager

queue_10 = probeDiscovery

queue_11 = udm_inventory

</queue_up>

<queue_down>

queue_0 = Secondary_QOSDiscovery

</queue_down>

<probes_up>

probe_0 = data_engine

probe_1 = ems

probe_2 = ace

probe_3 = wasp

probe_4 = qos_processor

probe_5 = discovery_server

probe_6 = mpse

probe_7 = maintenance_mode

probe_8 = udm_manager

probe_9 = sla_engine

probe_10 = trellis

probe_11 = service_host

</probes_up

Configure data_engine with the same credentials as the Primary data_engine and verify it can login to the database. It is ok to start the data_engine and have them both running for a limited amount of time but make sure that the distsrv is deployed and kept up and running otherwise the data_engine will not start.
Deploy and configure the AdminConsole on <ABC>_Secondary Hub:
How to install a second instance of the web-based Admin Console application on a secondary hub
Secondary hub is configured for adminconsole on port 80
http://<hostname or IP_ADDRESS>

HA Queues need to be configured, but disabled on the Secondary. They will be enabled by the HA probe on failover, with some connecting to other hubs/Proxy Hubs.

The Secondary_QOSDiscovery Queue connects to the Primary Hub during normal operation and is disabled on failover.
Nas replication is running between Primary and Secondary during normal operation so no Alarm Queue is needed.

Proxy Fail-Over

The Proxy hub fail-over is straightforward forward with the 2 <City/Location> Proxy hub servers watching each other, ready to pick up the tertiary Hub Queues if one should fail.

<ABC>_Proxy01 ç=è <ABC>_Proxy02

<ABC>_Proxy01 watches <ABC>_Proxy02 which manages Queues for <ABC>_SiteHub03 and 04.

Example in HA.cfg on <ABC>_Proxy01 which will have Queues enabled if <ABC>_Proxy02 stops.

<queue_up>

queue_0 = HASiteHub03_AlarmDiscovery

queue_1 = HASiteHub03_QOS

queue_2 = HASiteHub04_Alarmdiscovery

queue_3 = HASiteHub04_QOS

</queue_up>

<ABC>_Proxy02 in turn watches <ABC>_Proxy01 and will enable its Queues to SiteHub01 and 02 if needed.

Using the Infrastructure Manager (IM), under hub->Advanced Settings->General, make sure the ‘paired’ hubs are configured with the same origin. This is important for all high-availability scenarios.

Feedback

Was this article helpful?

thumb_up Yes

thumb_down No