Cluster Status listed as offline, system displays "Retrieving cluster status..." indefinitely, without further progress
search cancel

Cluster Status listed as offline, system displays "Retrieving cluster status..." indefinitely, without further progress

book

Article ID: 404805

calendar_today

Updated On:

Products

VCF Operations/Automation (formerly VMware Aria Suite)

Issue/Introduction

1. The Aria Operations cluster status shows offline. When attempting to bring it back online via the "Bring Cluster Online" option, the system displays "Retrieving cluster status..." indefinitely, without further progress.

2. The following ERROR logs are observed in /storage/vcops/log/analytics-########.log on primary node:

2025-07-17T05:33:08,666+0000 ERROR [Analytics Main Thread]  com.integrien.analytics.AnalyticsMain.createGemfireCache - Can not connect to gemfire: Problem configuring membership services
org.apache.geode.GemFireConfigException: Problem configuring membership services

...

Caused by: org.apache.geode.distributed.internal.membership.api.MembershipConfigurationException: Unable to join the distributed system. Could not contact any of the locators: [/#.#.#.#:6061] (Note: #.#.#.# appears to be the wrong IP address of the node.)

...

Caused by: java.net.SocketTimeoutException: connect timed out

3. You followed KB Change the IP Address of Aria Operations 8.14 or later Deployment and updated the IP address in all fields. However, the cluster gets stuck at "Going online", and then shows "Failure" when trying to take it offline. The following ERROR logs are observed in /storage/vcops/log/analytics-########.log on the primary node:

2025-07-18T15:36:24,712+0000 ERROR [Analytics Main Thread]  com.integrien.analytics.AnalyticsMain.createGemfireCache - Can not connect to gemfire: Problem configuring membership services
org.apache.geode.GemFireConfigException: Problem configuring membership services

...

Caused by: org.apache.geode.distributed.internal.membership.api.MembershipConfigurationException: Unable to join the distributed system. Could not contact any of the locators: [primary_ip:6061]

...

Caused by: java.net.UnknownHostException: primary_ip

Environment

Aria Operations 8.18.x

Cause

1. The IP address has changed, but it's not updated in the required files.

2. When following Step 7-n in Change the IP Address of Aria Operations 8.14 or later Deployment, the KB states to run one of the commands below, but "primary_ip" or "replica_ip" was not replaced with the actual IP address of the associated cluster node.

Command for single node or non-HA multi node cluster:
$VMWARE_PYTHON_BIN ./vcopsConfigureRoles.py --adminCS=primary_ip > /dev/null 2>&1

Command for HA multi node or CA multi node cluster:
$VMWARE_PYTHON_BIN ./vcopsConfigureRoles.py --adminCS=primary_ip,replica_ip > /dev/null 2>&1

Resolution

When changing the IP address of the nodes, follow Change the IP Address of Aria Operations 8.14 or later Deployment, ensuring to use the actual IP addresses for  "primary_ip" or "replica_ip" when running commands at Step 7-n.