NOTE: This KB article is appropriate only for clustered deployments of VCF Operations for Networks.
If you have a simple deployment (i.e. one, and only one Platform node, regardless of the number of Collector nodes), please refer to KB 433022 - "HRegion Service running but not healthy" in a Simple Deployment (one Platform node) of VCF Operations for Networks
OBSERVATIONS:
While logged in to the VCF Operations for Networks GUI, under Settings --> Infrastructure and Support --> Infrastructure and Updates, you observe one or more "Problem(s)" alerts.
The principal alert of concern regarding this KB is "HRegionServer is running but not healthy."
There may be other alerts that appear as well, including examples like:
NOTE: VCF Operations for Networks was formerly named Aria Operations for Networks (AON), and prior to that was named vRealize Network Insight (vRNI).
VCF Operations for Networks
The precise root cause is indeterminate; however, the symptoms indicate the presence of an HBase/HDFS database inconsistency.
This condition can be caused by one, or a combination, of the following scenarios:
NOTE:
A reboot triggered by changes made with the "change-network-settings" CLI command, such as those described in the KB "Add/Modify the IP Address, Gateway, Netmask and DNS server/IP after VMware Aria Operations for Networks appliances are deployed", will NOT cause the symptoms described in this KB.
If you have encountered this issue, ensure you DO NOT perform any manual shutdown or reboot procedure of Platform Node(s).
ub
cd /home/ubuntu/
./run_all.sh uptime
./run_all.sh df -h
./run_all.sh sudo /home/ubuntu/check-service-health.sh -p -d
sudo -u hbase hbase hbck
sudo cat /home/ubuntu/build-target/deployment/patch.txt
sudo cat /home/ubuntu/build-target/deployment/appliance.status
sudo grep id: /etc/vnera/deployment/deployment.def
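If the output of the commands above needs to be kept together (for example, to attach to a support case), one possible approach is a small wrapper that appends each command and its output to a single log file. This is only a sketch, not an official tool: the LOG variable, the log file location under /tmp, and the run_and_log function name are assumptions introduced for illustration. It assumes you have already switched to the ubuntu user, and it does nothing on a machine where /home/ubuntu/run_all.sh is not present.

```shell
#!/usr/bin/env bash
# Sketch only: LOG and run_and_log are hypothetical names, not part of the product.
LOG="${LOG:-/tmp/hregion-diagnostics-$(date +%Y%m%d-%H%M%S).log}"

# Run one command, appending the command line, its output, and its exit code to $LOG.
run_and_log() {
    {
        echo "===== $* ====="
        "$@" 2>&1
        echo "(exit code: $?)"
        echo
    } >> "$LOG"
}

# Only attempt the diagnostics on an appliance where run_all.sh is present.
if [ -x /home/ubuntu/run_all.sh ]; then
    cd /home/ubuntu/ || exit 1
    run_and_log ./run_all.sh uptime
    run_and_log ./run_all.sh df -h
    run_and_log ./run_all.sh sudo /home/ubuntu/check-service-health.sh -p -d
    run_and_log sudo -u hbase hbase hbck
    run_and_log sudo cat /home/ubuntu/build-target/deployment/patch.txt
    run_and_log sudo cat /home/ubuntu/build-target/deployment/appliance.status
    run_and_log sudo grep id: /etc/vnera/deployment/deployment.def
    echo "Diagnostics written to $LOG"
fi
```

A failing command does not stop the run; its exit code is recorded in the log so that the remaining checks are still collected.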
In a clustered environment, whenever the Platform node cluster needs to be shut down (for example, to take powered-off VM snapshots), the Platform nodes should never be shut down or rebooted manually.