Remote Collector fails to stay connected in vRealize Operations Manager 6.2.x
search cancel

Remote Collector fails to stay connected in vRealize Operations Manager 6.2.x

book

Article ID: 337508

calendar_today

Updated On:

Products

VMware Aria Suite

Issue/Introduction

Symptoms:

  • Multiple remote collectors goes into Collector Down state and were unable to come back online in vRealize Operations Manager UI.
  • You observe that few adapter instaces in remote collector takes more than 5 mins to collect the data.
  • In the logs\adapters\VMwareAdapter\VMwareAdapter_62524.log file, you see the entries similar to:

    2016-12-10 09:28:21,895 INFO [Collector worker thread 10] (62524) com.integrien.alive.common.adapter3.AdapterBase.collectBase - End collection, took 381 seconds.

  • In the /data/vcops/log/collector.log file, you see entries similar to:

    [warning 2016/12/10 09:27:19.398 CET vRealize Ops Collector-9a76d719-cee8-####-####-########014 <poolTimer-CollectorTovRealize Ops Controller-dd7a1381-e794-####-####-########71e2Pool-vRealize Ops Controller-dd7a1381-e794-####-####-########1e2-[157.96.177.13:6061]-17474> tid=0x3ad74] Pool unexpected closed socket on server connection=SubscriptionConnectionImpl[157.96.177.21:10000:closed]). Server unreachable: could not connect after 1 attempts

    Note: The preceding log excerpts are only examples. Date, time, and environmental variables may vary depending on your environment.

Cause

This issue occurs due to ping loss when vRealize Operations Manager collector node pings controller to check the heartbeat of the controller node.

Resolution

To resolve this issue, use these options:
  • Increase the collection interval of the adapter instances.
  • Check the latency between Remote collector and analytics node ( should be less than 200ms).
  • Correct ping latency and intermittent network communication problems.