Would like to troubleshoot this to understand why some collectors go into this state after a bit. So far my resolution would be to bounce activemq. But doing this flushes any queued data. This is not desirable. How do we troubleshoot this issue to better understand "what" is wrong?
In Administration -> Monitored Items Management -> Data Collectors we see all three status values (Configuration Status, Polling Status and Status columns) with red "Not Connected" values.
In the System Status Data Collector section we see all three status values (Configuration Status, Polling Status and Status columns) with red "Not Connected" values.
In the current Data Aggregator (default path shown) /opt/IMDataAggregator/broker/apache-activemq-<version>/data/activemq.log file we see these WARN messages. That will be one for each Data Collector impacted.
2024-11-08 15:54:54,014 | WARN | Usage(default:memory:queue://DIP-poll.responses.irep-<DCM_ID>adc:memory) percentUsage=100%, usage=209739881, limit=209715200, percentUsageMinDelta=1%;Parent:Usage(default:memory) percentUsage=1%, usage=655311976, limit=37768868659, percentUsageMinDelta=1%: Usage Manager Memory Limit reached. Producer (ID:<DCM_ID>-3:16:1:1) stopped to prevent flooding queue://DIP-poll.responses.irep-<DCM_ID>adc. See http://activemq.apache.org/producer-flow-control.html for more info (blocking for: 24410s) | org.apache.activemq.broker.region.Queue | ActiveMQ Transport: tcp:///127.0.0.1:34148@61616
All supported DX NetOps Performance Management releases
Network connection problems between impacted Data Aggregator and Data Collector servers.
We have observed disconnects ranging from 1 to 3 minutes in duration can result in this behavior.
In some instances it may be sufficient to simply stop and restart the AMQ service to resolve the issue.
If the issue is not resolved a clean dcmd restart will help. On the problem Data Collector(s) follow these steps:
The "./dcmd clean" helps to clear out old data and configuration files. Upon dcmd restart it forces creation of a fresh data directory and ensures it the correct ownership settings for the install user.
The command ./dcmd clean performs several actions to accomplish this. This summarizes the steps the "./dcmd clean" takes.