Threshold events generated in DX NetOps CA Performance Management (CAPM) stop appearing in Spectrum after upgrading both DX NetOps Spectrum and CAPM to 22.2.5 or later.
In the Spectrum OneClick server log, we see hundreds of the following:
Rest subscription b5a8e7c3-f6e8-4cfa-9b2c-74a6c264b496 queue no longer full(9001), backlog is 1433
Rest subscription b5a8e7c3-f6e8-4cfa-9b2c-74a6c264b496 queue is full(10000), attempt #1
These are followed by hundreds of:
Sep 03, 2024 04:54:55.101 - IOException: Connection refused: connect
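To gauge how often the queue fills up, the log can be scanned for the "queue is full" message shown above. The following is a minimal sketch, not a supported tool: the regular expression is inferred only from the sample lines in this article, and the function name `count_queue_full` is hypothetical.

```python
import re
from collections import Counter

# Pattern inferred from the sample log lines in this article (an assumption,
# not a documented log format): subscription UUID followed by the queue size.
QUEUE_FULL = re.compile(
    r"Rest subscription ([0-9a-f-]{36}) queue is full\((\d+)\)"
)


def count_queue_full(lines):
    """Count 'queue is full' messages per subscription ID."""
    counts = Counter()
    for line in lines:
        # finditer also copes with several messages run together on one line.
        for match in QUEUE_FULL.finditer(line):
            counts[match.group(1)] += 1
    return counts
```

Any iterable of strings can be passed in, for example an open file handle on a copy of the OneClick server log. A high count for one subscription ID suggests that the subscription is being restarted repeatedly.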
This can be caused by a hard timeout on the event subscription between Spectrum and CAPM. When this timeout is exceeded, the subscription is halted and a new one is created. While this happens, the queue can become full and cause a slight delay in the event sync.
You can adjust the Event sync timeout as per the following:
TechDocs : DX NetOps 23.3 : Update the Configuration of the Alarm Service
Specifically, increase the RequestTimoutSec variable to stop this from happening as follows:
Send a PUT request to the Event Manager (EM) web service:
http://<PC_HOST>:8281/EventManager/webservice/alarmservice/config
In the PUT request body, increase the value of RequestTimoutSec to 300 (default is 120 sec):
<?xml version="1.0" encoding="UTF-8" standalone="yes"?>
<AlarmServiceConfig>
  <Enabled>true</Enabled>
  <Parallelism>1</Parallelism>
  <RequestTimoutSec>300</RequestTimoutSec>
  <SubscriptionRetryDelaySec>300</SubscriptionRetryDelaySec>
  <SubscriptionBatchSize>1000</SubscriptionBatchSize>
  <SubscriptionHeartbeatIntervalSec>30</SubscriptionHeartbeatIntervalSec>
  <SubscriptionTimeoutMs>300000</SubscriptionTimeoutMs>
</AlarmServiceConfig>
GET request on the same URL to see if it has updated:
http://<PC_HOST>:8281/EventManager/webservice/alarmservice/config
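The PUT and GET steps above can also be scripted. The sketch below uses only the Python standard library and rests on several assumptions: the endpoint accepts the XML body with a Content-Type of application/xml, no authentication is needed from the host running the script, and the function names are hypothetical. It reads the current configuration first so that only RequestTimoutSec changes.

```python
import urllib.request
import xml.etree.ElementTree as ET

# Path and port 8281 are taken from the URL in this article.
CONFIG_PATH = "/EventManager/webservice/alarmservice/config"


def config_url(pc_host, port=8281):
    return f"http://{pc_host}:{port}{CONFIG_PATH}"


def with_request_timeout(config_xml, timeout_sec):
    """Return the config XML with RequestTimoutSec set to timeout_sec."""
    root = ET.fromstring(config_xml.encode("utf-8"))
    node = root.find("RequestTimoutSec")
    if node is None:
        raise ValueError("RequestTimoutSec not found in config")
    node.text = str(timeout_sec)
    return ET.tostring(root, encoding="unicode")


def build_put(pc_host, config_xml):
    """Build the PUT request; Content-Type is an assumption."""
    return urllib.request.Request(
        config_url(pc_host),
        data=config_xml.encode("utf-8"),
        method="PUT",
        headers={"Content-Type": "application/xml"},
    )


def get_config(pc_host):
    with urllib.request.urlopen(config_url(pc_host), timeout=30) as resp:
        return resp.read().decode("utf-8")


def update_timeout(pc_host, timeout_sec=300):
    """GET the config, PUT it back with the new timeout, then re-read it."""
    updated = with_request_timeout(get_config(pc_host), timeout_sec)
    with urllib.request.urlopen(build_put(pc_host, updated), timeout=30) as resp:
        resp.read()
    return get_config(pc_host)
```

Calling `update_timeout("<PC_HOST>")` with the real host name returns the configuration as reported after the change, which should show RequestTimoutSec set to 300 if the update was accepted.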