It was brought to our attention that a specific interface was not being monitored in PM, we checked Spectrum and the interface was present and according to the configuration had been added in late August this year. However PM didn't have the interface listed, we tried rediscovering the device, that didn't work, so we rediscovered the interface MF and that finally added the interface into PM. As the interface is part of a monitoring profile that is "12 hour change detection" unsure why this wasn't picked up.
We would like to investigate why this didn't work as we expected, and if possibly we are missing others due to something not working as expected.
DX NetOps Performance Management Data Aggregator fails to discover new interfaces.
DX NetOps Performance Management Data Aggregator fails to find and update existing interfaces with value changes.
DX NetOps Performance Management Portal web UI shows old out of data ifAlias values for interfaces. Spectrum and the device MIB show the correct current values for ifAlias.
All supported DX NetOps Performance Management releases
Change Detection (CD) has failed, going into a disabled state, for the Interface Metric Family that manages the devices interface items.
This is observed when the following messages are seen in the DA karaf.log file (default home path is /opt/IMDataAggregator/apache-karaf<version>/data/log). They can be seen once every time CD runs. Default time frame is every 24 hours.
INFO | er-thread-135572 | 2024-12-11T01:36:23,143 | nagedDeviceResourceDiscoveryImpl | nagedDeviceResourceDiscoveryImpl 687 | .im.aggregator.discovery | | Change detection for MF NormalizedPortInfo on device 51702397 is disabled.
INFO | er-thread-135572 | 2024-12-11T01:36:23,143 | nagedDeviceResourceDiscoveryImpl | nagedDeviceResourceDiscoveryImpl 626 | .im.aggregator.discovery | | The discovery of {http://im.ca.com/normalizer}NormalizedPortInfo at device 51702397 was disabled. Bailed out.
Use the Item_ID set on the device that hosts the problem interface to find the messages in the karaf.log.
To resolve this in the Portal UI take the following steps.
To manage this using REST API calls review the following.
Which devices have disabled Metric Family entries?
Use a REST call set to POST to the URL:
In the call BODY add the following content.
<FilterSelect xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
xsi:noNamespaceSchemaLocation="filter.xsd">
<Filter>
<Device.ChangeDetectionDisabledMFs type="CONTAINS">{http://im.ca.com/normalizer}</Device.ChangeDetectionDisabledMFs>
</Filter>
<Select use="exclude"
isa="exclude">
<Device use="exclude">
<PrimaryIPAddress use="include"/>
<ChangeDetectionDisabledMFs use="include"/>
</Device>
<Item use="exclude">
<Name use="include"/>
</Item>
</Select>
</FilterSelect>
While the result set is empty, this shows an example result using the ARC REST API client.
A working result set with data will return entries for each device that look similar to this example. This gives us the devices Name, ID and the Metric Family or Families at issue.
<DeviceList>
<Device version="1.0.0">
<ID>14005</ID>
<PrimaryIPAddress>10.1.1.1</PrimaryIPAddress>
<ChangeDetectionDisabledMFsList>
<ChangeDetectionDisabledMFs>{http://im.ca.com/normalizer}NormalizedPortInfo</ChangeDetectionDisabledMFs>
</ChangeDetectionDisabledMFsList>
<Item version="1.0.0">
<Name>device A name</Name>
</Item>
</Device>
</DeviceList>
To resolve those you can run Change Detection on the Device and it's MF in the UI (see above) or use these CURL calls to the DA that will also trigger the updates.
The CURL syntax is the following. Replace <Device_Item_ID> with the DA Item ID value for the target device. Replace <adminRoleUserName> with a real username. You'll be prompted for the password when the command is run.
An example for the Interface MF on device whose Item_ID is 14005 using the default admin user would be:
To get into a state where we disable change detection, the following needs to happen.