CPU metrics data for a set of the same devices, all using the same Vendor Certification (VC) suddenly stopped collecting data. After a Data Aggregator restart, the data started showing up again.
This has been observed a few times in the last year, always around a Data Aggregator restart. A subsequent second restart normally resolves the issue.
All supported Performance Management releases
Before requesting an official Certification from engineering, custom certification attempts were made. This lead to a custom VC file, and an extension on top of what was at the time an incorrect factory certification. Additionally, both the incorrect VC and extension were the same XML code.
In both custom VC and extension, the OIDs to gather CPU data were incorrectly set to always use .1 for an index. These OIDs require .0 to always be used. When .1 was used the device would respond appropriately, with a noSuchInstance error message as they don't exist at .1.
With these customized entries in place, alongside the default one, it was random what combination of VC's and extension were in use. After a DA restart...
Two fixes are needed here. Deletion of the custom VC file and resetting the extension. First the file deletions.
To reset the extension:
<!-- Auto-generated by the type catalog local manager. -->
<DataModel xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" namespace="http://im.ca.com/certifications/snmp" xsi:noNamespaceSchemaLocation="SNMPCertificationFacet.xsd">
<Author>CA</Author>
<Version>1.1</Version>
<FacetType name="McAfeeWebGatewayCPUMib" descriptorClass="com.ca.im.core.datamodel.certs.CertificationFacetDescriptorImpl">
<FacetOf namespace="http://im.ca.com/core" name="Item"/>
</FacetType>
</DataModel>
After about 20 minutes we should begin to see data collected again. If not restart the Data Aggregator.