1. Identify the probe responsible for high CPU (refer to the Additional Information section)
2. Older Java/Probe release:
- A probe may have been running for a long time, and over that period the Java release it was built on may have become obsolete or been superseded, or a newer release of the probe may have become available.
- Refer to the probe release notes to check whether a newer, compatible release is available; if so, upgrade the probe to the latest release.
3. Hot-fix available:
- Refer to the hot-fix index to check whether a hot-fix has been created for the probe; if so, check the probe prerequisites.
- Read the readme before upgrading the probe to the hot-fix level. The hot-fix index is available on the Probes Hotfix Page.
4. Configuration issue(s):
(a) Insufficient probe memory:
=======================
One possible cause is insufficient probe memory. If enough free memory is available, try increasing the probe's initial and maximum memory (found in the Probe configuration startup -> opt section).
(b) Issues with existing configuration / setup:
==================================
Take a backup and delete the existing probe (be aware that this could cause data loss; if you are unsure about the consequences, reach out to Support). Then remove the probe folder manually and deploy the same or a newer release of the probe, if available, after reading the Probe release notes.
5. Additional Probe specific issue(s):
(a) logmon probe - use of regular expressions:
=====================================
This is specific to the logmon probe. Using a general pattern such as * as the match pattern increases the CPU load; use more specific expressions that meet your requirement. The links at the end of this item are useful when writing regular expressions.
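As a rough illustration of why a specific pattern is cheaper than a catch-all, the sketch below counts how many lines each kind of pattern matches. It uses Python's `re` module as a stand-in; logmon's own matching engine and syntax may differ, and the sample log lines and the `count_matches` helper are hypothetical, invented only for this example.

```python
import re

# Hypothetical log lines, invented for illustration only.
SAMPLE_LINES = [
    "2024-01-01 10:00:00 INFO service started",
    "2024-01-01 10:00:05 ERROR disk /dev/sda1 is 95% full",
    "2024-01-01 10:00:09 DEBUG heartbeat ok",
    "2024-01-01 10:00:12 ERROR connection to db01 refused",
]


def count_matches(pattern, lines):
    """Return how many lines the given regular expression matches."""
    compiled = re.compile(pattern)
    return sum(1 for line in lines if compiled.search(line))


# A catch-all pattern matches every line, so every line has to be processed.
broad = count_matches(r".*", SAMPLE_LINES)

# A specific pattern matches only the lines that are actually of interest.
specific = count_matches(r"\bERROR\b.*\bdisk\b", SAMPLE_LINES)

print(f"catch-all matched {broad} lines, specific matched {specific}")
```

On a busy log file the difference grows with the number of lines written, which is why narrowing the match pattern tends to reduce the work done per line.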
Regex Basics
Including / Excluding Expressions using Regex
(b) nas probe - replication failure:
==========================
This is specific to the nas probe. In an environment with multiple nas probes and Replication enabled, Replication failures can cause the probe to consume high CPU. Check that the nas probes on the respective robots are activated, running error-free, and on the same, latest release.
(c) snmpcollector probe:
===================
This is specific to the snmpcollector probe. Check whether any profile is failing to activate.
- The end server may be under heavy load; refer to the link below to identify whether too many metrics are being created:
- Check whether the end device is certified; refer to the link below for the device support list:
- Refer to the link below for known issues with the snmpcollector probe:
https://techdocs.broadcom.com/us/en/ca-enterprise-software/it-operations-management/ca-unified-infrastructure-management-probes/GA/monitoring/networking-infrastructure/snmpcollector-snmp-data-monitoring/snmpcollector-snmp-data-monitoring-release-notes/snmpcollector-known-issues-and-workarounds.html
(d) database monitoring probes like Oracle, sqlserver - checkpoint timeout errors:
==============================================================
This is specific to database monitoring probes such as the Oracle and sqlserver probes. Check whether there are checkpoint timeout errors in the probe log files (check the log files with debug enabled; click here to learn how to enable debugging). Click here for further details.
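A minimal sketch of scanning a probe log for such errors is shown below. This article does not give the exact wording of the log message, so the search pattern (the word "checkpoint" followed later by "timeout" or "timed out") is an assumption, as are the sample log lines and the `find_checkpoint_timeouts` helper; adjust the pattern to whatever the probe actually writes.

```python
import re

# Assumption: the real log wording is not given in this article. This pattern
# simply looks for "checkpoint" followed later by "timeout" or "timed out".
CHECKPOINT_TIMEOUT = re.compile(r"checkpoint.*(timeout|timed out)", re.IGNORECASE)


def find_checkpoint_timeouts(lines):
    """Return (line_number, text) pairs for lines matching the pattern."""
    return [
        (number, line.rstrip("\n"))
        for number, line in enumerate(lines, start=1)
        if CHECKPOINT_TIMEOUT.search(line)
    ]


# Hypothetical log excerpt, invented for illustration only.
sample_log = [
    "Jan 01 10:00:00 oracle: profile 'prod' started\n",
    "Jan 01 10:05:00 oracle: checkpoint 'tablespace_free' timeout after 30s\n",
    "Jan 01 10:06:00 oracle: profile 'prod' finished\n",
]

hits = find_checkpoint_timeouts(sample_log)
for number, text in hits:
    print(f"line {number}: {text}")
```

The function accepts any iterable of lines, so an open file object for the probe's log file can be passed in place of `sample_log`.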
If the issue continues to exist and/or if the above-mentioned points are not applicable, please open a case with Broadcom Support.