We keep getting the following error for the cisco_ucs probe
Error: cisco_ucs:Self-Monitoring Failures for '###.###.###.###':Monitor.Metric': Data Collection (x of x failed). See cisco_ucs.log for more details
What are these and can they be disabled?
The Self-Monitoring failures alarm is generated when the mentioned Queue is not able to fetch the data or data is getting null for that particular metric.
This is expected and by design.
If you would like to disable these alarms please do the following add the following key into cisco_ucm via Raw Configure and navigate to the setup section: enable_self_monitoring_alarm = false
self_monitoring_alarm_severity = <Desired number>
(5-Critical, 4-Major, 3-Minor, 2-Warning, 1-Informational). Default is 4
enable_self_monitoring_alarm_aggregation = false
enable_self_monitoring_alarm_same_error_suppression = true
This will disable the self-monitoring alarms moving forward.