DX IM - UIM - websphere_mq Self-Monitoring Failures - Alarms


Article ID: 125713


Updated On:


NIMSOFT PROBES DX Infrastructure Management


We keep getting the following error for the websphere_mq probe "Self-Monitoring Failures for 'XXXXXXX:QUEUE.1027':
Monitor Correlation (1 of 6 failed), Data Collection (1 of 6 failed). See websphere_mq.log for more details"

What are these and can the be disabled?


UIM 9.X and earlier
websphere_mg 2.21 and ealier


The Self-Monitoring failures alarm is generated when the mentioned Queue is not able to fetch the data or data is getting null for that particular metric.
This is expected and by design.

If you would like to disable these alarms please do the following add the following key into websphere_mq via Raw Configure and navigate to the setup section: enable_self_monitoring_alarm = false

self_monitoring_alarm_severity = <Desired number>
(5-Critical, 4-Major, 3-Minor, 2-Warning, 1-Informational). Default is 4

enable_self_monitoring_alarm_aggregation = false
enable_self_monitoring_alarm_same_error_suppression = true

This will disable the self-monitoring alarms moving forward.

Additional Information

Note 1: These Self-Monitoring Alarm Failures are aggregated for an element.metric type per resource.  Individual failure details or related exceptions should proceed this log entry.

Note 2: 'Monitor Correlation' failures occur when a monitor does not find it's specific element in the inventory, or no metric value is available for the element.
         With static monitors and changing inventory, these are sometimes expected and may be transitory.

Note 3: The failure count of 'Data Collection' failures often correlate with Monitor Correlation failures.
         When there are only 'Data Collection' failures, or when they exceed 'Monitor Correlation' failures, that usually indicates a problem in collecting that metric value.
         Some metric values are only available with additional system administration.
         Some metric values are only available for specific element types.  For instance one type of storage might have a metric, while another does not.
         Generally it is desirable to understand 'Data Collection' failures for desired metrics, and sometimes the probe needs to be tuned for them