How to deactivate UIM clariion self-monitoring alarms

Article ID: 240285

Products

DX Unified Infrastructure Management (Nimsoft / UIM)

Issue/Introduction

We keep getting the following error from the clariion probe:

Self-Monitoring Failures for '172.31.220.12:STORAGEPROCESSOR.SP_ResponseTime': Data Collection (1 of 2 failed). See clariion.log for more details

What are these and how can they be disabled?

Environment

Release: Any

Resolution


The purpose of the alarm

     This alarm indicates that data collection failed for one or more of your monitors (metrics).

How to disable the alarm feature

     Open your probe in Raw Configure and add the following key under the <setup> section.

        enable_self_monitoring_alarm = false

How to change the severity of the alarm

     Open your probe in Raw Configure and add the following key under the <setup> section.

        self_monitoring_alarm_severity = <Desired number>

        (5 = Critical, 4 = Major, 3 = Minor, 2 = Warning, 1 = Informational). The default is 4.
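
     If the key is generated by a script rather than typed by hand, the severity table above can be encoded so that an invalid level is rejected before it reaches the configuration. The following is a minimal sketch; the helper name severity_key_line is hypothetical and is not part of UIM. It only builds the text of the key, which still has to be added in Raw Configure.

```python
# Severity levels exactly as listed in this article (name -> number).
SEVERITY_LEVELS = {
    "informational": 1,
    "warning": 2,
    "minor": 3,
    "major": 4,  # default according to this article
    "critical": 5,
}


def severity_key_line(level: str) -> str:
    """Return the <setup> key line for a severity name (hypothetical helper)."""
    name = level.strip().lower()
    if name not in SEVERITY_LEVELS:
        raise ValueError(
            f"unknown severity {level!r}; expected one of {sorted(SEVERITY_LEVELS)}"
        )
    return f"self_monitoring_alarm_severity = {SEVERITY_LEVELS[name]}"


if __name__ == "__main__":
    # Prints the line to paste under the <setup> section.
    print(severity_key_line("Warning"))
```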


How to generate the alarm per failed metric rather than per failed metric type

     Open your probe in Raw Configure and add the following key under the <setup> section.

        enable_self_monitoring_alarm_aggregation = false

     By default, the probe aggregates self-monitoring alarms based on monitor type.

     For example, if data collection for the "ResponseTime" metric fails for 1 of 2 monitors, the probe aggregates the failures and generates only one alarm:

     Self-Monitoring Failures for 'CLARIION1:STORAGEPROCESSOR.SP_ResponseTime': Data Collection (1 of 2 failed).  See clariion.log for more details

     The aggregated alarm indicates how many monitors failed (e.g., 1 of 2 failed). With aggregation disabled, the probe generates a separate alarm for each failed monitor, for example:


        Failed to collect data for monitor 'SP A.ResponseTime'. Updated value will not be available.
        Failed to collect data for monitor 'SP B.ResponseTime'. Updated value will not be available.
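
     When these alarms are processed outside UIM (for example, in a report script), the two message shapes can be told apart by their text. The sketch below is an illustration only: the patterns are derived solely from the alarm texts quoted in this article, and the function name and dictionary keys are hypothetical. It assumes the source part of the aggregated message has the form resource:monitor type and splits on the first colon.

```python
import re
from typing import Optional

# Patterns derived from the alarm texts quoted in this article.
AGGREGATED = re.compile(
    r"Self-Monitoring Failures for '(?P<source>[^']+)': "
    r"Data Collection \((?P<failed>\d+) of (?P<total>\d+) failed\)"
)
PER_MONITOR = re.compile(
    r"Failed to collect data for monitor '(?P<monitor>[^']+)'"
)


def parse_self_monitoring_alarm(message: str) -> Optional[dict]:
    """Classify a self-monitoring alarm message (hypothetical helper)."""
    match = AGGREGATED.search(message)
    if match:
        # Assumption: the text before the first ':' names the resource.
        resource, _, monitor_type = match.group("source").partition(":")
        return {
            "kind": "aggregated",
            "resource": resource,
            "monitor_type": monitor_type,
            "failed": int(match.group("failed")),
            "total": int(match.group("total")),
        }
    match = PER_MONITOR.search(message)
    if match:
        return {"kind": "per_monitor", "monitor": match.group("monitor")}
    return None  # not a self-monitoring alarm in either known shape


if __name__ == "__main__":
    print(parse_self_monitoring_alarm(
        "Self-Monitoring Failures for 'CLARIION1:STORAGEPROCESSOR.SP_ResponseTime': "
        "Data Collection (1 of 2 failed).  See clariion.log for more details"
    ))
```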


How to avoid resending the same alarm while the data collection failure persists

     Open your probe in Raw Configure and add the following key under the <setup> section.

        enable_self_monitoring_alarm_same_error_suppression = true

     By default, the probe resends the same failing self-monitoring alarm on every collection cycle, using the same suppression key.
     With suppression enabled, the alarm is sent only on the first occurrence, when the number of errors changes, or after the probe is restarted.
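
     The suppression rule described above can be modelled in a few lines. This is a sketch of the documented behaviour only, not the probe's actual implementation; the class and method names are hypothetical. A probe restart is modelled by creating a new object, which forgets all previously seen failure counts.

```python
class SameErrorSuppressor:
    """Remembers the last reported failure count per suppression key."""

    def __init__(self) -> None:
        # Empty after a (simulated) probe restart.
        self._last_count = {}

    def should_send(self, suppression_key: str, failed_count: int) -> bool:
        """Return True on first occurrence or when the error count changed."""
        previous = self._last_count.get(suppression_key)
        self._last_count[suppression_key] = failed_count
        # previous is None on the first occurrence, so the comparison is True.
        return previous != failed_count


if __name__ == "__main__":
    suppressor = SameErrorSuppressor()
    key = "CLARIION1:STORAGEPROCESSOR.SP_ResponseTime"
    # Failure counts observed over four collection cycles.
    for cycle, failed in enumerate([1, 1, 2, 2], start=1):
        print(cycle, failed, suppressor.should_send(key, failed))
```

     In this model the alarm is sent in the first and third cycles and suppressed in the second and fourth, because the failure count is unchanged there.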

Additional Information

- This alarm feature is available in some monitoring probes.

- This error can also be caused by outdated static alarm definitions.