18-Aug-2012 12:59:49 GMT NOTIFICATION-Firewall_Unresponsive NOTIFY
18-Aug-2012 13:03:42 GMT NOTIFICATION-Firewall_Down NOTIFY
18-Aug-2012 13:07:16 GMT NOTIFICATION-Firewall_Down CLEAR
23-Aug-2012 13:52:04 GMT NOTIFICATION-Firewall_Unresponsive CLEAR
The System Unresponsive event reflects the true outage duration. The System Down event lasted less than 4 minutes (13:03:42 to 13:07:16 GMT). Why didn't the System Down event persist for the same period as the System Unresponsive event?
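The two durations can be checked directly from the NOTIFY and CLEAR timestamps in the log above. The following is a minimal sketch in Python; the helper name `duration` and the format string are illustrative assumptions, not part of Smarts, and parsing the month abbreviation assumes an English locale.

```python
from datetime import datetime, timedelta

# Timestamp layout used in the notification log, e.g. "18-Aug-2012 13:03:42".
# Assumption: the month abbreviation is parsed under an English locale.
FMT = "%d-%b-%Y %H:%M:%S"

def duration(notify: str, clear: str) -> timedelta:
    """Return the time elapsed between a NOTIFY and its matching CLEAR."""
    return datetime.strptime(clear, FMT) - datetime.strptime(notify, FMT)

# Values copied from the log excerpt (all GMT).
down = duration("18-Aug-2012 13:03:42", "18-Aug-2012 13:07:16")
unresponsive = duration("18-Aug-2012 12:59:49", "23-Aug-2012 13:52:04")

print("Down:", down)                  # 0:03:34
print("Unresponsive:", unresponsive)  # 5 days, 0:52:15
```

The Down event spans 3 minutes 34 seconds, while the Unresponsive event spans a little over 5 days.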
Smarts recomputes its correlation at every unit interval of time, using the set of symptoms present at the moment of calculation, and on that basis it reports certain systems as "DOWN". The System "DOWN" event lasted less than 4 minutes because, within those 4 minutes, another relaying device may have been reporting "MightBeDown" and was then taken into account in the next correlation calculation. That new correlation can move the relaying device from "MightBeDown" to "DOWN", which clears the "DOWN" message on the device that was reported "DOWN" earlier. Also, certain systems are reported as "UNRESPONSIVE" based on the attributes set.
In addition, Smarts defines an Unresponsive alert and a Down alert using the following logic:
A Down event in Smarts IP is essentially calculated as follows:
An Unresponsive event is determined based on the results of IsUnresponsive calculation, which is calculated as follows:
There are some computed attributes that define IsUnresponsive, but the first few can be seen directly through the console.
NOTE: For a down event to be computed, the device goes through the following transitions:
Device is:
1: Unresponsive (not mandatory)
2: MightBeDown
3: DOWN
Unresponsive ---> When all the IP addresses and the SNMP agent are unresponsive.
MightBeDown ---> When it is Unresponsive and has at least one responding neighbor (i.e. IsEveryNeighborUnresponsive is set to false).
DOWN ---> When it is MightBeDown, and the NeighboringSystemsMightBeDown symptom and other symptoms (status of connected neighbors) are checked to deduce DOWN.
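The three conditions above can be read as a simple decision chain. The following Python sketch is only an illustration of that reading, not the Smarts correlation engine. The function name, the parameter names and the "Responsive" label are hypothetical. The final check is reduced to a single boolean standing in for the NeighboringSystemsMightBeDown symptom and the other neighbor symptoms. The sketch also treats the stages as strictly nested, although the note above says the Unresponsive stage is not mandatory.

```python
def classify_device(
    all_ips_unresponsive: bool,
    snmp_agent_unresponsive: bool,
    is_every_neighbor_unresponsive: bool,
    neighbor_symptoms_confirm_down: bool,
) -> str:
    """Illustrative mapping of the conditions above to a device state.

    All parameter names are hypothetical stand-ins for attributes and
    symptoms that Smarts computes internally.
    """
    # Unresponsive: all IP addresses and the SNMP agent are unresponsive.
    if not (all_ips_unresponsive and snmp_agent_unresponsive):
        return "Responsive"

    # MightBeDown: Unresponsive with at least one responding neighbor.
    if is_every_neighbor_unresponsive:
        return "Unresponsive"

    # DOWN: MightBeDown, and the neighbor symptoms support the deduction.
    if neighbor_symptoms_confirm_down:
        return "DOWN"
    return "MightBeDown"


# Re-evaluating with a different symptom set can change the result,
# which is how a DOWN notification can clear while Unresponsive persists.
print(classify_device(True, True, False, True))   # DOWN
print(classify_device(True, True, True, True))    # Unresponsive
```

Because the state is derived from whatever symptoms are present at each correlation interval, a change in a neighbor's status alone is enough to move a device out of DOWN while it remains Unresponsive.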