Hi,
We seem to have an issue with alerts not clearing (after they are OK) from the Alerts list.
Example below shows a checked called “Hardware Fault” which matches Warnings, Alerts and Down status for devices. These items (two output phases) were above a threshold for a period over the weekend, but then went back below it and the event cleared. The device itself shows that the thresholds are OK now but it does list as “Status: Checks Failed” even though the values (and the graph) show it as OK / Green:
[cid:image005.jpg@01D0D8D4.84525900]
The actual item is here:
[cid:image006.jpg@01D0D8D4.84525900]
The event in the Main Alerts List is here:
[cid:image007.jpg@01D0D8D4.84525900]
The Alert Checker configuration is here:
[cid:image008.jpg@01D0D8D4.84525900]
Sorry for all the images, but it seems the best way to explain this particular issue!
This is not the first time this has happened, last week we had 8 items stuck in an alert state, I had to delete the checked and clear the alert table for the so-called “failed” items to make them go away. However, now we have some new alerts they seem to be stuck in the same way.
Any ideas?
Cheers!
Robert Williams Custodian Data Centre Email: Robert@CustodianDC.com http://www.CustodianDC.com