You might have some luck increasing SNMP retries on the devices that do this. If they block SNMP for more than a few seconds at a time, there isn't really a way to solve that.
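
As a rough sketch, raising the retries globally in config.php might look like the lines below. This assumes the $config['snmp'] keys shown match your Observium version. The values are purely illustrative, and a per-device override may be a better fit if only a few devices misbehave.

    // Assumed global SNMP settings; verify the key names and defaults for your version
    $config['snmp']['timeout'] = 3;   // seconds to wait for each SNMP reply (illustrative value)
    $config['snmp']['retries'] = 5;   // attempts before the query is given up (illustrative value)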

adam.

On 2018-07-06 09:10:06, Christian Eriksson <christian.eriksson@advania.se> wrote:

This happens because HPE gear stops responding to SNMP from time to time. If you check the logs, do they say "Device status changed to Down (snmp)"?

 

Regards
Christian

From: observium <observium-bounces@observium.org> On Behalf Of Thomas Stather
Sent: 6 July 2018 09:56
To: observium@observium.org
Subject: [Observium] how to avoid false-positives (device down/up), on HPE switches and Aruba APs

 

Hi

Observium: 18.1.9011 (stable, subscription)

We are monitoring our networking infrastructure with Observium, and for a long time I have been seeing some false positives.
The poller period is 5 minutes. For example, at 10:00 am Observium reports a switch as being down, and at 10:05 am it reports it as being up again.

This happens with HPE network gear (switches) as well as with wireless APs (Aruba).

I set

    $config['ping']['debug']   = TRUE;
    $config['ping']['timeout'] = 500;
 
 
 

in config.php and will now monitor whether that helps.

Is there anything else I can try to solve this issue?




Best,
Thomas

-- 
Thomas Stather
IT Services
 
VCP4, VCP6-DCV
 
Tel: +49 6221-486 628
Fax: +49 6221-486 561
 
------------------------------------------------------------------------
Max Planck Institute for Medical Research (MPImF)
Jahnstrasse 29, 69120 Heidelberg
Germany