Hi Tom,
well it cannot really be that the device is marked as down by
alert.
Test conditions are as follows:
device_status_type ping
device_status equals 0
If I understand it correctly,
even if snmp is not reachable the device should NOT be marked as
down, because it is pingable.
Therefore I do not understand why it is still marked as down.
Message: 5 Date: Wed, 2 Nov 2016 20:36:19 +0100 From: Tom Laermans <tom.laermans@powersource.cx> To: Observium Network Observation System <observium@observium.org> Subject: Re: [Observium] Device Down False positive and snmpget Message-ID: <eff1c872-a32b-fe5f-ecb5-eafe35712ff4@powersource.cx> Content-Type: text/plain; charset="utf-8"; Format="flowed" Sebastian, The device was likely down due to ping, first... then, as it became pingable but not snmp-able, there was no change logged in device status (down = down). As to why snmpget is telling you 2 is > 128, I'm not sure... Tom On 02/11/2016 20:05, Sebastian Klute wrote:
Hello folks,
today I got kinda strange problem. I got a Server up and running fine
but it is still shown as device down. "Device status changed to Down
(ping)"
After some debugging and digging into the workflow of polling and
checking egc. I tried to redo the different checks that could lead to
a device down.
First of all: Is it pingable ?
root@om:/tmp# /usr/bin/fping -t1000 -t 500 -c 20 -q 84.200.41.226
84.2xx.xxx.xxx : xmt/rcv/%loss = 20/20/0%, min/avg/max =
0.20/0.30/0.43
seems good to me - no idea why it should be down by ping.
Is the hostname resolvable? Yes it is. Standart ping with hostname is
working fine.
So I'll dig further into debug logs and found this
CMD[/usr/bin/snmpget -v2c -c *** -Pu -OQUst -m SNMPv2-MIB -M
/opt/observium/mibs/rfc:/opt/observium/mibs/net-snmp
'udp':'hostname.example':'161' sysObjectID.0 sysUpTime.0]
CMD EXITCODE[1]
CMD RUNTIME[6.0283s]
STDOUT[
]
STDERR[
Timeout: No Response from udp:hostname.example:161.
]
SNMP STATUS[FALSE]
I wondered why and did it manualy.
The error is as followed:
Too many object identifiers specified. Only 128 allowed in one
request.
The question is now, what can I do to solve this issue? What about the
false positive device down by ping alert?
Additional Info:
> $data - /opt/observium/poller.php:170
/opt/observium/includes/alerts.inc.php:61
=========================================
array(
[device_status] => string(1) "0"
[device_status_type] => string(4) "snmp"
[device_ping] => string(4) "7.34"
[device_snmp] => int(0)
)
Thanks for the help and best regards
Sebastian Klute
--
Accelerated IT Services GmbH
Kruppstraße 105 - 60388 Frankfurt - Germany
sk@accelerated.de -http://www.accelerated.de/
Phone: +49 69 - 900 180 41 - Fax: +49 69 - 900 180 90
HRB: 60665 - Amtsgericht Ludwigshafen - VatID: DE253684415
Managing Directors: Nicolaj Kamensek & Ole Krieger
_______________________________________________
observium mailing list
observium@observium.org
http://postman.memetic.org/cgi-bin/mailman/listinfo/observium
-- Accelerated IT Services GmbH Kruppstraße 105 - 60388 Frankfurt - Germany sk@accelerated.de - http://www.accelerated.de/ Phone: +49 69 - 900 180 41 - Fax: +49 69 - 900 180 90 HRB: 60665 - Amtsgericht Ludwigshafen - VatID: DE253684415 Managing Directors: Nicolaj Kamensek & Ole Krieger