I am seeing something similar, but only with two hosts.  They show down/up on what is seemingly a random pattern. The down/up emails are about 5 minutes apart when it happens.  They are both domain controllers. They are VMs on hosts with many other VMs, so I know the network isn't going down, otherwise I'd lose them all at once. Also the host servers are monitored and they never show down. It's just these two DCs. They are Server 2012 and we have other Server 2012 VMs.  I tried running snmpget from the CLI like the FAQ suggested, but it returns fine with no errors. I've even had Observium tell me these DCs are down when I am in active RDP sessions to them.

My gut says Observium is not to blame since only these two hosts are effected, but I don't have a lot of experience troubleshooting SNMP.  I've been digging through the event viewer on the DCs trying to find a cause, but so far no luck.


John :-)
-----------------------------
John Fano
Systems Administrator
North Canton City Schools
john@northcantonschools.org
330.497.5600 x309
-----------------------------
"Well, we'll not risk another frontal assault. That rabbit's dynamite."
         - King Arthur, "Monty Python and the Holy Grail"


On Mon, Mar 10, 2014 at 2:01 PM, Peter Persson <peter.persson@bredband2.se> wrote:
I dont think this is a observium issue...

If you ping the host for 10 minutes, does it go down in Observium then?

/P


2014-03-10 18:49 GMT+01:00 Joarli Leandro [INITNET] <jinitnet@gmail.com>:
Good morning, I have a problem I can not solve.
I have two installations of observium, CE and SUBSCRIPTION, on 2 different servers.

My
hosts are all down , 5 minutes after up. In both servers.

When I go
to check, no host had fallen, and neither appears in the logs instability.

 I installed another server, running only the SNMP and a dedicated link, and even then it falls and rises in 5 minutes.

What can it be? All three devices are dedicated to Observium, 1 physical machine, and another virtual. The client host only with SNMP are physical. All in 3 diferents Datacenter.
 See an example below.


2014-03-10 13:25:02 Machine1 System Device status changed to Up
2014-03-10 13:25:02 Machine2
System Device status changed to Up
2014-03-10 13:20:04 Machine1
System Device status changed to Down (ping)
2014-03-10 13:20:04 Machine2
System Device status changed to Down (ping)

--
Joarli Leandro
Tel: (11) 4478-6171
jleandro@initnet.com.br

_______________________________________________
observium mailing list
observium@observium.org
http://postman.memetic.org/cgi-bin/mailman/listinfo/observium



_______________________________________________
observium mailing list
observium@observium.org
http://postman.memetic.org/cgi-bin/mailman/listinfo/observium