Hi Patrick.

That's surely not supposed to happen. It's quite a number of ports, but certainly nothing that should be an issue, i poll larger stacks and people poll devices with thousands of interfaces.

I would try to check with the -d switch and see which queries lock it up, then report this to cisco as that's simply unacceptable.

If you know what causes it and you don't happen to need that info, you could disable that discovery module for those devices. That should help you out while cisco will treat your issue with utmost urgency and does the needful *cough*

Sent from my mobile, please excuse brevity and spelling errors.

----- Reply message -----
From: "Patrick Zaloum" <pzaloum@gmail.com>
To: <observium@observium.org>
Subject: [Observium] Discovery of large stack causes hardware to become unresponsive
Date: Mon, Mar 4, 2013 16:02


Hello
I are testing Observium in a new environment after previous success stories elsewhere. 
We have noted that when we run device discovery on a stack of 5 Cat3750 (12.2-50-SE1) the "device" becomes unresponsive to any other queries / pings / ssh for approximately 5-6 minutes (during the discovery cycle). 

Once the discovery is complete, the poller, pings, and other operations return to normal. Note that traffic flow does not appear to be affected across the switch.

Total ports on the stack are 278 (no argument that it is a large number of ports to poll from a single device).

Regular polling of the device does not appear to have any consequences, it is specific to the discovery module.

Does anyone have any insight into this?

Thanks
Patrick