my server freaks out sometimes!
* Hi, I am using the pro version Observium 23.8.12912 (stable)https://www.observium.org/ I am having an issue where over the course of 2 hours (or so) all my devices go down and then up. It claims that it missed pings but there is no issue with any of the devices. This has happened to me 2 times in the past 3 months, both times starting at midnight. When I look in the event log, I am seeing some odd things on that date: [cid:image004.jpg@01D9D4D7.42656A20]
First, notice that there are a bunch of snmp timeouts which do not occur on any other day. Also, it seems that all the devices have changes...again this does not happen on any other night. After the first incident I increased the RAM on the server thinking that it was constrained, but the poller info does not look bad: [cid:image005.jpg@01D9D4D7.42656A20]
I would appreciate help in determining what to look at for the cause of this. Thanks
Tony Guadagno O +1 585 577 1003 C +1 585 703 6700 E tonyg@guadagnoconsulting.commailto:tonyg@guadagnoconsulting.com [cid:image001.jpg@01D84DD6.FC9912E0]
This is likely a firewall or similar intervening device being unhappy at UDP volume and rate limiting.
It's likely that your discovery run starts at midnight, so increases SNMP UDP traffic.
adam.
Tony Guadagno via observium wrote on 22/08/2023 14:01:
·Hi, I am using the pro version Observium 23.8.12912 (stable) https://www.observium.org/
I am having an issue where over the course of 2 hours (or so) all my devices go down and then up. It claims that it missed pings but there is no issue with any of the devices. This has happened to me 2 times in the past 3 months, both times starting at midnight.
When I look in the event log, I am seeing some odd things on that date:
First, notice that there are a bunch of snmp timeouts which do not occur on any other day. Also, it seems that all the devices have changes…again this does not happen on any other night.
After the first incident I increased the RAM on the server thinking that it was constrained, but the poller info does not look bad:
I would appreciate help in determining what to look at for the cause of this.
Thanks
Tony Guadagno
O +1 585 577 1003
C +1 585 703 6700
E tonyg@guadagnoconsulting.com mailto:tonyg@guadagnoconsulting.com
cid:image001.jpg@01D84DD6.FC9912E0
observium mailing list -- observium@lists.observium.org To unsubscribe send an email to observium-leave@lists.observium.org
Plz unsubscribe my mail.
Thank and Regards
S. M Al-Mahmud Hashim Informations Security & Governance Manager, IT & ADC Ops Division NRB Bank Corporate Head Office |Uday Sanz (4th Floor)| Plot: 2/B, Road: 134, South Avenue| Gulshan-1. T: +880 9666-45-6263|M:+8801700702182 E: hashim.mahmud@nrbbankbd.commailto:mamun.seraji@nrbbankbd.com www.nrbbankbd.comhttp://www.nrbbankbd.com/
"The content of this message is confidential. If you have received it by mistake, please inform us by an email reply and then delete the message. It is forbidden to copy, forward, or in any way reveal the contents of this message to anyone. The integrity and security of this email cannot be guaranteed over the Internet. Therefore, the sender will not be held liable for any damage caused by the message."
From: Adam Armstrong via observium observium@lists.observium.org Sent: Tuesday, August 22, 2023 10:17 PM To: Tony Guadagno via observium observium@lists.observium.org Cc: Adam Armstrong adama@observium.org Subject: [Observium] Re: my server freaks out sometimes!
This is likely a firewall or similar intervening device being unhappy at UDP volume and rate limiting.
It's likely that your discovery run starts at midnight, so increases SNMP UDP traffic.
adam.
Tony Guadagno via observium wrote on 22/08/2023 14:01:
* Hi, I am using the pro version Observium 23.8.12912 (stable)https://www.observium.org/ I am having an issue where over the course of 2 hours (or so) all my devices go down and then up. It claims that it missed pings but there is no issue with any of the devices. This has happened to me 2 times in the past 3 months, both times starting at midnight. When I look in the event log, I am seeing some odd things on that date:
First, notice that there are a bunch of snmp timeouts which do not occur on any other day. Also, it seems that all the devices have changes...again this does not happen on any other night. After the first incident I increased the RAM on the server thinking that it was constrained, but the poller info does not look bad:
I would appreciate help in determining what to look at for the cause of this. Thanks
Tony Guadagno O +1 585 577 1003 C +1 585 703 6700 E tonyg@guadagnoconsulting.commailto:tonyg@guadagnoconsulting.com [cid:image001.jpg@01D9D54D.B1272130]
_______________________________________________
observium mailing list -- observium@lists.observium.orgmailto:observium@lists.observium.org
To unsubscribe send an email to observium-leave@lists.observium.orgmailto:observium-leave@lists.observium.org
participants (3)
-
Adam Armstrong
-
S M Al-Mahmud Hashim
-
Tony Guadagno