Re: [Observium] Devices rebooted?
But thus far it appears not to be happening to anyone else.
Adam.
observium@velder.li wrote:
It appears on different hardware/software. For now I have reverted to the backup from 19.02.2013 (daily incremental). We will see...
On 23.02.2013 01:22, Adam Armstrong wrote:
I can't replicate it and I can't see why it's happening, unless your agent is randomly becoming unreachable.
Also, no one else is reporting the problem.
adam.
On Sat, 23 Feb 2013 02:09:01 +0100, observium@velder.li wrote:
Unfortunately, I don't have any additional info in the logfile :( Do you need more information from somewhere else?
Regards, Patrick
On 23.02.2013 01:15, Adam Armstrong wrote:
Very helpful debugging. Thanks.
On Sat, 23 Feb 2013 01:59:35 +0100, observium@velder.li wrote:
Still "rebooting". A few days ago I didn't have this problem :( I think it's because of the svn update.
On 22.02.2013 21:05, Adam Armstrong wrote:
svn up. I've added logging for unix agent connection failures to the observium.log
root@alpha:/home/observium/demo# grep agent observium.log
Connection to UNIX agent on alpha.memetic.org:6556 failed. ERROR: 111 Connection refused
Connection to UNIX agent on alpha.memetic.org:6556 failed. ERROR: 111 Connection refused
Connection to UNIX agent on alpha.memetic.org:6556 failed. ERROR: 111 Connection refused
root@alpha:/home/observium/demo#
Could do with some timestamps too, maybe. :)
adam.
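The connection failures logged above can also be checked by hand, outside the poller, by opening a TCP connection to the agent port. The following is a minimal Python sketch, assuming the agent listens on TCP port 6556 as in the log lines. `check_agent` and `log_line` are hypothetical helper names, not part of Observium, and the timestamp format is only one possible answer to the "timestamps" remark.

```python
import socket
from datetime import datetime


def check_agent(host, port=6556, timeout=5.0):
    """Try to open a TCP connection to the agent port.

    Returns (True, "connected") on success, or (False, error text) on failure.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True, "connected"
    except OSError as exc:
        # Covers refused connections (errno 111 on Linux), timeouts and
        # name-resolution failures.
        return False, str(exc)


def log_line(host, port, error, when=None):
    """Build a log line like the ones above, prefixed with a timestamp."""
    when = when or datetime.now()
    return (f"[{when:%Y-%m-%d %H:%M:%S}] Connection to UNIX agent on "
            f"{host}:{port} failed. ERROR: {error}")


if __name__ == "__main__":
    # Hypothetical host name; replace with the device being polled.
    ok, message = check_agent("alpha.memetic.org")
    if not ok:
        print(log_line("alpha.memetic.org", 6556, message))
```

If the check fails intermittently, the agent is likely unreachable at those moments; if it never fails, the false reboots probably have another cause.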
On Fri, 22 Feb 2013 21:53:41 +0100, observium@velder.li wrote:
> See uptime on Observium Host:
> root@admin:~# uptime
> 21:51:48 up 60 days, 21:27, 1 user, load average: 0.00, 0.00, 0.00
>
> Uptime of "accesspoint"
> root@accesspoint:~# uptime
> 21:52:17 up 12 days, 42 min, load average: 0.03, 0.03, 0.04
>
> Uptime of "ns3"
> root@ns3:~# uptime
> 21:52:37 up 64 days, 7:39, 1 user, load average: 0.00, 0.00, 0.00
>
> "accesspoint" is about 5km from the observium host, site to site vpn..
> ns3 directly over internet, 6400km.
>
> On 22.02.2013 20:58, Adam Armstrong wrote:
>> that uptime comes from /proc/uptime, i think it's quite reliable:
>>
>> http://demo.observium.org/graphs/type=device_uptime/device=6/to=1361566071/f...
>>
>> Is it possible that your agent is failing to connect? Is the host in
>> question far away from the observium server?
>>
>> adam.
>>
>> On Fri, 22 Feb 2013 21:34:18 +0100, observium@velder.li wrote:
>>> Hi Adam
>>>
>>> www-data@admin:~/observium.mydomain.ch$ ./poller.php -h 6 -m unix-agent,system -d| grep uptime
>>> RRD[update /var/www/observium.mydomain.ch/rrd/admin.mydomain.ch/uptime.rrd N:5183936] Uptime: 59 days, 23h 58m 56s
>>> [uptime] => 5183936
>>> SQL[UPDATE `devices` set `uptime` ='5183936',`last_polled` =NOW(),`last_polled_timetaken` ='0.053' WHERE `device_id` = '6']
>>> UPDATED!
>>>
>>> On 22.02.2013 20:37, Adam Armstrong wrote:
>>>> For example:
>>>>
>>>> root@alpha:/home/observium/demo# ./poller.php -h 6 -m unix-agent,system -d | grep uptime
>>>> (root,7688,836,0.0) grep --color=auto uptime
>>>> [uptime] => 20405260.06 64091168.44
>>>> RRD[update /mnt/ramdisk/observium_demo/alpha.memetic.org/uptime.rrd N:20405260] Uptime: 236 days, 4h 7m 40s
>>>> [uptime] => 20405260
>>>> SQL[UPDATE `devices` set `uptime` ='20405260',`last_polled` =NOW(),`last_polled_timetaken` ='0.483' WHERE `device_id` = '6']
>>>> UPDATED!
>>>> root@alpha:/home/observium/demo#
>>>>
>>>> adam.
>>>>
>>>> On Fri, 22 Feb 2013 21:23:54 +0100, observium@velder.li wrote:
>>>>> No, it's on multiple devices, and snmpd has been running since the last reboot.
>>>>> Poller has different values than Graph, "uptime" on system, devices page ;)
>>>>>
>>>>> On 22.02.2013 21:20, Mark Chaney wrote:
>>>>>> It's just probably stupid snmpd restarting or saying that they did.
>>>>>> Nothing you can really do about that.
>>>>>>
>>>>>> On 2013-02-22 14:09, observium@velder.li wrote:
>>>>>>> Hi there
>>>>>>>
>>>>>>> just saw that observium says that many devices were rebooted today.
>>>>>>> But that's not true. See Screenshot. Is that a bug?
>>>>>>>
>>>>>>> Regards
>>>>>>> Patrick
_______________________________________________
observium mailing list
observium@observium.org
http://postman.memetic.org/cgi-bin/mailman/listinfo/observium
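The poller output quoted in this message shows a raw `/proc/uptime` value (`20405260.06 64091168.44`) being reduced to whole seconds and then printed as days, hours, minutes and seconds. A small Python sketch of that arithmetic follows; the function names are hypothetical and this is not the Observium code, only a way to check the numbers in the thread by hand.

```python
def parse_proc_uptime(text):
    """Return uptime in whole seconds from the contents of /proc/uptime.

    The first field is the uptime in seconds; the second field is the
    accumulated idle time and is ignored here.
    """
    return int(float(text.split()[0]))


def format_uptime(seconds):
    """Format seconds in the 'N days, Hh Mm Ss' style seen in the poller output."""
    days, rest = divmod(seconds, 86400)
    hours, rest = divmod(rest, 3600)
    minutes, secs = divmod(rest, 60)
    return f"{days} days, {hours}h {minutes}m {secs}s"


if __name__ == "__main__":
    seconds = parse_proc_uptime("20405260.06 64091168.44")
    print(seconds, format_uptime(seconds))
    print(format_uptime(5183936))
```

Both values quoted in the thread (20405260 and 5183936 seconds) convert to the uptimes printed by the poller, so the polled numbers themselves look consistent.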
Problem solved by restoring the snapshot from 20.02.2013. The problem appears from the 21st onward.
On 23.02.2013 02:55, Adam Armstrong wrote:
But thus far it appears not to be happening to anyone else.
Adam.
Seeing the same thing here, I just svn-ed to 3631 and observium eventlog says most linux boxes rebooted, while they didn't. :)
On 23 February 2013 13:38, observium@velder.li wrote:
Problem solved by restoring the snapshot from 20.02.2013. The problem appears from the 21st onward.
Hi guys,
Please update to the latest revision.
I think the cause is a small change to the uptime handling: for some devices (mostly network devices) SNMP-FRAMEWORK-MIB::snmpEngineTime.0 became the preferred source, whereas for Unix boxes HOST-RESOURCES-MIB::hrSystemUptime.0 is the preferred one. I have reverted to the old behaviour, with some improvements.
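To illustrate why a change of uptime source can show up as a reboot, here is a minimal Python sketch. It assumes that a reboot is flagged whenever the newly polled uptime is lower than the stored one; that is an assumption about the detection logic, not something taken from the Observium source. It also assumes all readings are already normalised to seconds. snmpEngineTime counts from the last re-initialisation of the SNMP engine, so it can be much lower than the host uptime if snmpd was restarted. The function names and the example value 86400 are hypothetical.

```python
def pick_uptime(readings, preference=("hrSystemUptime", "snmpEngineTime")):
    """Return (source, seconds) for the first preferred source with a value."""
    for source in preference:
        value = readings.get(source)
        if value is not None:
            return source, value
    return None, None


def rebooted(previous_uptime, current_uptime):
    """Flag a reboot when the polled uptime goes backwards."""
    return previous_uptime is not None and current_uptime < previous_uptime


if __name__ == "__main__":
    # Hypothetical readings for one Linux box, in seconds.
    readings = {"hrSystemUptime": 5183936, "snmpEngineTime": 86400}

    _, stored = pick_uptime(readings)  # host uptime stored by an earlier poll
    _, polled = pick_uptime(readings, ("snmpEngineTime", "hrSystemUptime"))

    # Switching the preferred source makes uptime appear to drop.
    print(rebooted(stored, polled))
```

Under these assumptions the box is reported as rebooted purely because the preferred source changed between two polls, which would match the symptoms described in this thread.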
On Sat, Feb 23, 2013 at 4:48 PM, Wouter Prins wp@null0.nl wrote:
Seeing the same thing here, I just svn-ed to 3631 and observium eventlog says most linux boxes rebooted, while they didn't. :)
-- Wouter Prins wp@null0.nl
participants (4)
- Adam Armstrong
- Mike Stupalov
- observium@velder.li
- Wouter Prins