
Sorry. I feel lost... So the fact that it's NOT returning 1 isn't good enough to act on?
-- Henrik Cednert cto | td | compositor
Filmlance International Direct + 46 (0)704 71 89 54
From: observium <observium-bounces@observium.org> on behalf of Adam Armstrong <adama@memetic.org>
Reply-To: Observium Network Observation System <observium@observium.org>
Date: Friday 15 January 2016 at 22:38
To: "observium@observium.org" <observium@observium.org>
Subject: Re: [Observium] [ALERT-CHECKER] Alert to pick up on physical failure of disk.
In this case, I'm not sure what the status entry will be returning. You kinda need this information to make sure the check would pick the failure up :)
adam.
On 15/01/2016 22:36:01, Henrik Cednert (Filmlance) <henrik.cednert@filmlance.se> wrote:
I assume it did, since there's a line at 0 on the attached graph. It's not there for the entire failure, only from the start up until the morning I replaced the disk. Since the disk has been replaced and the status is back to 1 (ok/normal), I don't think the command will help me/us now. =/
From: observium <observium-bounces@observium.org> on behalf of Adam Armstrong <adama@memetic.org>
Reply-To: Observium Network Observation System <observium@observium.org>
Date: Friday 15 January 2016 at 22:30
To: "observium@observium.org" <observium@observium.org>
Subject: Re: [Observium] [ALERT-CHECKER] Alert to pick up on physical failure of disk.
If it's writing RRDs, it should be calling the alerting code.
You can test this by running the poller in debug mode:
./poller.php -h <host> -m status -d -r
(-r disables rrd writing, so you don't dirty up the rrd files)
adam.
On 15/01/2016 22:24:38, Henrik Cednert (Filmlance) <henrik.cednert@filmlance.se> wrote:
Mkay. =/ So even if the "Status" variable/data/entry/cell (or whatever it is) for this particular disk is storing and logging "0" in the database and the RRD files, it's nothing we can alert on or react to? =/
From: observium <observium-bounces@observium.org> on behalf of Adam Armstrong <adama@memetic.org>
Reply-To: Observium Network Observation System <observium@observium.org>
Date: Friday 15 January 2016 at 22:13
To: "observium@observium.org" <observium@observium.org>
Subject: Re: [Observium] [ALERT-CHECKER] Alert to pick up on physical failure of disk.
The issue with this is that our alerting code is called during the polling process, so it's only called for entities which exist.
adam.
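
To illustrate the point above, here is a minimal Python sketch of poll-driven alerting. It is not Observium's code: the function names, the dictionary layout and the disk names are assumptions made up for this example. It shows that a check evaluated only for entities present in the current poll never fires for an entity that has disappeared, and that spotting a disappearance needs a separate comparison against the previous poll (a hypothetical workaround, not a feature confirmed anywhere in this thread).

```python
OK_STATUS = 1  # per this thread, a status of 1 means ok/normal


def poll_and_check(reported):
    """Evaluate the status check only for entities present in this poll."""
    return [name for name, value in reported.items() if value != OK_STATUS]


def missing_entities(previous, reported):
    """Hypothetical extra step: entities seen last poll but absent now."""
    return sorted(set(previous) - set(reported))


# Assumed example data: "Disk 12" fails and vanishes from the device output.
before = {"Disk 11": 1, "Disk 12": 1}
after = {"Disk 11": 1}

print(poll_and_check(after))            # the check has nothing to fire on
print(missing_entities(before, after))  # the comparison notices the gap
```

In this sketch the per-entity check returns nothing for the second poll, because the failed disk is simply no longer in the data being checked.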
On 15/01/2016 22:11:12, Henrik Cednert (Filmlance) <henrik.cednert@filmlance.se> wrote:
But you do log status for it. And I assume everything is and/or could be watchable. So, maybe a bit ignorant, but isn't it "just" a matter of monitoring "status" as seen in my first screenshot? When != 1, alert...? I mean, there's still data in Observium that it could, in theory, react to.
From: observium <observium-bounces@observium.org> on behalf of Adam Armstrong <adama@memetic.org>
Reply-To: Observium Network Observation System <observium@observium.org>
Date: Friday 15 January 2016 at 22:08
To: "observium@observium.org" <observium@observium.org>
Subject: Re: [Observium] [ALERT-CHECKER] Alert to pick up on physical failure of disk.
This means that the device stops reporting stats for that entity, which is a little... unfortunate.
It's difficult (impossible) to alert on things which are removed from the device upon failure.
adam.
On 15/01/2016 20:22:56, Henrik Cednert (Filmlance) <henrik.cednert@filmlance.se> wrote:
Hi Adam
Thanks. The thing is, I already have that alert and it doesn't pick up on this particular event. Not sure whether it's the way the Synology handles it or something else. But when a disk dies, the graphs and everything else just go missing; see disk 12 on the screenshot.
It's pretty much the same in the Synology UI when a disk is dead: not flagged or tagged as failed, just removed from the lists. Pretty stupid, yes, I know, but nonetheless I have to deal with it. =/ Hence the question about alerting on status != 1. =)
Cheers
From: observium <observium-bounces@observium.org> on behalf of Adam Armstrong <adama@memetic.org>
Reply-To: Observium Network Observation System <observium@observium.org>
Date: Friday 15 January 2016 at 20:03
To: "observium@observium.org" <observium@observium.org>
Subject: Re: [Observium] [ALERT-CHECKER] Alert to pick up on physical failure of disk.
Hi Henrik,
This is a "status" entity check. You probably just want to create a check for all status, since it's easier to manage.
http://alpha.memetic.org/~adama/snaps/Observium_Dev____Alert_check_-_Google_...
adam.
On 15/01/2016 18:31:02, Henrik Cednert (Filmlance) <henrik.cednert@filmlance.se> wrote:
Hi there
I've gotten some hints on IRC but can't really wrap my head around how to set it up. Yeah, I know I'm probably stupid. But I can't find a complete list of all available commands, and it doesn't complain if I feed it something invalid. So it's really a guessing game. =/
So the checker needs to do something like:
if status of physical_class('storage' or 'hrDeviceDiskStorage') != 1 send alert
Possible? Please give detailed instructions if possible.
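
For readers following along, the condition above can be written out as a small Python sketch. This is only an illustration of the intended logic, not Observium's alert-checker syntax: the `StatusEntry` type, its field names and the sample entries are assumptions invented for this example, and the value 1 is taken to mean ok/normal as described elsewhere in this thread.

```python
from dataclasses import dataclass

OK_STATUS = 1  # assumed "ok/normal" value, as described in this thread
DISK_CLASSES = {"storage", "hrDeviceDiskStorage"}  # classes named in the question


@dataclass
class StatusEntry:
    # Hypothetical record layout, for illustration only.
    name: str
    physical_class: str
    value: int


def failing_disks(entries):
    """Return names of disk-class entries whose status is not OK_STATUS."""
    return [
        e.name
        for e in entries
        if e.physical_class in DISK_CLASSES and e.value != OK_STATUS
    ]


# Made-up sample data: one healthy disk, one failed disk, one non-disk entry.
entries = [
    StatusEntry("Disk 11", "storage", 1),
    StatusEntry("Disk 12", "storage", 0),
    StatusEntry("Fan 1", "fan", 0),
]
print(failing_disks(entries))
```

The non-disk entry is ignored even though its value is not 1, because the filter on physical class is applied first.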
It would be sweet if all the different possible combinations of alert checkers were added to the demo instance so one can look there for guidance. =)
Cheers and thanks
_______________________________________________
observium mailing list
observium@observium.org
http://postman.memetic.org/cgi-bin/mailman/listinfo/observium