I got alert mail in every 5 minutes. Bug? V2 :)
![](https://secure.gravatar.com/avatar/31624109ddddac107dccfee4b36d9545.jpg?s=120&d=mm&r=g)
Hi All,
A few weeks ago I wrote to the list because we had found an alert bug: we were receiving a lot of mails (one every 5 minutes) instead of just one every 24 hours.
That Observium install had been upgraded from 2015.07 CE to the latest version (CE.0.16.1.7533). We thought the problem was caused by the upgrade process: the alert function was new, and something had gone wrong.
Now I have done a clean install of the latest Observium CE from the .tgz. I added only one host and the same alert, and I am again receiving the alert mail every 5 minutes. I didn't touch any file except config.php, which I edited only to set Observium's SQL access!
I added a host (emaussw01). It is a Cisco Nexus device with 7 FEX modules. I can see the FEX temperatures (4 values from each FEX); all are under 50C except for one module, "FEX-111 Module-1 Die-1", which is over 50C.
I added an alert; its export is this:
<?xml version="1.0"?>
<templates>
  <template type="alert" description="Autogenerated observium template" version="0.91" created="Thu, 10 Mar 2016 15:29:04 +0100" observium="0.16.1.7533" id="73249fd2b23ef5f715d749ce2e98be41">
    <entity_type>sensor</entity_type>
    <name>emaussw01 temp check</name>
    <message>Temp alarm!</message>
    <severity>crit</severity>
    <suppress_recovery>0</suppress_recovery>
    <delay>0</delay>
    <conditions_and>1</conditions_and>
    <conditions>sensor_value gt 50</conditions>
    <conditions_complex>sensor_value gt 50</conditions_complex>
    <associations>
      <device>hostname eq emaussw01</device>
      <entity>sensor_class is temperature</entity>
    </associations>
  </template>
</templates>
I associated one user with this check, to be notified by email.
The alert history is:
| Date | Device | Alert Check | Entity | Message | Status |
|---|---|---|---|---|---|
| 2016-03-10 15:38:26 | emaussw01 | emaUSSW01 temp check | Fex-111 Module-1 Die-1 | Alert notification sent | ALERT_NOTIFY |
| 2016-03-10 15:34:15 | emaussw01 | emaUSSW01 temp check | Fex-111 Module-1 Die-1 | Alert notification sent | ALERT_NOTIFY |
| 2016-03-10 15:29:12 | emaussw01 | emaUSSW01 temp check | Fex-111 Module-1 Die-1 | Alert notification sent | ALERT_NOTIFY |
| 2016-03-10 15:24:06 | emaussw01 | emaUSSW01 temp check | Fex-111 Module-1 Die-1 | Alert notification sent | ALERT_NOTIFY |
| 2016-03-10 15:18:29 | emaussw01 | emaUSSW01 temp check | Fex-111 Module-1 Die-1 | Alert notification sent | ALERT_NOTIFY |
| 2016-03-10 15:14:07 | emaussw01 | emaUSSW01 temp check | Fex-111 Module-1 Die-1 | Alert notification sent | ALERT_NOTIFY |
| 2016-03-10 15:08:29 | emaussw01 | emaUSSW01 temp check | Fex-111 Module-1 Die-1 | Alert notification sent | ALERT_NOTIFY |
| 2016-03-10 15:03:51 | emaussw01 | emaUSSW01 temp check | Fex-111 Module-1 Die-1 | Alert notification sent | ALERT_NOTIFY |
And I always got an email (about 1 minute earlier) whenever ALERT_NOTIFY appears in the log:
# grep -i "Date: Thu" /var/spool/mail/obstestuser
Date: Thu, 10 Mar 2016 15:02:50 +0100
Date: Thu, 10 Mar 2016 15:07:28 +0100
Date: Thu, 10 Mar 2016 15:13:07 +0100
Date: Thu, 10 Mar 2016 15:17:28 +0100
Date: Thu, 10 Mar 2016 15:23:06 +0100
Date: Thu, 10 Mar 2016 15:28:11 +0100
Date: Thu, 10 Mar 2016 15:33:14 +0100
Date: Thu, 10 Mar 2016 15:37:26 +0100
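To make the cadence in the mail spool explicit, here is a minimal Python sketch that computes the gap between consecutive Date headers. The header values are copied from the grep output above; the function name `gaps_in_seconds` is only an illustrative choice, not part of Observium.

```python
from email.utils import parsedate_to_datetime

# Date headers copied from the grep output in the post.
MAIL_DATES = [
    "Thu, 10 Mar 2016 15:02:50 +0100",
    "Thu, 10 Mar 2016 15:07:28 +0100",
    "Thu, 10 Mar 2016 15:13:07 +0100",
    "Thu, 10 Mar 2016 15:17:28 +0100",
    "Thu, 10 Mar 2016 15:23:06 +0100",
    "Thu, 10 Mar 2016 15:28:11 +0100",
    "Thu, 10 Mar 2016 15:33:14 +0100",
    "Thu, 10 Mar 2016 15:37:26 +0100",
]


def gaps_in_seconds(date_headers):
    """Return the gaps, in seconds, between consecutive RFC 2822 dates."""
    stamps = [parsedate_to_datetime(h) for h in date_headers]
    return [int((b - a).total_seconds()) for a, b in zip(stamps, stamps[1:])]


gaps = gaps_in_seconds(MAIL_DATES)
print(gaps)                   # gap between each pair of mails, in seconds
print(sum(gaps) / len(gaps))  # mean gap, in seconds
```

Every gap falls between four and six minutes, which would be consistent with one notification per polling cycle rather than one per day.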
What can I debug to resolve this problem?
Thanks!
Regards Laszlo
![](https://secure.gravatar.com/avatar/0fa97865a0e1ab36152b6b2299eedb49.jpg?s=120&d=mm&r=g)
Repeating alerts are likely caused by MySQL being in strict mode.
Adam.
observium mailing list observium@observium.org http://postman.memetic.org/cgi-bin/mailman/listinfo/observium
![](https://secure.gravatar.com/avatar/0fa97865a0e1ab36152b6b2299eedb49.jpg?s=120&d=mm&r=g)
Specifically, you may see failed SQL inserts/updates to the alert_table table due to strict-mode restrictions. These prevent Observium from storing the time at which notifications were sent, which causes the notification to be resent on the next polling cycle.
adam.
![](https://secure.gravatar.com/avatar/31624109ddddac107dccfee4b36d9545.jpg?s=120&d=mm&r=g)
Dear Adam,
I installed this Linux system (Ubuntu 15.10 Desktop) solely to test Observium CE. It was a clean install: I just downloaded Observium and installed it following its documentation.
I use mysql-server version 5.6.28-0ubuntu0.15 from the Ubuntu repo, and strict mode is not enabled (I modified the MySQL config to make sure):
mysql> SELECT @@sql_mode;
+------------------------+
| @@sql_mode             |
+------------------------+
| NO_ENGINE_SUBSTITUTION |
+------------------------+
1 row in set (0,00 sec)
But even after that (and a full reboot) I still received the alert every 5 minutes :(
However, after I modified includes/alerts.inc.php as Radu suggested, the 5-minute alerts stopped, and everything works like a charm.
BR Laszlo
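For readers who want to repeat the strict-mode check from the `@@sql_mode` output above, the following is a small Python sketch. It assumes only that strict mode shows up in `@@sql_mode` as `STRICT_TRANS_TABLES` or `STRICT_ALL_TABLES` (or the combined `TRADITIONAL` mode); the function name `is_strict` is hypothetical.

```python
# Mode names that indicate MySQL strict mode.
# TRADITIONAL is a combination mode that includes both strict flags.
STRICT_FLAGS = {"STRICT_TRANS_TABLES", "STRICT_ALL_TABLES", "TRADITIONAL"}


def is_strict(sql_mode: str) -> bool:
    """Return True if the comma-separated sql_mode value enables strict mode."""
    modes = {m.strip().upper() for m in sql_mode.split(",") if m.strip()}
    return bool(modes & STRICT_FLAGS)


# Value reported in the post:
print(is_strict("NO_ENGINE_SUBSTITUTION"))
# A typical strict configuration, for comparison:
print(is_strict("STRICT_TRANS_TABLES,NO_ENGINE_SUBSTITUTION"))
```

With the value reported here the check comes out negative, which matches the observation that strict mode does not explain the repeated alerts on this install.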
![](https://secure.gravatar.com/avatar/0fa97865a0e1ab36152b6b2299eedb49.jpg?s=120&d=mm&r=g)
Hmm.
This is odd. I'm not entirely sure how this can be the case.
Do you have that variable in your includes/defaults.inc.php?
Adam.
![](https://secure.gravatar.com/avatar/336b03f5d18f569f28f8ce37c512b266.jpg?s=120&d=mm&r=g)
Hello,
I had the same issue with CE 0.16.1.7533. After debugging it a bit, I found that the values from config.php don't get properly loaded when alerts are processed. This might be a problem with my apache/php configuration or a bug in Observium (a missing require somewhere, or something similar).
As I didn't have much time to spend on debugging this, I just hardcoded the 1-day value in includes/alerts.inc.php. I replaced the following line:
if ((time() - $entry['last_alerted']) < $config['alerts']['interval'] && !isset($GLOBALS['spam'])) { $entry['suppress_alert'] = TRUE; }
with:
if ((time() - $entry['last_alerted']) < 86400 && !isset($GLOBALS['spam'])) { $entry['suppress_alert'] = TRUE; }
After that it just works as expected for me.
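The effect of the change can be illustrated with a rough Python model of the check quoted above. This is not Observium's code: the function `should_suppress` is hypothetical, the `spam` override is left out, and treating a missing interval as "never suppress" is an assumption that mirrors how PHP's loose comparison evaluates `<` against an unset (null) value.

```python
import time


def should_suppress(last_alerted, interval, now=None):
    """Model of the quoted check: suppress a repeat notification while
    fewer than `interval` seconds have passed since the last one."""
    now = time.time() if now is None else now
    if interval is None:
        # Assumption: an interval that was never loaded from the config.
        # In PHP, `<` against null evaluates to false, so nothing is suppressed.
        return False
    return (now - last_alerted) < interval


# Last notification sent at t=1000; the next polling cycle runs 300 s later.
print(should_suppress(1000, 86400, now=1300))  # one-day interval set
print(should_suppress(1000, None, now=1300))   # interval missing
```

Under this model a missing interval lets every polling cycle send a new mail, while the hardcoded 86400 seconds suppresses repeats for a day. Whether the interval was really unset on the affected installs is the hypothesis raised in this post, not something the thread confirms.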
HTH,
Radu
![](https://secure.gravatar.com/avatar/0fa97865a0e1ab36152b6b2299eedb49.jpg?s=120&d=mm&r=g)
I think you must have broken something to result in this problem, since it doesn't seem to affect other installs.
Adam.
participants (3)
- Adam Armstrong
- Laszlo Nemeth
- Radu Anghel