Issues with ack alerts, multiple sms and post-upgrade issues
Posted: Thu Apr 30, 2015 8:11 pm
Hi guys,
Our version of NagiosXI has been playing up recently and we have a few issues:
1. Multiple contacts started to get 2 x SMS/Emails for service alerts over the past few days, yet the event log is suggesting only 1 event at a time. We checked with our SMS carrier and they couldn't see an issue at our end.
2. After acknowledging an alert, all of a sudden these acks would still show in the "Open service problems" window - its as if the ack didn't work even though the ack comments were clearly visible.
We decided to upgrade to the latest version of Nagios XI (from 2014R2.6 to 2014R2.7) to see if that would help, but no luck.
3. I had a problem with the upgrade where I had to remove some lines from the sudoers config file as detailed in this thread:
http://support.nagios.com/forum/viewtop ... 16&t=30826
After removing the 3 x lines for sudoers the upgrade worked ok. However we then try to apply new configuration and we are getting the same issue as we do after previous upgrade attempts.
Error such as:
Error: Could not find a service matching host name 'ftp.x.com.au' and description 'Ping' (config file '/usr/local/nagios/etc/serviceescalations.cfg', starting on line 38)
However this service escalation does not exist in our config.
I've seen this before and I applied the same fix as usual as detailed here: (remove the sudo command from config file)
http://support.nagios.com/forum/viewtop ... 16&t=31065
However this time we still get the same apply config error even after changing - previously we have never received this error after making this change.
I tried to roll-back the sudoers config change and also the change in point 2, before applying config and still the same issue.
Can you please help me to understand and fix what is going on?
Our version of NagiosXI has been playing up recently and we have a few issues:
1. Multiple contacts started to get 2 x SMS/Emails for service alerts over the past few days, yet the event log is suggesting only 1 event at a time. We checked with our SMS carrier and they couldn't see an issue at our end.
2. After acknowledging an alert, all of a sudden these acks would still show in the "Open service problems" window - its as if the ack didn't work even though the ack comments were clearly visible.
We decided to upgrade to the latest version of Nagios XI (from 2014R2.6 to 2014R2.7) to see if that would help, but no luck.
3. I had a problem with the upgrade where I had to remove some lines from the sudoers config file as detailed in this thread:
http://support.nagios.com/forum/viewtop ... 16&t=30826
After removing the 3 x lines for sudoers the upgrade worked ok. However we then try to apply new configuration and we are getting the same issue as we do after previous upgrade attempts.
Error such as:
Error: Could not find a service matching host name 'ftp.x.com.au' and description 'Ping' (config file '/usr/local/nagios/etc/serviceescalations.cfg', starting on line 38)
However this service escalation does not exist in our config.
I've seen this before and I applied the same fix as usual as detailed here: (remove the sudo command from config file)
http://support.nagios.com/forum/viewtop ... 16&t=31065
However this time we still get the same apply config error even after changing - previously we have never received this error after making this change.
I tried to roll-back the sudoers config change and also the change in point 2, before applying config and still the same issue.
Can you please help me to understand and fix what is going on?