Notifications not sent as expected

Posted: Wed May 02, 2018 3:52 pm
by lndifferent
I assume I misconfigured something, but I am not sure what. At the moment the main switch at one location is down for maintenance, but I never received a notification about it. The Nagios home page says that 10 services are down (as expected), yet all hosts but one are shown as up. None of them should be up, since the main 6504 switch is down. WJES-Black shows as down only after I manually hit check now. The only notification I received is for the wjesvm.jcss.us host. I created dependencies so that when the 6504 is down, notifications are not sent for the other hosts, but as I said, I never received the notification that the 6504 itself is down, and that one was expected. I checked the notification settings, and all hosts are configured to be checked every minute and to send notifications immediately when down. See the attached screenshots. The weird part is that the host status page says the last checks were done in March.

Re: Notifications not sent as expected

Posted: Thu May 03, 2018 12:28 pm
by cdienger
Last checks or scheduled checks can fail for a variety of reasons; https://support.nagios.com/kb/article.php?id=19 covers some of them. You may also just want to restart everything to get it back to normal, using the commands I'll provide below. If you review the article and it doesn't help resolve the problem, and you still want to investigate the issue, please provide a system profile (Admin > System Config > System Profile > Download System Profile). PMing it to me or another tech on here would be best, as it can contain sensitive info.
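
To confirm from the command line which hosts have stale last checks, something like the following may help. It is only a rough sketch: the function name list_last_checks is made up for illustration, and it assumes the status file uses the usual "hoststatus { ... }" block layout with host_name= and last_check= lines. On a default install the file is typically /usr/local/nagios/var/status.dat, but check status_file in your nagios.cfg.

```shell
#!/bin/sh
# list_last_checks FILE (hypothetical helper, not part of Nagios)
# Prints "host_name<TAB>last_check" for every hoststatus block in FILE.
# last_check is a Unix epoch timestamp as written by Nagios.
list_last_checks() {
    awk '
        # Start of a host block; servicestatus blocks are ignored.
        /^[ \t]*hoststatus[ \t]*[{]/ { in_host = 1; name = ""; last = ""; next }
        # Keep everything after the first "=" as the value.
        in_host && /^[ \t]*host_name=/  { name = substr($0, index($0, "=") + 1) }
        in_host && /^[ \t]*last_check=/ { last = substr($0, index($0, "=") + 1) }
        # End of the block: emit one line per host.
        in_host && /^[ \t]*[}]/ { print name "\t" last; in_host = 0 }
    ' "$1"
}
```

Example use (path is an assumption): list_last_checks /usr/local/nagios/var/status.dat. A last_check value that is weeks old for a host set to a one minute interval suggests the checks are not being scheduled or their results are not being processed.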

If the KB article does not resolve your issue, please run the following commands in order and report any errors. If you have MariaDB, you ***must*** use mariadb instead of mysqld in the commands below.
service nagios stop
service ndo2db stop
service mysqld stop
service crond stop
service httpd stop
killall -9 nagios
killall -9 ndo2db
rm -f /usr/local/nagios/var/rw/nagios.cmd
rm -f /usr/local/nagios/var/nagios.lock
rm -f /usr/local/nagios/var/ndo.sock
rm -f /usr/local/nagios/var/ndo2db.lock
rm -f /usr/local/nagiosxi/var/reconfigure_nagios.lock
for i in $(ipcs -q | grep nagios | awk '{print $2}'); do ipcrm -q "$i"; done
service mysqld start
service ndo2db start
service nagios start
service httpd start
service crond start
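
If you are not sure whether the box runs mysqld or mariadb, the substitution in the commands above can be sketched roughly as below. Both function names are hypothetical, and the detection assumes a systemd host where "systemctl list-unit-files" lists mariadb.service; on an older init system you would check by hand instead. The second function only prints the stop and start order from the list above with the chosen name filled in. It does not run anything, and it leaves out the killall, rm and ipcrm cleanup steps.

```shell
#!/bin/sh
# pick_db_service UNIT_LISTING (hypothetical helper)
# Prints "mariadb" if the listing contains a mariadb.service line,
# otherwise falls back to "mysqld".
pick_db_service() {
    if printf '%s\n' "$1" | grep -q '^mariadb\.service'; then
        echo mariadb
    else
        echo mysqld
    fi
}

# print_restart_plan DB_SERVICE (hypothetical helper)
# Prints, without executing, the service stop/start order with the
# database service name substituted in.
print_restart_plan() {
    db=$1
    for svc in nagios ndo2db "$db" crond httpd; do
        echo "service $svc stop"
    done
    for svc in "$db" ndo2db nagios httpd crond; do
        echo "service $svc start"
    done
}
```

Example use: print_restart_plan "$(pick_db_service "$(systemctl list-unit-files 2>/dev/null)")". Review the printed lines before running them yourself.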