Nagios XI not sending email notifications on a server
Posted: Fri Feb 15, 2019 4:51 pm
Hello,
I have a critical system whose website recently stopped rendering. I have a URL check that verifies the site is rendering. When the Tomcat service stopped responding a couple of days ago, Nagios XI did not send any notifications to the team. On further investigation, I determined that the critical notification is not sending emails, even though the check does turn critical after the 3rd failure. Investigating further, I found that while the service check turns critical on the 3rd check failure, the state is not changing from "soft" to "hard". Has anyone experienced this same issue?
I have another application that is monitored in the same fashion, with the same Nagios agent, etc. The URL service check for that system works fine on failure. I cloned the working URL check from that system and configured it against the URL of the failing system. The cloned check notifies correctly. While I can recreate the URL service check on the system that is failing to notify, I really want to know what is causing this issue. I want to ensure that other systems are not in the same state, unable to email alerts once their checks have turned critical.
Thank you,
Juana