Thanks for sending that over. I'm not seeing the state change in the log. Can you send over nagios-09-10-2020-00.log as well?
It would be very helpful to force the service to generate a notification and then recover while we tail the logs. We can do this by sending passive check results directly to the service while watching the logs.
Open up a shell on the server and run the following tail command:
tail -F /var/log/maillog /usr/local/nagiosxi/tmp/phpmailer.log /usr/local/nagiosxi/var/eventman.log /usr/local/nagios/var/nagios.log
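If the combined output is too noisy, a filter along these lines may help when watching nagios.log on its own. This is only a sketch: the function name filter_nagios_events is made up for illustration, and the pattern assumes the usual "SERVICE ALERT", "SERVICE NOTIFICATION" and "SERVICE FLAPPING ALERT" entries that Nagios Core writes to its log.

```shell
#!/bin/sh
# Hypothetical helper: keep only service state-change, notification and
# flapping lines from Nagios log text read on stdin.
filter_nagios_events() {
    # --line-buffered makes grep print each match immediately,
    # which matters when the input comes from tail -F.
    grep --line-buffered -E 'SERVICE (ALERT|NOTIFICATION|FLAPPING ALERT):'
}

# Usage (run this yourself in a second shell):
#   tail -F /usr/local/nagios/var/nagios.log | filter_nagios_events
```

The unfiltered tail above is still what I'd like you to post if notifications don't show up, since the mail and event logs are not covered by this filter.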
1. In the Nagios XI web interface, find the Service and click on it
2. Click the + tab
3. Note these two rows:
State Type: Hard
Current Check: 1 of 4
These rows show the current State Type and the Current Check number. To generate a notification for a service, you will need to submit MULTIPLE problem check results. The number you need to submit is the last number in the Current Check row, which is the max_check_attempts setting.
For services, each passive check result that you submit will be a SOFT state until you have submitted enough to reach the Max Check Attempts setting defined on the service. Only then will the service enter a HARD problem state, which generates the notification (just remember, notifications are only sent on HARD states).
- Click the "Submit passive check result" link
- Select the Check Result and type in some text for the Check Output
- Click the Submit button
- Submit as many as you need, one right after another, until the service enters the HARD state so that a notification will be sent
When coming from a HARD problem state (whether we are talking about hosts or services), submitting a single OK passive result should fire off a recovery notification.
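If you would rather do this from the shell than from the web interface, the same passive results can be written to the Nagios external command file with PROCESS_SERVICE_CHECK_RESULT. The following is a rough sketch, not a tested procedure for your system: the function names are made up, "my-host" and "My Service" are placeholders, and the command file path is an assumption (check the command_file setting in nagios.cfg). It also assumes external commands are enabled and the service accepts passive checks.

```shell
#!/bin/sh
# Assumed default location; confirm with command_file in nagios.cfg.
CMD_FILE="${CMD_FILE:-/usr/local/nagios/var/rw/nagios.cmd}"

# Build one external command line.
# $1 host, $2 service, $3 return code (0 OK, 1 WARNING, 2 CRITICAL, 3 UNKNOWN),
# $4 check output, $5 optional Unix timestamp (defaults to now).
build_passive_result() {
    ts="${5:-$(date +%s)}"
    printf '[%s] PROCESS_SERVICE_CHECK_RESULT;%s;%s;%s;%s\n' "$ts" "$1" "$2" "$3" "$4"
}

# Append the same result several times in a row.
# $1 host, $2 service, $3 return code, $4 check output,
# $5 how many results to send, $6 file to write to.
submit_results() {
    i=1
    while [ "$i" -le "$5" ]; do
        build_passive_result "$1" "$2" "$3" "$4 ($i of $5)" >> "$6"
        i=$((i + 1))
    done
}

# Usage (placeholders, run as a user allowed to write to the command file):
#   submit_results "my-host" "My Service" 2 "Test CRITICAL" 4 "$CMD_FILE"
#   submit_results "my-host" "My Service" 0 "Test OK" 1 "$CMD_FILE"
```

The first usage line sends four CRITICAL results, matching a "1 of 4" Current Check value, and the second sends the single OK result for the recovery. Adjust the count to your own max_check_attempts.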
Also, if you have flap detection enabled and flapping is detected because the state keeps changing, notifications will be suppressed.
Let me know if you are able to generate a notification for a non-OK state and the corresponding recovery notification. Then you can cross-check this with Opsgenie.
If not, please post the full output of the tail command.
Thanks, Benjamin