
NRPE Disk Check sending out Recovery alerts for no reason

Posted: Wed Nov 28, 2018 6:44 pm
by rferebee
Hello,

We've been receiving reports from one of our Nagios XI alert recipients that some of their Linux servers are randomly sending out RECOVERY Service Alert notifications for Disk Checks.

We cannot find anything in XI that would be causing these alerts to be sent out since the service never goes CRITICAL. There shouldn't be anything for the service to recover from as far as we can tell.

Is there anything else I can look at to figure out what is causing these RECOVERY messages to be sent out?

Thank you.

Re: NRPE Disk Check sending out Recovery alerts for no reason

Posted: Thu Nov 29, 2018 9:22 am
by scottwilkerson
What version of XI are you running?

There was a bug prior to 5.5.7 that could cause this behavior.

Re: NRPE Disk Check sending out Recovery alerts for no reason

Posted: Thu Nov 29, 2018 10:00 am
by rferebee
We just upgraded to 5.5.7 on Tuesday and it occurred again yesterday (Wednesday). We were having the issue prior to the upgrade.

Re: NRPE Disk Check sending out Recovery alerts for no reason

Posted: Thu Nov 29, 2018 10:05 am
by scottwilkerson
rferebee wrote: We just upgraded to 5.5.7 on Tuesday and it occurred again yesterday (Wednesday). We were having the issue prior to the upgrade.
The issue was caused by the notification number not being reset when hosts/services go back into an OK state.

It is possible that you could still get a few recoveries until all of them have cycled.

The only way around this would be to stop Nagios, remove the retention.dat file, and restart Nagios. This does, however, have the side effect of potentially losing flapping data and comment history.
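
Before removing retention.dat, it may help to see which services are still in the state described above: OK, but with a notification number that was never reset. The following is a minimal Python sketch, not an official Nagios tool. It assumes the retention file uses `service { ... }` blocks of `key=value` lines containing `host_name`, `service_description`, `current_state` (0 meaning OK), and `current_notification_number`. The function names are made up for illustration.

```python
from typing import Dict, Iterator, List, Tuple


def iter_blocks(text: str) -> Iterator[Tuple[str, Dict[str, str]]]:
    """Yield (block_type, fields) for each 'type { key=value ... }' block."""
    block_type = None
    fields: Dict[str, str] = {}
    for raw in text.splitlines():
        line = raw.strip()
        if block_type is None:
            # A block opens with a line such as "service {" or "host {".
            if line.endswith("{"):
                block_type = line[:-1].strip()
                fields = {}
        elif line == "}":
            yield block_type, fields
            block_type = None
        elif "=" in line:
            key, _, value = line.partition("=")
            fields[key] = value


def to_int(value: str) -> int:
    """Parse an integer field, treating missing or malformed values as 0."""
    try:
        return int(value)
    except (TypeError, ValueError):
        return 0


def stale_notifications(text: str) -> List[Tuple[str, str, int]]:
    """Return (host, service, notification_number) for services that are OK
    but still carry a non-zero notification number."""
    stale = []
    for block_type, fields in iter_blocks(text):
        if block_type != "service":
            continue  # host blocks and other block types are ignored here
        number = to_int(fields.get("current_notification_number", "0"))
        # Assumption: current_state=0 means OK.
        if fields.get("current_state") == "0" and number > 0:
            stale.append(
                (fields.get("host_name", ""),
                 fields.get("service_description", ""),
                 number)
            )
    return stale
```

To use it, read the retention file into a string and pass it to `stale_notifications()`. The file's location depends on the install; /usr/local/nagios/var/retention.dat is a common default, but treat that path as an assumption and check `state_retention_file` in nagios.cfg. The file is only written periodically and at shutdown, so the result may lag behind the running state.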

Re: NRPE Disk Check sending out Recovery alerts for no reason

Posted: Mon Dec 03, 2018 10:54 am
by rferebee
Thank you, you can lock this thread.

Re: NRPE Disk Check sending out Recovery alerts for no reason

Posted: Mon Dec 03, 2018 11:13 am
by scottwilkerson
rferebee wrote: Thank you, you can lock this thread.
Great!

Locking