Page 1 of 6

Passive check freshness check not working

Posted: Tue Dec 06, 2016 5:04 pm
by snapon_admin
I have a set of passive checks that run on various servers every 15 minutes and send data to Nagios. For these checks I have a freshness threshold of 1 hour set so that if no passive results are received after an hour the status is changed to Warning and an alert is sent about it. For some reason all of these checks are running that script (that's only supposed to be ran after the 1 hour threshold is hit) after only 1 minute. Every time I restart the nagios service the checks are greed, and then 1 minute later they all run the freshness check script.
Here's the check settings for the service check:
broken freshness.png
This is XI 5.3.0.

Re: Passive check freshness check not working

Posted: Tue Dec 06, 2016 5:13 pm
by snapon_admin
I also cannot seem to enable/disable notifications. This is the second time in about a month or so I've had similar issues. Last time the only fix was to reboot the server completely, obviously not ideal. I am probably going to have to do the same here as erroneous alerts are continuing to be sent for checks that should not be alerting.

Edit: Ignore this issue. Just rebooted the server and I seem to be able to enable notifications again. The freshness check issue still remains though.

Re: Passive check freshness check not working

Posted: Tue Dec 06, 2016 5:37 pm
by avandemore
Probably a silly question but let's just cross it off the list, did you do an Apply Configuration?

If so what is the name of host and service which is misbehaving?

XI > Admin > System Profile > Download Profile

Please include the zip file in your response. You can PM myself or other support personnel if you have privacy concerns.

Need the profile to cross reference the host/service.

I remember your previous complaint about enable/disable notifications, we can tackle that later if you wish.

Re: Passive check freshness check not working

Posted: Wed Dec 07, 2016 10:00 am
by snapon_admin
The name of the service is Glodengate Processes and the host names are:

lisaerp01p on lisprod02g
lisdbms13p on lisprod04g
lisdbms14p on lisprod04g
lisdbqy01p on lisprod04g
lisdbqy02p on lisprod04g
lisdbqy13p on lisprod04g
lishadb13p on lisprod02g
lishadb26p on lisprod04g

Re: Passive check freshness check not working

Posted: Wed Dec 07, 2016 10:55 am
by avandemore
I have received the profile and am looking through it. Has this worked in the past?

Re: Passive check freshness check not working

Posted: Wed Dec 07, 2016 10:57 am
by snapon_admin
Has been working fine for a little over a year now. Occasionally Nagios will stop receiving passive results for these checks for unknown reasons, but that's why I implemented the freshness check. Usually when Nagios stops receiving passive results from the script all I have to do is apply config to fix it. That is not the case since yesterday shortly before I started this post.

Re: Passive check freshness check not working

Posted: Wed Dec 07, 2016 11:20 am
by avandemore
Please set the service to this:

Retain Status information = On
Is Volatile = Off

Test and report the results.

Re: Passive check freshness check not working

Posted: Wed Dec 07, 2016 12:03 pm
by snapon_admin
After making the change and applying config, 3 or 4 of the checks went green for about 30 seconds and then back to the warning: no passive checks received alert.

Re: Passive check freshness check not working

Posted: Wed Dec 07, 2016 12:52 pm
by avandemore
Please set check_interval =< the interval checks are reported, in your case say 20 minutes.

If you look in /usr/local/nagios/var/retention.dat does the service in question have the correct last_check= value?

Also see here:
https://assets.nagios.com/downloads/nag ... tions.html

Re: Passive check freshness check not working

Posted: Wed Dec 07, 2016 1:21 pm
by snapon_admin
It's already set to 1 minute.

Yes.

And what, specifically am I looking at on this page? I'm mostly confused as to why this check suddenly decided to stop functioning when it's been working fine for over a year and I haven't changed anything that would affect this.