Check Freshness running early
Posted: Tue Sep 24, 2019 9:27 am
I have several Nagios XI 5.6.1 servers running on RHEL 7 64bit VM's. We have several passive checks, all setup with a freshness value of 7200 and the check command is "check_dummy" with ARG1 as 0 "Resetting check after 2 hours".
Here are some examples from our nagios.log file (Service and HOST name redacted) As you can see, Services 1 and 4 were reset more often than the 2 hour freshness check. The passive process is a custom log scraping we wrote and does not include the ability to send the resets themselves.
[1569277851] SERVICE ALERT: HOST1;SERVICE_1;OK;HARD;1;OK: Resetting check after 2 hours
[1569278149] SERVICE ALERT: HOST1;SERVICE_1;OK;HARD;1;OK: Resetting check after 2 hours
[1569278448] SERVICE ALERT: HOST1;SERVICE_1;OK;HARD;1;OK: Resetting check after 2 hours
[1569278747] SERVICE ALERT: HOST1;SERVICE_1;OK;HARD;1;OK: Resetting check after 2 hours
[1569279045] SERVICE ALERT: HOST1;SERVICE_1;OK;HARD;1;OK: Resetting check after 2 hours
[1569279344] SERVICE ALERT: HOST1;SERVICE_1;OK;HARD;1;OK: Resetting check after 2 hours
[1569279642] SERVICE ALERT: HOST1;SERVICE_1;OK;HARD;1;OK: Resetting check after 2 hours
[1569279941] SERVICE ALERT: HOST1;SERVICE_1;OK;HARD;1;OK: Resetting check after 2 hours
[1569280240] SERVICE ALERT: HOST1;SERVICE_1;OK;HARD;1;OK: Resetting check after 2 hours
[1569281134] SERVICE ALERT: HOST1;SERVICE_1;OK;HARD;1;OK: Resetting check after 2 hours
[1569281433] SERVICE ALERT: HOST1;SERVICE_1;OK;HARD;1;OK: Resetting check after 2 hours
[1569281732] SERVICE ALERT: HOST1;SERVICE_1;OK;HARD;1;OK: Resetting check after 2 hours
[1569282030] SERVICE ALERT: HOST1;SERVICE_1;OK;HARD;1;OK: Resetting check after 2 hours
[1569282329] SERVICE ALERT: HOST1;SERVICE_1;OK;HARD;1;OK: Resetting check after 2 hours
[1569284459] SERVICE ALERT: HOST1;SERVICE_2;OK;HARD;1;OK: Resetting check after 2 hours
[1569286497] SERVICE ALERT: HOST1;SERVICE_3;OK;HARD;1;OK: Resetting check after 2 hours
[1569287213] SERVICE ALERT: HOST1;SERVICE_4;OK;HARD;1;OK: Resetting check after 2 hours
[1569289004] SERVICE ALERT: HOST1;SERVICE_4;OK;HARD;1;OK: Resetting check after 2 hours
[1569291693] SERVICE ALERT: HOST1;SERVICE_4;OK;HARD;1;OK: Resetting check after 2 hours
[1569293185] SERVICE ALERT: HOST1;SERVICE_4;OK;HARD;1;OK: Resetting check after 2 hours
[1569308553] SERVICE ALERT: HOST1;SERVICE_5;OK;HARD;1;OK: Resetting check after 2 hours
[1569323646] SERVICE ALERT: HOST1;SERVICE_4;OK;HARD;1;OK: Resetting check after 2 hours
[1569324123] SERVICE ALERT: HOST1;SERVICE_3;OK;HARD;1;OK: Resetting check after 2 hours
Here is the config for Service 4 (Host and Service name redacted) Any assistance figuring out why we are resetting the check so often would be appreciated.
Here are some examples from our nagios.log file (Service and HOST name redacted) As you can see, Services 1 and 4 were reset more often than the 2 hour freshness check. The passive process is a custom log scraping we wrote and does not include the ability to send the resets themselves.
[1569277851] SERVICE ALERT: HOST1;SERVICE_1;OK;HARD;1;OK: Resetting check after 2 hours
[1569278149] SERVICE ALERT: HOST1;SERVICE_1;OK;HARD;1;OK: Resetting check after 2 hours
[1569278448] SERVICE ALERT: HOST1;SERVICE_1;OK;HARD;1;OK: Resetting check after 2 hours
[1569278747] SERVICE ALERT: HOST1;SERVICE_1;OK;HARD;1;OK: Resetting check after 2 hours
[1569279045] SERVICE ALERT: HOST1;SERVICE_1;OK;HARD;1;OK: Resetting check after 2 hours
[1569279344] SERVICE ALERT: HOST1;SERVICE_1;OK;HARD;1;OK: Resetting check after 2 hours
[1569279642] SERVICE ALERT: HOST1;SERVICE_1;OK;HARD;1;OK: Resetting check after 2 hours
[1569279941] SERVICE ALERT: HOST1;SERVICE_1;OK;HARD;1;OK: Resetting check after 2 hours
[1569280240] SERVICE ALERT: HOST1;SERVICE_1;OK;HARD;1;OK: Resetting check after 2 hours
[1569281134] SERVICE ALERT: HOST1;SERVICE_1;OK;HARD;1;OK: Resetting check after 2 hours
[1569281433] SERVICE ALERT: HOST1;SERVICE_1;OK;HARD;1;OK: Resetting check after 2 hours
[1569281732] SERVICE ALERT: HOST1;SERVICE_1;OK;HARD;1;OK: Resetting check after 2 hours
[1569282030] SERVICE ALERT: HOST1;SERVICE_1;OK;HARD;1;OK: Resetting check after 2 hours
[1569282329] SERVICE ALERT: HOST1;SERVICE_1;OK;HARD;1;OK: Resetting check after 2 hours
[1569284459] SERVICE ALERT: HOST1;SERVICE_2;OK;HARD;1;OK: Resetting check after 2 hours
[1569286497] SERVICE ALERT: HOST1;SERVICE_3;OK;HARD;1;OK: Resetting check after 2 hours
[1569287213] SERVICE ALERT: HOST1;SERVICE_4;OK;HARD;1;OK: Resetting check after 2 hours
[1569289004] SERVICE ALERT: HOST1;SERVICE_4;OK;HARD;1;OK: Resetting check after 2 hours
[1569291693] SERVICE ALERT: HOST1;SERVICE_4;OK;HARD;1;OK: Resetting check after 2 hours
[1569293185] SERVICE ALERT: HOST1;SERVICE_4;OK;HARD;1;OK: Resetting check after 2 hours
[1569308553] SERVICE ALERT: HOST1;SERVICE_5;OK;HARD;1;OK: Resetting check after 2 hours
[1569323646] SERVICE ALERT: HOST1;SERVICE_4;OK;HARD;1;OK: Resetting check after 2 hours
[1569324123] SERVICE ALERT: HOST1;SERVICE_3;OK;HARD;1;OK: Resetting check after 2 hours
Here is the config for Service 4 (Host and Service name redacted) Any assistance figuring out why we are resetting the check so often would be appreciated.