Page 1 of 1
State/availability report issue during host restarts
Posted: Wed Feb 03, 2021 8:57 am
by melchi
Hi,
We have this issue in Nagios 5.7.3 on which the service state history and the availability report shows unreasonable hours on critical state.
This happens everytime we restart the host machine. For instance, we restarted the server on Jan 17 for ~20mins and was correctly reflected to host state report (first image below) but it doesn't match with the service state history which shows 17h+ in critical state (second image). Was the service check keeps on waiting even if the host is up for this 17h timeframe?
I have also attached the check settings of the service we're trying to generate the availability report for the last 31 days. Logs in /usr/local/nagios/var/archives were also fine.
host state trend.JPG
service state history and availability report.JPG
service check settings.JPG
Regards,
Re: State/availability report issue during host restarts
Posted: Thu Feb 04, 2021 11:15 am
by benjaminsmith
Hi
@melchi,
That trends report is pulled straight from the Nagios log files, so whatever is in the logs should be reflected in the report. Does this host have any other services? Is the same behavior reflected in the other services? If not, then we should take a closer look at this particular service, maybe it didn't return to an OK state after re-booting.
Can you pull a State History report on this service over the period and make sure to select any
state type as both and
any state in the report options and upload to the thread or send it a PM.
Benjamin
Re: State/availability report issue during host restarts
Posted: Mon Feb 22, 2021 7:39 am
by melchi
Hi Benjamin,
Sorry for the delayed reply.
Please refer to the service log entries below in the generated availability report using sequence -> [Legacy Reports > Availability > Service(s) > Select Service > Report Period Last 31 Days].
As you can see, logs only updates every midnight this is why the next OK after the KO we had during the patching activity, is reported after 17h, causing the false Critical status.
Is there any way we can change the update schedule of the log? Because it seems not possible even after the restart of nagios core service.
service log entries.jpg
-Melchi
Re: State/availability report issue during host restarts
Posted: Mon Feb 22, 2021 6:48 pm
by benjaminsmith
Hi melchi,
Looking over the last screenshot, I see a couple of timeouts there, so it's likely there may have been some network issues that caused the service to fail even though the host was reporting up.
This can be resolved by increasing the timeout on the service check. If upload the system profile, and the exact name of this service, I can review the check command and make recommendations for you.
To send us your system profile.
Login to the Nagios XI GUI using a web browser.
Click the "Admin" > "System Profile" Menu
Click the "Download Profile" button
--Benjamin
Re: State/availability report issue during host restarts
Posted: Tue Feb 23, 2021 3:32 am
by melchi
Hi Benjamin,
I sent you pm for these details.
Thanks,
Melchi
Re: State/availability report issue during host restarts
Posted: Tue Feb 23, 2021 5:49 pm
by benjaminsmith
Hi Melchi,
For that particular service, it looks like you are using the following plugin for this service.
https://exchange.nagios.org/directory/P ... II/details
Try to add a
-t 60 option to the check command to increase the timeout option to reduce the number of timeouts. The default is 30 seconds.
Regarding, the other issue, to my knowledge, there is not an outstanding bug for this report. I would need all the nagios.logs for this time period to determine if the service was in a critical state for that time period or not.
Those files are located in te following directory:
--Benjamin
Re: State/availability report issue during host restarts
Posted: Wed Feb 24, 2021 4:19 am
by melchi
Hi Benjamin,
Sorry we just found out the issue. Its because when we generate the report we use the default one with the soft states not included that's why we get only hard states in the report.
You can lock this ticket. Thank you for your support!
-Melchi
Re: State/availability report issue during host restarts
Posted: Wed Feb 24, 2021 8:06 am
by scottwilkerson
melchi wrote:Hi Benjamin,
Sorry we just found out the issue. Its because when we generate the report we use the default one with the soft states not included that's why we get only hard states in the report.
You can lock this ticket. Thank you for your support!
-Melchi
Locking thread