Page 1 of 1

Nagios alerts on inaceesible node

Posted: Wed Jan 29, 2020 2:41 pm
by ITOMB_IMT
Hi,

I am using Nagios Xi 5.6.7. i was told in earlier posts that nagios doesn't send service alerts if the node is inaccessible, instead only one host alert. but yesterday i had only one inaccessible node, i received service alerts but not host alert.
do i need to check or change any configuration in nagios?
Thanks,

Re: Nagios alerts on inaceesible node

Posted: Wed Jan 29, 2020 4:41 pm
by tgriep
If you run a State History report for that Host and Services, does it show that the Host was down before or after the Service Critical?
Does it show the host being down at all during that time?

One thing, if you have a short check interval for the service and a much longer check interval for the host check, it the timing is right, the service check could go critical before the host check and send the Notification.

Re: Nagios alerts on inaceesible node

Posted: Wed Jan 29, 2020 5:00 pm
by ITOMB_IMT
I wonder, i don't see any host alerts, either in state history or notifications. The server blade lost its connection and its offline all the time during those alerts.

All the host alerts and service alerts have the same check and recheck intervals.

Re: Nagios alerts on inaceesible node

Posted: Wed Jan 29, 2020 5:13 pm
by tgriep
What do you see for the current Host Status?
Are the stats updating?
Check the Advanced Tab and verify that Active Checks are enabled.

If there are duplicated nagios processes, that may be the cause.
Run this as root

Code: Select all

service nagios stop
killall -9 nagios
service nagios start
Then check the system to see if that Host is still updating and wait to see if the issue happens again.

Re: Nagios alerts on inaceesible node

Posted: Thu Jan 30, 2020 9:54 am
by ITOMB_IMT
I see current Host Status from Host status advanced tab. I have active checks enabled but am not sure why last check is updating.

See attached advanced status details.

Re: Nagios alerts on inaceesible node

Posted: Thu Jan 30, 2020 1:11 pm
by tgriep
Previous versions Nagios Core that came with older versions of XI had a but that could cause that issue so if the server was updated to 5.6.9, it should be running Nagios Core 4.4.5
Run this as root to verify the version.

Code: Select all

/usr/local/nagios/bin/nagios
Not to fix the issue, you will have to remove the status of that host so Core can reset the counters and update the data.

Stop the nagios process

Code: Select all

service nagios stop
Edit the /usr/local/nagios/var/retention.dat file

Find the entry for that Host and delete all of it so it will be updated.

Start the nagios process

Code: Select all

service nagios start
Wait for 5 minutes so the check can run and look as the GUI to see if the data is updated.

Re: Nagios alerts on inaceesible node

Posted: Mon Feb 03, 2020 9:55 am
by ITOMB_IMT
recently, i have upgraded to 5.6.10 this friday. so am on higher version. May i know the reason why it sent service alerts, not host alert if the host is inaccessible? the server lost its connection from the blade which shows its not available during the whole time.

Re: Nagios alerts on inaceesible node

Posted: Mon Feb 03, 2020 10:33 am
by tgriep
Could you post or PM me your Nagios XI System Profile so we can review it?
To get your system profile. Login to the Nagios XI GUI using a web browser.
Click the "Admin" > "System Profile" Menu
Click the "Download Profile" button
Save the profile.zip file and upload it to the forum post or PM it to me.

What is the name of the Host and Service in question?