Host recovery emails being sent while host is unreachable

This support forum board is for support questions relating to Nagios XI, our flagship commercial network monitoring solution.
tgriep
Madmin
Posts: 9190
Joined: Thu Oct 30, 2014 9:02 am

Re: Host recovery emails being sent while host is unreachabl

Post by tgriep »

The only thing I can think of is a routing table corruption and the ping is testing a bad entry in the table.
If you see it happen again, can you log in to the XI server, run a ping to that IP address, and see if it gets any replies?
Just a standard ping, not a plugin.
Thanks
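
For anyone who wants to script the manual test above, here is a minimal sketch. It assumes a Linux `ping` that accepts `-c` and prints an "N% packet loss" summary line; the helper names `parse_packet_loss` and `ping_once` are made up for illustration and are not part of Nagios.

```python
import re
import subprocess


def parse_packet_loss(ping_output: str):
    """Return the packet-loss percentage from ping's summary line, or None if absent."""
    match = re.search(r"(\d+(?:\.\d+)?)% packet loss", ping_output)
    return float(match.group(1)) if match else None


def ping_once(address: str, count: int = 5, timeout_s: int = 30):
    """Run the system ping (not a Nagios plugin) and return the loss percentage.

    Assumes Linux-style flags; other platforms use different options.
    """
    result = subprocess.run(
        ["ping", "-c", str(count), address],
        capture_output=True,
        text=True,
        timeout=timeout_s,
    )
    return parse_packet_loss(result.stdout)
```

Calling `ping_once("10.34.146.5")` on the XI server while a false recovery is being reported should return 100.0 if the host really is unreachable; anything lower would suggest that something is answering for that address.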
Be sure to check out our Knowledgebase for helpful articles and solutions!
BIB
Posts: 46
Joined: Tue Dec 27, 2016 3:13 pm

Re: Host recovery emails being sent while host is unreachabl

Post by BIB »

No, there is no ping reply. Here are the notifications from the last occurrence of this problem, after I changed the check period parameters as you suggested.
The host has been powered off for the last 2 weeks and has not been powered on since.
An important point is that we have another, older version of Nagios Core running, and it does not show the problem. The configuration was migrated from that old Nagios Core to the new Nagios XI instance. The problem occurs only on the new Nagios XI.

First notification received:
Notification Type: RECOVERY
Host: TLC_s_BG_146_2
State: UP
Address: 10.34.146.5
Info: WARNING - 10.34.146.5: rta 11.397ms, lost 80%
Date/Time: 2017-01-24 16:25:52


Second notification received:
Notification Type: PROBLEM
Service: TLC_PING 5000-70 10000-100
State: CRITICAL
Host: TLC_s_BG_146_2
Info: PING CRITICAL - Packet loss = 100%
Address: 10.34.146.5
Date/Time: 2017-01-24 16:27:43


Third notification received:
Notification Type: PROBLEM
Host: TLC_s_BG_146_2
State: DOWN
Address: 10.34.146.5
Info: CRITICAL - 10.34.146.5: rta nan, lost 100%
Date/Time: 2017-01-24 16:32:46
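
To compare the Info lines across many notifications, a rough sketch like the one below may help. It assumes the Info text always follows the "STATUS - address: rta ..., lost N%" shape seen in the two host notifications above; `parse_info` is a hypothetical helper, not a Nagios function.

```python
import re

# Matches host-check Info lines such as:
#   "WARNING - 10.34.146.5: rta 11.397ms, lost 80%"
#   "CRITICAL - 10.34.146.5: rta nan, lost 100%"
INFO_RE = re.compile(
    r"^(?P<status>OK|WARNING|CRITICAL|UNKNOWN) - (?P<address>\S+): "
    r"rta (?P<rta>nan|[\d.]+)(?:ms)?, lost (?P<lost>\d+)%"
)


def parse_info(info: str):
    """Split an Info line into status, address, round-trip time and loss."""
    match = INFO_RE.match(info)
    if match is None:
        return None
    lost = int(match.group("lost"))
    return {
        "status": match.group("status"),
        "address": match.group("address"),
        "rta_ms": float(match.group("rta")),  # float("nan") is valid
        "lost_pct": lost,
        # Anything below 100% loss means the plugin counted at least one reply.
        "saw_reply": lost < 100,
    }
```

For the RECOVERY notification above this gives `lost_pct` 80 and `saw_reply` True, so the plugin believed it received at least one reply even though the host was powered off.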
tgriep
Madmin
Posts: 9190
Joined: Thu Oct 30, 2014 9:02 am

Re: Host recovery emails being sent while host is unreachabl

Post by tgriep »

A couple of things to try: can you edit that host's check command, change it to check_ping, and see whether the host still exhibits the same issue after the change?
Or, you can upgrade to the latest plugins and see if it is fixed there.
You can download the plugins from the link below.
https://www.nagios.org/downloads/nagios-plugins/
BIB
Posts: 46
Joined: Tue Dec 27, 2016 3:13 pm

Re: Host recovery emails being sent while host is unreachabl

Post by BIB »

We tried changing the host's check command previously, with no success. We tried it again now, with the same result.
After the plugins upgrade there is no change; the problem still exists.

We have another instance of Nagios XI running at a remote site. We will configure the same hosts and services there and monitor the situation.
rkennedy
Posts: 6579
Joined: Mon Oct 05, 2015 11:45 am

Re: Host recovery emails being sent while host is unreachabl

Post by rkennedy »

BIB wrote: We tried changing the host's check command previously, with no success. We tried it again now, with the same result.
After the plugins upgrade there is no change; the problem still exists.

We have another instance of Nagios XI running at a remote site. We will configure the same hosts and services there and monitor the situation.
Really odd. Are things working properly at the new site?
Former Nagios Employee
tmcdonald
Posts: 9117
Joined: Mon Sep 23, 2013 8:40 am

Re: Host recovery emails being sent while host is unreachabl

Post by tmcdonald »

Just checking in since we have not heard from you in a while. Are things at the new site going well?
Former Nagios employee
BIB
Posts: 46
Joined: Tue Dec 27, 2016 3:13 pm

Re: Host recovery emails being sent while host is unreachabl

Post by BIB »

Sorry for not replying for a long time.

On the remote site there are no problems with this FALSE RECOVERY.
We performed a packet capture on the problematic Nagios XI server, but by that time the host had become reachable because it was plugged in again.
We will perform the packet capture again, this time monitoring another host which is unreachable and where we experience this problem, and will provide feedback as soon as possible.
tmcdonald
Posts: 9117
Joined: Mon Sep 23, 2013 8:40 am

Re: Host recovery emails being sent while host is unreachabl

Post by tmcdonald »

We'll keep it open for you.
Former Nagios employee
BIB
Posts: 46
Joined: Tue Dec 27, 2016 3:13 pm

Re: Host recovery emails being sent while host is unreachabl

Post by BIB »

During the last 16 hours we captured traffic on our Nagios XI server. No ping reply packet was captured, and there were 3 FALSE RECOVERY notifications.
Any other suggestions?
avandemore
Posts: 1597
Joined: Tue Sep 27, 2016 4:57 pm

Re: Host recovery emails being sent while host is unreachabl

Post by avandemore »

I think tgriep is right about there being some type of ARP funny business going on. It's entirely possible (and valid) to get an ICMP echo reply from an address you didn't send to.
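
One way to look for replies arriving from an unexpected address is to tally echo replies by source in the capture. The sketch below assumes the capture was dumped as text in the usual `tcpdump -n` one-line-per-packet form ("... IP src > dst: ICMP echo reply, ..."); the function names are hypothetical.

```python
import re
from collections import Counter

# Assumed tcpdump -n text line, for example:
#   "16:25:50.123456 IP 10.34.146.5 > 10.34.1.10: ICMP echo reply, id 1, seq 1, length 64"
REPLY_RE = re.compile(
    r"IP (?P<src>\d+\.\d+\.\d+\.\d+) > (?P<dst>\d+\.\d+\.\d+\.\d+): ICMP echo reply"
)


def echo_reply_sources(lines):
    """Count ICMP echo replies per source address."""
    counts = Counter()
    for line in lines:
        match = REPLY_RE.search(line)
        if match:
            counts[match.group("src")] += 1
    return counts


def unexpected_sources(lines, expected):
    """Return reply counts for every source other than the address that was pinged."""
    return {
        src: count
        for src, count in echo_reply_sources(lines).items()
        if src != expected
    }
```

If `unexpected_sources(lines, "10.34.146.5")` comes back non-empty around the time of a false recovery, that would point to some other device answering the check.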

Perhaps if we turn on stalking we'd be able to get some more detail out of this.

https://assets.nagios.com/downloads/nag ... lking.html

I'd also like to see an updated Nagios log file containing these entries. You can PM them to me or another tech if you have privacy concerns.
Previous Nagios employee