Child Hosts Send Alerts After Parent Recovers
Posted: Mon Dec 04, 2017 2:56 pm
by REFan
We are trying to get our XI environment up to speed and are currently running 5.4.8. A major issue I have is that child hosts send recovery emails saying the host is up after their parent recovers from being down. In other words, when a child host recovers from being unreachable, it sends a recovery email. We do not receive unreachable emails because we are trying to reduce noise. If the router for a site is down, we don't want to be alerted about any hosts under that parent, and the same goes for when the router recovers. There is no need to hear about every item under that router that is impacted.
Has anyone else had Nagios consistently send recovery emails after the unreachable status? Are there any settings I should double-check for the hosts?
Re: Child Hosts Send Alerts After Parent Recovers
Posted: Tue Dec 05, 2017 10:50 am
by dwhitfield
It may be that the router is going down after the others. You can increase the check frequency on the router or decrease it on the other hosts to help avoid this issue.
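As a rough illustration of why check timing matters here, the sketch below estimates how long scheduled checks alone could take to put a host into a hard DOWN state. It is only a back-of-the-envelope model in Python: it assumes the default interval_length of 60 seconds (so intervals are in minutes), ignores on-demand parent checks and check latency, and every number in it is made up rather than taken from this environment. `minutes_to_hard_down` is a hypothetical helper, not anything shipped with Nagios XI.

```python
def minutes_to_hard_down(check_interval, retry_interval, max_check_attempts):
    """Rough worst-case minutes from the start of an outage to a HARD state.

    Assumes the outage begins just after a successful scheduled check:
    the first failed check arrives up to check_interval later, and the
    remaining attempts are spaced retry_interval apart.
    """
    return check_interval + (max_check_attempts - 1) * retry_interval


# Hypothetical settings: the router is checked as often as the servers
# but needs more attempts before it is considered hard DOWN.
router_minutes = minutes_to_hard_down(check_interval=5, retry_interval=1, max_check_attempts=5)
server_minutes = minutes_to_hard_down(check_interval=5, retry_interval=1, max_check_attempts=3)

# Hypothetical tightened router settings, checked more often with fewer attempts.
router_tuned_minutes = minutes_to_hard_down(check_interval=1, retry_interval=1, max_check_attempts=3)

print("router:", router_minutes)
print("server:", server_minutes)
print("router (tuned):", router_tuned_minutes)
```

With these made-up numbers the server could reach a hard state about two minutes before the router does, while the tuned router settings would put the router ahead of the server.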
Can you PM me your Profile? You can download it by going to Admin > System Config > System Profile and clicking the ***Download Profile*** button towards the top. If for whatever reason you *cannot* download the profile, please put the output of View System Info (5.3.4+, Show Profile if older) in the thread (that will at least get us some info). The profile will give us access to many of the logs we would otherwise ask for individually. If security is a concern, you can unzip the profile, take out what you like, and then zip it up again. We may end up needing something you remove, but we can ask for that specifically.
You can also generate a profile manually using the script at /usr/local/nagiosxi/html/includes/components/profile/getprofile.sh
That should generate a profile in /usr/local/nagiosxi/var/components/ which you can get off the server with an application such as FileZilla.
After you PM the profile, please update this thread. Updating this thread is the only way for it to show back up on our dashboard.
If you get an error that PROFILE BUILD FAILED, please see
https://support.nagios.com/kb/article.p ... ategory=44
UPDATE: Profile shared with techs
Re: Child Hosts Send Alerts After Parent Recovers
Posted: Wed Feb 07, 2018 2:53 pm
by REFan
Sorry I did not reply to this. Forgot to check back after the holidays.
I have not had this happen recently, so I will keep an eye out and reach out the next time it does.
Re: Child Hosts Send Alerts After Parent Recovers
Posted: Wed Feb 07, 2018 2:55 pm
by dwhitfield
Sounds good. We'll be here!
Re: Child Hosts Send Alerts After Parent Recovers
Posted: Wed Feb 07, 2018 3:07 pm
by REFan
Is it best practice to reuse the thread or make a new one?
Re: Child Hosts Send Alerts After Parent Recovers
Posted: Wed Feb 07, 2018 3:08 pm
by dwhitfield
If it's the same issue, and you're the OP (like you are here), then you should use the same thread. Otherwise, start a new one!
Re: Child Hosts Send Alerts After Parent Recovers
Posted: Fri Feb 09, 2018 10:02 am
by REFan
OK, we had an instance last night where a site went down around 12:10 AM. I received an email at 12:10 that the router was down, and at 12:12 I received two emails that servers there had recovered and that the router had recovered. I checked these two hosts and they have the parent host set to that router, so I don't know why those emails were sent. There are other servers at that site that did not report a recovery, but I may have notifications disabled for those.
I sent you a PM with the profile.
Re: Child Hosts Send Alerts After Parent Recovers
Posted: Fri Feb 09, 2018 2:54 pm
by kyang
It would be a good idea to install a RAM disk. (You can run through the "Automatic RAM Disk Installation" steps.)
https://assets.nagios.com/downloads/nag ... giosXI.pdf
Could you send us your nagios.log file? A few recent nagios.log files, from a few days back until now, would work.
This directory should have your previous nagios.logs. Let me know what you have or just PM or post them to me.
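If it helps to narrow things down before sending the logs, the relevant lines can be pulled out with a short script. The following is only a minimal sketch in Python, assuming the usual nagios.log line layout of `[epoch] HOST ALERT: host;state;state_type;attempt;output` and `[epoch] HOST NOTIFICATION: contact;host;state;command;output`. The host names, contact name, and sample lines are made up for illustration, and `host_events` is a hypothetical helper, not part of Nagios XI.

```python
import re
from datetime import datetime

# Matches "[1518153000] HOST ALERT: ..." and "[1518153000] HOST NOTIFICATION: ..."
LINE_RE = re.compile(r"^\[(\d+)\] (HOST ALERT|HOST NOTIFICATION): (.*)$")


def host_events(lines, hosts):
    """Yield (timestamp, kind, host, state) for log lines about the given hosts."""
    for line in lines:
        match = LINE_RE.match(line.strip())
        if not match:
            continue  # skip service lines, comments, and anything unrecognised
        epoch, kind, rest = match.groups()
        fields = rest.split(";")
        if len(fields) < 3:
            continue  # malformed or truncated line
        if kind == "HOST ALERT":
            host, state = fields[0], fields[1]  # host;state;state_type;attempt;output
        else:
            host, state = fields[1], fields[2]  # contact;host;state;command;output
        if host in hosts:
            yield datetime.fromtimestamp(int(epoch)), kind, host, state


# Made-up sample lines; in practice, read them from a nagios.log file instead.
sample = [
    "[1518153000] HOST ALERT: site-router;DOWN;HARD;5;CRITICAL - Host Unreachable",
    "[1518153120] HOST ALERT: site-server1;UP;HARD;1;PING OK",
    "[1518153120] HOST NOTIFICATION: admin;site-server1;UP;notify-host-by-email;PING OK",
    "[1518153125] SERVICE ALERT: site-server1;PING;OK;HARD;1;PING OK",
]

events = list(host_events(sample, {"site-router", "site-server1"}))
for when, kind, host, state in events:
    print(when.isoformat(), kind, host, state)
```

Running something like this over the real log for the router and the two servers should show whether the servers ever logged an UNREACHABLE alert before the UP notification went out, which is the sequence in question here.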
Thanks!