We are trying to get our XI environment up to speed. Currently running 5.4.8. A major issue i have is that child hosts send recovery emails that the host is up after its parent recovers from being down. So when the host recovers from being unreachable it sends a recovery email. We do not receive unreachable emails because we are trying to reduce noise. If the router for the site is down we dont want to be alerted about any hosts that are under that parent and the same for when the router recovers. No need to know all of the items under that router that are impacted.
Anyone else had Nagios consistently send recover emails after the unreachable status? Any settings I should double check for the hosts?
Child Hosts Send Alerts After Parent Recovers
Child Hosts Send Alerts After Parent Recovers
Last edited by REFan on Wed Feb 07, 2018 3:11 pm, edited 1 time in total.
-
dwhitfield
- Former Nagios Staff
- Posts: 4583
- Joined: Wed Sep 21, 2016 10:29 am
- Location: NoLo, Minneapolis, MN
- Contact:
Re: Child Hosts Send Alerts After Parent Recovers
I may be that the router is going down after the others. You can increase the frequency of checks on the router or decrease the frequency on others to help avoid this issue.
Can you PM me your Profile? You can download it by going to Admin > System Config > System Profile and click the ***Download Profile*** button towards the top. If for whatever reason you *cannot* download the profile, please put the output of View System Info (5.3.4+, Show Profile if older) in the thread (that will at least get us some info). This will give us access to many of the logs we would otherwise ask for individually. If security is a concern, you can unzip the profile take out what you like, and then zip it up again. We may end up needing something you remove, but we can ask for that specifically.
You can also generate a profile manually using the script at /usr/local/nagiosxi/html/includes/components/profile/getprofile.sh
That should generate a profile in /usr/local/nagiosxi/var/components/ which you can get off the server with an application such as FileZilla.
After you PM the profile, please update this thread. Updating this thread is the only way for it to show back up on our dashboard.
If you get an error that PROFILE BUILD FAILED, please see https://support.nagios.com/kb/article.p ... ategory=44
UPDATE: Profile shared with techs
Can you PM me your Profile? You can download it by going to Admin > System Config > System Profile and click the ***Download Profile*** button towards the top. If for whatever reason you *cannot* download the profile, please put the output of View System Info (5.3.4+, Show Profile if older) in the thread (that will at least get us some info). This will give us access to many of the logs we would otherwise ask for individually. If security is a concern, you can unzip the profile take out what you like, and then zip it up again. We may end up needing something you remove, but we can ask for that specifically.
You can also generate a profile manually using the script at /usr/local/nagiosxi/html/includes/components/profile/getprofile.sh
That should generate a profile in /usr/local/nagiosxi/var/components/ which you can get off the server with an application such as FileZilla.
After you PM the profile, please update this thread. Updating this thread is the only way for it to show back up on our dashboard.
If you get an error that PROFILE BUILD FAILED, please see https://support.nagios.com/kb/article.p ... ategory=44
UPDATE: Profile shared with techs
Last edited by dwhitfield on Fri Feb 09, 2018 11:32 am, edited 1 time in total.
Reason: pm received
Reason: pm received
Re: Child Hosts Send Alerts After Parent Recovers
Sorry I did not reply to this. Forgot to check back after the holidays.
I have not had this happen recently so I will keep an eye out and the next time I will reach out.
I have not had this happen recently so I will keep an eye out and the next time I will reach out.
-
dwhitfield
- Former Nagios Staff
- Posts: 4583
- Joined: Wed Sep 21, 2016 10:29 am
- Location: NoLo, Minneapolis, MN
- Contact:
Re: Child Hosts Send Alerts After Parent Recovers
Sounds good. We'll be here!
Re: Child Hosts Send Alerts After Parent Recovers
Is it best practice to reuse the thread or make a new one?
-
dwhitfield
- Former Nagios Staff
- Posts: 4583
- Joined: Wed Sep 21, 2016 10:29 am
- Location: NoLo, Minneapolis, MN
- Contact:
Re: Child Hosts Send Alerts After Parent Recovers
If it's the same issue, and you're the OP (like you are here), then you should use the same thread. Otherwise, start a new one!
Re: Child Hosts Send Alerts After Parent Recovers
Ok we had an instance last night where a site went down around 12:10 AM. I received an email that the router was down at 12:10, at 12:12 I received two emails that servers there recovered and that the router recovered. I checked these two hosts and they have the parent host set to that router so I dont know why those emails were received. There are other servers at that site that did not report a recovery but I may have notifications disabled for those.
I sent you an PM with the profile.
I sent you an PM with the profile.
-
kyang
Re: Child Hosts Send Alerts After Parent Recovers
It would be a good idea to install a Ramdisk. (You can run through the "Automatic RAM Disk Installation")
https://assets.nagios.com/downloads/nag ... giosXI.pdf
Could you send us your nagios.log file. A few recent nagios.log one from a few days back until now would work.
This directory should have your previous nagios.logs. Let me know what you have or just PM or post them to me.
Thanks!
https://assets.nagios.com/downloads/nag ... giosXI.pdf
Could you send us your nagios.log file. A few recent nagios.log one from a few days back until now would work.
This directory should have your previous nagios.logs. Let me know what you have or just PM or post them to me.
Code: Select all
/usr/local/nagios/var/archives