Server issues when multiple hosts were down

lmiltchev
Bugs find me
Posts: 13589
Joined: Mon May 23, 2011 12:15 pm

Re: Server issues when multiple hosts were down

Post by lmiltchev »

So we have to start creating these wrapper scripts to work around this issue.
Let us know if this works out for you.
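For readers who have not seen the wrapper approach, here is a minimal sketch of what such a script might look like. It is not an official Nagios script: the file name `skip_if_down.py` and the functions `run_wrapped` and `main` are made up for illustration. It assumes the command definition passes the standard `$HOSTSTATE$` macro as the first argument, followed by the real plugin command line.

```python
#!/usr/bin/env python3
"""Hypothetical wrapper: skip a service check plugin when the host is not UP."""
import subprocess

# Standard Nagios plugin exit codes.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3


def run_wrapped(host_state, plugin_argv, runner=subprocess.run):
    """Return (exit_code, output) for one service check.

    host_state  -- value of the $HOSTSTATE$ macro (UP, DOWN or UNREACHABLE)
    plugin_argv -- the real plugin command line, as a list of strings
    runner      -- injectable for testing; defaults to subprocess.run
    """
    if host_state.strip().upper() != "UP":
        # Host is DOWN or UNREACHABLE: do not run the plugin at all.
        return UNKNOWN, f"UNKNOWN - host is {host_state}, service check skipped"
    if not plugin_argv:
        return UNKNOWN, "UNKNOWN - no plugin command given"
    result = runner(plugin_argv, capture_output=True, text=True)
    return result.returncode, result.stdout.strip()


def main(argv):
    """Entry point; as a script, end the file with sys.exit(main(sys.argv))."""
    if len(argv) < 2:
        print("UNKNOWN - usage: skip_if_down.py <host state> <plugin> [args...]")
        return UNKNOWN
    code, output = run_wrapped(argv[1], argv[2:])
    print(output)
    return code
```

A command definition could then use something like `command_line /usr/local/bin/skip_if_down.py "$HOSTSTATE$" $USER1$/check_http -H $HOSTADDRESS$` (the wrapper path is an assumption). Note that Core still schedules and launches the wrapper for every service; this only avoids running the plugin itself, and it relies on the last known host state.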
Fred Kroeger
Posts: 588
Joined: Wed Oct 19, 2011 11:36 pm
Location: Perth, Western Australia

Re: Server issues when multiple hosts were down

Post by Fred Kroeger »

No, it's not going to work for me. Why can't we have a proper solution instead of all of us having to create and install wrappers?
For me this is just a basic requirement for an enterprise-class monitoring system: "If the host is down, stop checking services for that host."

regards... Fred
tmcdonald
Posts: 9117
Joined: Mon Sep 23, 2013 8:40 am

Re: Server issues when multiple hosts were down

Post by tmcdonald »

From our position on the support side, we can't make major changes like this to the logic of Core. That would require a developer and a process of "Submit feature request -> implement -> test -> push changes", which can take time. We try to offer temporary workarounds like wrapper scripts in the meantime, and understand completely that this is not always the optimal solution.

With regard to this feature request specifically, I know there were a few in the past that were closed for one reason or another. Since we have an open one now - http://tracker.nagios.org/view.php?id=666 - I will speak to the devs and get a fresh perspective on things.

Now, if I may contradict myself for a bit: while we on the support team cannot always make major changes, we often will make smaller ones. Bandit, I am sure, can vouch for me on this one. I am not a full-time C developer, but I know enough to be dangerous. Since there is a lot of interest in this feature, I will see what I can do about patching it in. I want to make it clear that this would not be an "official" patch. It would very much be my own and would not be guaranteed to work. It might not work at all, or it might cause problems of its own. I am only making the offer since there does not seem to be a lot of movement on it right now.

If someone would like to make a feature request on my GitHub for this, I can start working on it: https://github.com/tmcnag/nagioscore

Again, I understand this is not ideal but I am doing what I can from where I sit.
Former Nagios employee
snapon_admin
Posts: 952
Joined: Mon Jun 10, 2013 10:39 am
Location: Kenosha, WI

Re: Server issues when multiple hosts were down

Post by snapon_admin »

If I have an account for that site and can't remember my password, who would I contact about resetting it? I tried the password reset from the site (twice) and never got the reset email. I'd like to +1 this feature as well.
tmcdonald
Posts: 9117
Joined: Mon Sep 23, 2013 8:40 am

Re: Server issues when multiple hosts were down

Post by tmcdonald »

Are you referring to the tracker site or GitHub? If it's the tracker, PM me your info and I'll take a look.
snapon_admin
Posts: 952
Joined: Mon Jun 10, 2013 10:39 am
Location: Kenosha, WI

Re: Server issues when multiple hosts were down

Post by snapon_admin »

Tracker, sorry. PM sent.
tmcdonald
Posts: 9117
Joined: Mon Sep 23, 2013 8:40 am

Re: Server issues when multiple hosts were down

Post by tmcdonald »

I might have a solution, but I would want to test it on a box other than my own to make sure it works the way you guys are requesting. Would anyone be able to do a short remote session to test? It would require a live recompile of Core, so a backup/test system would be preferred if you have one.
BanditBBS
Posts: 2474
Joined: Tue May 31, 2011 12:57 pm
Location: Scio, OH

Re: Server issues when multiple hosts were down

Post by BanditBBS »

tmcdonald wrote: I might have a solution, but I would want to test it on a box aside from my own to make sure it works how you guys are requesting. Would anyone be able to do a short remote session to test? It would require a live recompile of Core, so if you have a backup/test system that would be preferred.
I have a test/dev install and you could even use my join.me session if you want :) Anything for the all powerful Trevor!

EDIT: I have the patch running in my dev environment and it's AWESOME. It works exactly like a dependency: if the host is not UP, it simply skips the service check and reschedules it for the next time according to the normal check interval.
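To make the described behavior concrete, here is a rough Python model of it. This is not the actual patch (which is a change to Core's C code and is not shown in this thread); `Service`, `handle_due_service` and `run_check` are hypothetical names, and the only Nagios facts assumed are that host state ID 0 means UP and that `check_interval` is counted in `interval_length` units (60 seconds by default).

```python
from dataclasses import dataclass

HOST_UP = 0  # Nagios host state IDs: 0 = UP, 1 = DOWN, 2 = UNREACHABLE


@dataclass
class Service:
    name: str
    check_interval: float    # in interval_length units (minutes by default)
    next_check: float = 0.0  # epoch seconds


def handle_due_service(svc, host_state_id, now, run_check, interval_length=60):
    """Run or skip one due service check; return True if the check ran."""
    ran = False
    if host_state_id == HOST_UP:
        run_check(svc)
        ran = True
    # Either way, schedule the next attempt one normal interval from now,
    # so a skipped check is simply retried at the usual cadence.
    svc.next_check = now + svc.check_interval * interval_length
    return ran
```

In this model a service on a DOWN host never runs its plugin and never changes state; it just keeps getting rescheduled until the host comes back UP.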
tmcdonald
Posts: 9117
Joined: Mon Sep 23, 2013 8:40 am

Re: Server issues when multiple hosts were down

Post by tmcdonald »

Give me some time to clean it up and turn it into a proper patch, then I will post it here and on my GitHub (and the tracker).
WillemDH
Posts: 2320
Joined: Wed Mar 20, 2013 5:49 am
Location: Ghent

Re: Server issues when multiple hosts were down

Post by WillemDH »

Wow. That was fast. Good job, Trevor. So is it something we can enable/disable? And is it update-proof?

Grtz

Willem