Nagios Core roadmap query
Posted: Fri Jan 22, 2016 8:57 am
Actual question at the end... but attempting to describe the scenario so that the picture is (hopefully) clearer.
Last year, a number of my systems running nagios had a problem with choked worker processes.
[all systems running Centos 6]
I found a match in http://tracker.nagios.org/view.php?id=642 and as detailed in the comments, the fix provided appears to have solved my problem.
In order to test this I was using a maintenance version of nagios, identifying itself as Nagios Core 4.1.0rc3
This was fine... except that every so often, the monitoring would stop.... and looking at the nagios controller process it appeared to be clocking up CPU time... and therefore I assumed it was looping somewhere.
Given that this was a maintenance version, I didn't report the problem.
Stopping and restarting nagios solved the problem.... until the next time.
Unfortunately, the "next time" was becoming more and more frequent.
Having reported that the fix appeared to work.... I had asked if the fix was included in 4.1.1... which it isn't... it is scheduled to be part of 4.1.2
Given that, the "hang" was becoming more frequent, and I had a "work around" for the choking problem (increase the number of workers), I opted to upgrade to 4.1.1
Unfortunately, I'm now up to 100s of worker processes.... which is taking a considerable number of resources.
Ideally, what I want is 4.1.2... which I note is no longer in beta....
I have downloaded, and built 4.1.2-Pre1 - and upgraded a couple of "less critical" servers... It seems good at present.... although of course, "less critical" also means "lower load" !!
So.... what I would like to know is....
- when is the official release of 4.1.2 expected ? (estimates greatly appreciated...)
Basically, I want to know, so that I can decide whether to wait for 4.1.2 ... or if I should be brave and roll 4.1.2-Pre1 out to some heavier loaded hosts.
Please advise, Malcolm
Last year, a number of my systems running nagios had a problem with choked worker processes.
[all systems running Centos 6]
I found a match in http://tracker.nagios.org/view.php?id=642 and as detailed in the comments, the fix provided appears to have solved my problem.
In order to test this I was using a maintenance version of nagios, identifying itself as Nagios Core 4.1.0rc3
This was fine... except that every so often, the monitoring would stop.... and looking at the nagios controller process it appeared to be clocking up CPU time... and therefore I assumed it was looping somewhere.
Given that this was a maintenance version, I didn't report the problem.
Stopping and restarting nagios solved the problem.... until the next time.
Unfortunately, the "next time" was becoming more and more frequent.
Having reported that the fix appeared to work.... I had asked if the fix was included in 4.1.1... which it isn't... it is scheduled to be part of 4.1.2
Given that, the "hang" was becoming more frequent, and I had a "work around" for the choking problem (increase the number of workers), I opted to upgrade to 4.1.1
Unfortunately, I'm now up to 100s of worker processes.... which is taking a considerable number of resources.
Ideally, what I want is 4.1.2... which I note is no longer in beta....
I have downloaded, and built 4.1.2-Pre1 - and upgraded a couple of "less critical" servers... It seems good at present.... although of course, "less critical" also means "lower load" !!
So.... what I would like to know is....
- when is the official release of 4.1.2 expected ? (estimates greatly appreciated...)
Basically, I want to know, so that I can decide whether to wait for 4.1.2 ... or if I should be brave and roll 4.1.2-Pre1 out to some heavier loaded hosts.
Please advise, Malcolm