Re: Problem after Nagios restart all host checks completing

Posted: Wed Oct 23, 2013 4:54 am
by bananagios
Hello, and thanks for the answer ;)

I don't get any error messages. The checks are actually being performed (I was wrong about that), but the results are not shown correctly in the web interface.

I did a lot of tests over the last few weeks:
- I turned on nagios.log and did not find any error messages
- I sniffed the traffic with tcpdump and saw both the Nagios requests and the hosts' responses
- The "checkresults" folder seems to be working correctly

The configuration, in terms of scheduled check times and many other parameters, has been checked many times and it is correct.

I solved the problem by splitting the checks across two servers and using check_mk so that I only have to watch one console.

So I think it is true that the data structure (a list) probably creates a performance issue in two places:
- when a new check result is inserted, which is an O(n) operation on the list (a heap could be much better: O(log n), where n is the number of checks in the list)
- when the list is parsed in order to show the checks (again a heap, understood as a priority queue, should be better); the heap performs this operation in constant time O(1) instead of the list's worst case of O(n).
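
To make the comparison above concrete, here is a minimal Python sketch. It is not how Nagios stores its check queue; the check names and due times are made up for illustration. It only shows the general trade-off: inserting into a list kept sorted by due time shifts elements (O(n)), while pushing onto a binary heap is O(log n). Peeking at the next due check is O(1) on the heap, although removing it costs O(log n).

```python
import bisect
import heapq

# Hypothetical scheduled checks as (due_time_seconds, name) tuples.
checks = [(30, "web01/HTTP"), (10, "db01/PING"), (20, "mail01/SMTP")]

sorted_queue = []  # plain list kept sorted by due time
heap_queue = []    # binary heap (priority queue) ordered by due time

for check in checks:
    # O(n): finding the slot is O(log n), but shifting elements is O(n).
    bisect.insort(sorted_queue, check)
    # O(log n): the new element is sifted up through the heap.
    heapq.heappush(heap_queue, check)

# The next due check sits at index 0 in both structures (O(1) lookup).
next_from_list = sorted_queue[0]
next_from_heap = heap_queue[0]

# Removing checks in due-time order: each heappop is O(log n).
drained = [heapq.heappop(heap_queue) for _ in range(len(heap_queue))]

print(next_from_list, next_from_heap)
print(drained)
```

With only a few checks the difference is invisible; it would only matter if the queue held many thousands of entries and insertion really were the bottleneck, which this thread has not confirmed.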

BTW, after splitting the checks across two servers I get great performance 8-)

Bye
Roberto

PS: I hope I have not written too much crap :oops:

Re: Problem after Nagios restart all host checks completing

Posted: Wed Oct 23, 2013 4:49 pm
by abrist
bananagios wrote: PS: I hope I have not written too much crap :oops:
No problem. Interesting issues you bring up. You may want to upgrade to 3.5.1 as there were a number of bug fixes. 4.0 brings much better performance, but it is still a rather young release.

Re: Problem after Nagios restart all host checks completing

Posted: Tue Nov 05, 2013 1:01 pm
by ericfeldhusen
Just curious: I notice my max_service_check_spread is set to 30.

Any reason why I couldn't bump this down to like 5 to 10 minutes?

Eric

Re: Problem after Nagios restart all host checks completing

Posted: Tue Nov 05, 2013 1:02 pm
by ericfeldhusen
I also have my max_host_check_spread set to 30.

I should be able to lower both of these to 5-10 minutes, which should help with my problem.

Eric
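
For a rough feel of what lowering the spread values means, here is a small Python sketch. It is a simplified model, not the actual Nagios scheduler: it assumes the initial checks are spaced evenly across the spread window, and the figure of 3000 checks is a made-up example. Both max_service_check_spread and max_host_check_spread are given in minutes.

```python
def max_inter_check_delay(total_checks, max_spread_minutes):
    """Largest gap in seconds between consecutive initial checks that
    still fits all of them inside the spread window (simplified model)."""
    if total_checks <= 0:
        raise ValueError("total_checks must be positive")
    return (max_spread_minutes * 60) / total_checks


total_checks = 3000  # hypothetical number of scheduled checks

for spread in (30, 10, 5):
    delay = max_inter_check_delay(total_checks, spread)
    print(f"spread={spread:>2} min -> at most {delay:.2f} s between checks")
```

In this model a smaller spread gets every check through its first run sooner after a restart, but it also packs the same number of checks into a shorter window, so the server has to sustain a higher check rate during that period.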

Re: Problem after Nagios restart all host checks completing

Posted: Tue Nov 05, 2013 1:25 pm
by ericfeldhusen
abrist wrote:
bananagios wrote: PS: I hope I have not written too much crap :oops:
No problem. Interesting issues you bring up. You may want to upgrade to 3.5.1 as there were a number of bug fixes. 4.0 brings much better performance, but it is still a rather young release.
Would some of those bug fixes in 3.5.1 address host and service check scheduling?

Eric

Re: Problem after Nagios restart all host checks completing

Posted: Wed Nov 06, 2013 4:50 am
by bananagios
I'm sorry Eric, but I didn't understand what you are asking for :roll:

Re: Problem after Nagios restart all host checks completing

Posted: Wed Nov 06, 2013 10:48 am
by tmcdonald
bananagios wrote: I'm sorry Eric, but I didn't understand what you are asking for :roll:
I believe that was a question for abrist.