Nagios XI keeps crashing post upgrade to XI 2014

This support forum board is for support questions relating to Nagios XI, our flagship commercial network monitoring solution.
abrist
Red Shirt
Posts: 8334
Joined: Thu Nov 15, 2012 1:20 pm

Re: Nagios XI keeps crashing post upgrade to XI 2014

Post by abrist »

I presume you have restarted the server so that the new kernel limits are loaded, correct?
run the following command to potentially fix the postgres vacuum issue:

Code: Select all

echo "vacuum;vacuum analyze;vacuum full;"|psql nagiosxi postgres
Former Nagios employee
"It is turtles. All. The. Way. Down. . . .and maybe an elephant or two."
VI VI VI - The editor of the Beast!
Come to the Dark Side.
User avatar
lmiltchev
Bugs find me
Posts: 13589
Joined: Mon May 23, 2011 12:15 pm

Re: Nagios XI keeps crashing post upgrade to XI 2014

Post by lmiltchev »

What is the Nagios XI version that you upgraded from?
Be sure to check out our Knowledgebase for helpful articles and solutions!
chriscamm
Posts: 72
Joined: Thu Aug 22, 2013 6:12 am

Re: Nagios XI keeps crashing post upgrade to XI 2014

Post by chriscamm »

I have run the vacuum and rebooted again.

I upgraded from XI 2012 - 2012R2.9

Thanks

Chris
User avatar
lmiltchev
Bugs find me
Posts: 13589
Joined: Mon May 23, 2011 12:15 pm

Re: Nagios XI keeps crashing post upgrade to XI 2014

Post by lmiltchev »

Did you see any errors, while running the vacuum on the postgres? Let us know if the problem resurfaces.
Be sure to check out our Knowledgebase for helpful articles and solutions!
chriscamm
Posts: 72
Joined: Thu Aug 22, 2013 6:12 am

Re: Nagios XI keeps crashing post upgrade to XI 2014

Post by chriscamm »

No errors at all while running the vacuum commands

Code: Select all

Jun 10 22:04:35 qualngs ndo2db[2294]: Message sent to queue.
Jun 10 22:04:35 qualngs ndo2db[2294]: Warning: queue send error, retrying...
Jun 10 22:04:36 qualngs ndo2db[2294]: Message sent to queue.
Jun 10 22:04:36 qualngs ndo2db[2294]: Warning: queue send error, retrying...
Jun 10 22:04:37 qualngs ndo2db[2294]: Message sent to queue.
Jun 10 22:04:37 qualngs ndo2db[2294]: Warning: queue send error, retrying...
Jun 10 22:04:38 qualngs ndo2db[2294]: Message sent to queue.
Jun 10 22:04:38 qualngs ndo2db[2294]: Warning: queue send error, retrying...
Jun 10 22:04:39 qualngs ndo2db[2294]: Message sent to queue.
Jun 10 22:04:39 qualngs ndo2db[2294]: Warning: queue send error, retrying...
Jun 10 22:04:40 qualngs ndo2db[2294]: Message sent to queue.
Jun 10 22:04:40 qualngs ndo2db[2294]: Warning: queue send error, retrying...
Jun 10 22:04:41 qualngs ndo2db[2294]: Message sent to queue.
Jun 10 22:04:41 qualngs ndo2db[2294]: Warning: queue send error, retrying...
Jun 10 22:04:42 qualngs ndo2db[2294]: Message sent to queue.
Jun 10 22:04:42 qualngs ndo2db[2294]: Warning: queue send error, retrying...
Jun 10 22:04:43 qualngs ndo2db[2294]: Message sent to queue.
Jun 10 22:04:43 qualngs ndo2db[2294]: Warning: queue send error, retrying...
Jun 10 22:04:44 qualngs ndo2db[2294]: Message sent to queue.
Jun 10 22:04:44 qualngs ndo2db[2294]: Warning: queue send error, retrying...
Jun 10 22:04:45 qualngs ndo2db[2294]: Message sent to queue.
Jun 10 22:04:45 qualngs ndo2db[2294]: Warning: queue send error, retrying...
Jun 10 22:04:46 qualngs ndo2db[2294]: Message sent to queue.
Jun 10 22:04:46 qualngs ndo2db[2294]: Warning: queue send error, retrying...
Jun 10 22:04:47 qualngs ndo2db[2294]: Message sent to queue.
Jun 10 22:04:47 qualngs ndo2db[2294]: Warning: queue send error, retrying...
Jun 10 22:04:48 qualngs ndo2db[2294]: Message sent to queue.
Jun 10 22:04:48 qualngs ndo2db[2294]: Warning: queue send error, retrying...
Errors are back almost straight away.

What is the maximum Kernel settings I can use does anyone know?

Thanks

Chris
sreinhardt
-fno-stack-protector
Posts: 4366
Joined: Mon Nov 19, 2012 12:10 pm

Re: Nagios XI keeps crashing post upgrade to XI 2014

Post by sreinhardt »

Before we keep increasing your kernel message queues, how about we try setting less worker processes. Considering message queues should only be used for inter-process communication, if we limit the number of workers which left to up to core could be as high as 18 ((6 physical cores + 6 hyper-threaded cores) X 1.5 = 18). Plus each worker spawns processes to run the actual checks, thats a whole lot of processes. Try altering the nagios.cfg setting check_workers to check_workers=6. Do a service nagios restart, and see how that gets along.
Nagios-Plugins maintainer exclusively, unless you have other C language bugs with open-source nagios projects, then I am happy to help! Please pm or use other communication to alert me to issues as I no longer track the forum.
chriscamm
Posts: 72
Joined: Thu Aug 22, 2013 6:12 am

Re: Nagios XI keeps crashing post upgrade to XI 2014

Post by chriscamm »

Hi,

I have decreased the

Code: Select all

core_workers=6
max_concurrent=20
Still get

Code: Select all

Jun 12 11:24:00 qualngs ndo2db[2321]: Message sent to queue.
Jun 12 11:24:00 qualngs ndo2db[2321]: Warning: queue send error, retrying...
Jun 12 11:24:01 qualngs ndo2db[2321]: Message sent to queue.
Jun 12 11:24:01 qualngs ndo2db[2321]: Warning: queue send error, retrying...
Thanks

Chris
User avatar
lmiltchev
Bugs find me
Posts: 13589
Joined: Mon May 23, 2011 12:15 pm

Re: Nagios XI keeps crashing post upgrade to XI 2014

Post by lmiltchev »

It seems like we may need to schedule a remote session to further troubleshoot this issue. You can send us an email at [email protected] to open a new ticket in our ticketing system.

BTW, can you run the following command and show us the output?

Code: Select all

/usr/local/nagios/bin/ndo2db | head -2
Be sure to check out our Knowledgebase for helpful articles and solutions!
chriscamm
Posts: 72
Joined: Thu Aug 22, 2013 6:12 am

Re: Nagios XI keeps crashing post upgrade to XI 2014

Post by chriscamm »

Code: Select all

[root@qualngs ~]# /usr/local/nagios/bin/ndo2db | head -2

NDO2DB 2.0.0
[root@qualngs ~]# 
User avatar
lmiltchev
Bugs find me
Posts: 13589
Joined: Mon May 23, 2011 12:15 pm

Re: Nagios XI keeps crashing post upgrade to XI 2014

Post by lmiltchev »

Is a remote session an option for you?
Be sure to check out our Knowledgebase for helpful articles and solutions!
Locked