Stability Issues

This support forum board is for support questions relating to Nagios XI, our flagship commercial network monitoring solution.
syncbak
Posts: 11
Joined: Thu May 08, 2014 10:08 am

Stability Issues

Post by syncbak »

We have been having overall stability issues. We are having to reboot at least once per week, the data base table has crashed on us and it seems to regularly hang.
How can we work together to correct these issues, our renewal is fast approaching and if these are not corrected before then we may as well look for an open source product.

details: Linux nachos2.syncbak.com 2.6.32-504.8.1.el6.x86_64 #1 SMP Wed Jan 28 21:11:36 UTC 2015 x86_64 x86_64 x86_64 GNU/Linux

Sincerely, Mike Cline
abrist
Red Shirt
Posts: 8334
Joined: Thu Nov 15, 2012 1:20 pm

Re: Stability Issues

Post by abrist »

What is the main issue forcing the reboot? Load spikes, lockup, kernel panic, etc?
The database tables can crash if mysql/ndo2db is shutdown improperly.
What version of XI are you running?
Former Nagios employee
"It is turtles. All. The. Way. Down. . . .and maybe an elephant or two."
VI VI VI - The editor of the Beast!
Come to the Dark Side.
syncbak
Posts: 11
Joined: Thu May 08, 2014 10:08 am

Re: Stability Issues

Post by syncbak »

The version is 2014R2.6
What causes us to reboot is when we apply a configuration change within the GUI it hangs forever until it times out. But if we do it from command prompt it works fine every time.
abrist
Red Shirt
Posts: 8334
Joined: Thu Nov 15, 2012 1:20 pm

Re: Stability Issues

Post by abrist »

Are you on centos/rhel 5, 6, or 7?
What is the output of:

Code: Select all

grep manage_services /usr/local/nagiosxi/scripts/restart_nagios_with_export.sh
Former Nagios employee
"It is turtles. All. The. Way. Down. . . .and maybe an elephant or two."
VI VI VI - The editor of the Beast!
Come to the Dark Side.
syncbak
Posts: 11
Joined: Thu May 08, 2014 10:08 am

Re: Stability Issues

Post by syncbak »

version 6.6

sudo $BASEDIR/manage_services.sh restart nagios
jdalrymple
Skynet Drone
Posts: 2620
Joined: Wed Feb 11, 2015 1:56 pm

Re: Stability Issues

Post by jdalrymple »

syncbak wrote: But if we do it from command prompt it works fine every time.
When you say this do you mean just restarting Nagios? The process of restarting Nagios's core component from the command line is just a small portion of what happens during the apply config process performed in XI. There may still be some database issues lingering from the problems you had back in November.
syncbak
Posts: 11
Joined: Thu May 08, 2014 10:08 am

Re: Stability Issues

Post by syncbak »

When the Apply configuration within the GUI fails , we will then run this: /usr/local/nagios/bin/nagios -v /usr/local/nagios/etc/nagios.cfg
and then service Nagios restart
jdalrymple
Skynet Drone
Posts: 2620
Joined: Wed Feb 11, 2015 1:56 pm

Re: Stability Issues

Post by jdalrymple »

Does the apply config process fail every time you try to make any changes in CCM, or is it intermittent?

If you don't ever try to make configuration changes does the stability still suffer?
syncbak
Posts: 11
Joined: Thu May 08, 2014 10:08 am

Re: Stability Issues

Post by syncbak »

Its intermittent

The purpose of this ticket was to resolve our issues, this service requires changing configurations for our use.
jdalrymple
Skynet Drone
Posts: 2620
Joined: Wed Feb 11, 2015 1:56 pm

Re: Stability Issues

Post by jdalrymple »

I understand syncbak, and we do want to resolve your issue. Right now it seems that the problem isn't with the Nagios core process but in the config building process. I was trying to understand if you can at all successfully build configs or if your system breaks every time you try the applyconfig process. One thing that may be useful would be a profile.zip taken around the time that you experience the problem (via PM or ticket, not on the forum). I understand that may be hard to get though if the problem is intermittent. Either way this problem probably should be moved out of the customer support forums and into a support ticket, would you agree? If so please send us an E-mail to [email protected] referencing this forum post.
Locked