Difference between reboot en restart of nagios XI ?

This support forum board is for support questions relating to Nagios XI, our flagship commercial network monitoring solution.
DennisPR
Posts: 149
Joined: Mon May 07, 2012 10:34 am

Difference between reboot en restart of nagios XI ?

Post by DennisPR »

I have a bizar issue since this morning on a Nagios XI 2012R2.3 server running on CentOS 6.4
If I reboot the server there are about 100 hosts from a total of 500 that turn grey (like they are pending) and they have the "notifications disabled" icon
Host.PNG
These hosts are not all in one particular host- or service group but are only part of some hostgroups.
If I click apply confuguration in CCM (without changing anything) Nagios XI reloads and all hosts are back to normal again.
If I reboot again I have the same issue ?
What is the diffirence between rebooting and restarting or reloading Nagios XI ?
What am I missing here ?
You do not have the required permissions to view the files attached to this post.
slansing
Posts: 7698
Joined: Mon Apr 23, 2012 4:28 pm
Location: Travelling through time and space...

Re: Difference between reboot en restart of nagios XI ?

Post by slansing »

That can be partially caused by database corruption, if you are having to re-sync the database to return them to normal. If you are not using a safe reboot the MySql server could be unexpectedly stopping causing this. Please take a look at our database repair document:

http://assets.nagios.com/downloads/nagi ... tabase.pdf

It could also be just because you are restarting the entire server, or the nagios processes.

When you reboot, you are restarting the entire server or VM. When you restart the nagios service, you are doing exactly that. If that's what you are asking. Do these hosts ever change state? Are they being placed in downtime?
DennisPR
Posts: 149
Joined: Mon May 07, 2012 10:34 am

Re: Difference between reboot en restart of nagios XI ?

Post by DennisPR »

I have performed a repair of the database as described in http://assets.nagios.com/downloads/nagi ... tabase.pdf
The repair did not return any erorrs.

I did a reboot of the VM after the repair and I still have the same issue.
Some of the hosts come in a pending state and notifications are disabled.

Logging on to the console and performing the following command still solves the issue : service nagios reload

The hosts are not set for downtime.
It also seems that there are no services linked to these hosts anymore after a reboot
1.PNG
After the service nagios reload it looks like this ?
2.PNG
Any more advice pls ?
You do not have the required permissions to view the files attached to this post.
abrist
Red Shirt
Posts: 8334
Joined: Thu Nov 15, 2012 1:20 pm

Re: Difference between reboot en restart of nagios XI ?

Post by abrist »

As some retention files are not preserved on a system reboot, it may take a while for the summaries to update as everything gets scheduled.

You may have a problem with ndo not starting correctly on reboot if a service nagios restart fixes the issues.
Former Nagios employee
"It is turtles. All. The. Way. Down. . . .and maybe an elephant or two."
VI VI VI - The editor of the Beast!
Come to the Dark Side.
DennisPR
Posts: 149
Joined: Mon May 07, 2012 10:34 am

Re: Difference between reboot en restart of nagios XI ?

Post by DennisPR »

Abrist can you tell me what I need to check after a reboot pls ?
abrist
Red Shirt
Posts: 8334
Joined: Thu Nov 15, 2012 1:20 pm

Re: Difference between reboot en restart of nagios XI ?

Post by abrist »

After the VM is rebooted, how long have you waited for XI to schedule and start checking?
After a reboot, if things do not start working, check the status of:

Code: Select all

service nagios status
service ndo2db status
Former Nagios employee
"It is turtles. All. The. Way. Down. . . .and maybe an elephant or two."
VI VI VI - The editor of the Beast!
Come to the Dark Side.
DennisPR
Posts: 149
Joined: Mon May 07, 2012 10:34 am

Re: Difference between reboot en restart of nagios XI ?

Post by DennisPR »

Hi I've retested after a reboot and waited for 15 minutes.

Code: Select all

[root@myhost ~]# uptime
 17:18:07 up 15 min,  1 user,  load average: 0.34, 0.66, 0.55
[root@myhost ~]# service nagios status
nagios (pid 2871) is running...
[root@myhost ~]# service ndo2db status
ndo2db (pid 2890) is running...
[root@ap-dco67-mon ~]#
Here are some more screenshots :
Idle1.PNG
Idle2.PNG
Idle3.PNG
You do not have the required permissions to view the files attached to this post.
DennisPR
Posts: 149
Joined: Mon May 07, 2012 10:34 am

Re: Difference between reboot en restart of nagios XI ?

Post by DennisPR »

If I click on "See this host in Nagios core" it looks like this
Idle4.PNG
You do not have the required permissions to view the files attached to this post.
abrist
Red Shirt
Posts: 8334
Joined: Thu Nov 15, 2012 1:20 pm

Re: Difference between reboot en restart of nagios XI ?

Post by abrist »

If you restart ndo2db, does the XI frontend start reporting host status and host check information again?

Code: Select all

service ndo2db restart
Former Nagios employee
"It is turtles. All. The. Way. Down. . . .and maybe an elephant or two."
VI VI VI - The editor of the Beast!
Come to the Dark Side.
sreinhardt
-fno-stack-protector
Posts: 4366
Joined: Mon Nov 19, 2012 12:10 pm

Re: Difference between reboot en restart of nagios XI ?

Post by sreinhardt »

Are you using an offloaded mysql database? Are your nagios configs or perf data on a mounted network or san share? Do you have any other performance changes that may be in place?

Code: Select all

ll /usr/local/nagios/var/ | grep cache
grep -i 'cache' /usr/local/nagios/etc/nagios.cfg
Nagios-Plugins maintainer exclusively, unless you have other C language bugs with open-source nagios projects, then I am happy to help! Please pm or use other communication to alert me to issues as I no longer track the forum.
Locked