Several Database and Permission issues.

This support forum board is for support questions relating to Nagios XI, our flagship commercial network monitoring solution.
Locked
argentra
Posts: 16
Joined: Wed Dec 19, 2012 10:41 pm
Location: Singapore
Contact:

Several Database and Permission issues.

Post by argentra »

Hello.

All those issues were encountered using your ready-to-go virtual appliance on either CentOS 6.2 32-bit / 64-bit and CentOS 6.3 32-bit / 64-bit.
We are not using proxy or SSL.

List of issues that occurred during the maintenance, modifications and implementation:

• Nagios MySQL database constantly fails or loses data during implementation of new hosts and services, also after each reboot of the machine Nagios is set in (requires running MySQL repair script each and every time after rebooting the machine)

• NagioSQL database appeared to be losing data as well, also the credentials for some scripts and the database has changed which cause NagioSQL login failure

• PHP execution time has to be increased due to time that is used to write configuration files for the core

• Permissions for some plug-ins or configuration files also change very often without any particular reason or a person involvement

• Permissions for NagioSQL html folder and all the file within the folders has been changed couple a times without a person involvement

• NagioSQL database tend to create duplicate tables which then had to be troubleshooted and removed manually

• Frequently the database wasn’t reflecting changes made in the core or within the server (for example, couple of the devices are appearing on the monitoring list with monitoring for linux machines like CPU, memory or disk usage, but there is nothing in the core and you are basically unable to re-configure it) which causes are very big issue for us

• Operations view from time to time gets corrupted and says "Session has expired"

• Few hosts are monitored from Nagios XI but there are no services / hosts listed in the core (cannot even remove those)

• Some commands are visible on the commands list, but when you check in services those are unavailable which causes corruption in applying the configuration (need to re-add the command each time it happens)

All the issues are related so I didn't see a point to open a topic for each of those.

Thanks.

Szymon Raclawicki
Senior Systems Engineer
Argentra Pte Ltd
slansing
Posts: 7698
Joined: Mon Apr 23, 2012 4:28 pm
Location: Travelling through time and space...

Re: Several Database and Permission issues.

Post by slansing »

This may be quite basic but have you ran the database repair tools?

http://assets.nagios.com/downloads/nagi ... tabase.pdf
argentra
Posts: 16
Joined: Wed Dec 19, 2012 10:41 pm
Location: Singapore
Contact:

Re: Several Database and Permission issues.

Post by argentra »

Ok, but what is the point of running the tools every single time I restart the server?

It doesn't give us enough comfort. And in some cases it just doesn't work. Lets for example just put the "ghost host issue".
It is not in the database and not in the core files. Where is it?

That kind of mystery in most of the times will be very harmful for the business.
The database is getting corrupted simply to often.

I have been aware of some of these issues before, but didn't expect there is so many.
For example the commands that were on the list weren't listed in the service commands.. I have tried everything to resolve this including writing the files etc.
Still didn't help and had to completely remove it and re-add.

I need a very clear reply on how Nagios Team is going to improve and resolve those issues.

P.S.

Can you also move this post to Customer Support?
slansing
Posts: 7698
Joined: Mon Apr 23, 2012 4:28 pm
Location: Travelling through time and space...

Re: Several Database and Permission issues.

Post by slansing »

The database repair suggestion was because you had not mentioned you used it. How are you "restarting" the server, if you do not shut down the machine properly you will get corruption. We recommend using the following command to safely shut down before powering off the box:

Code: Select all

shutdown -h now

Meanwhile we are looking into your other questions regarding this.
scottwilkerson
DevOps Engineer
Posts: 19396
Joined: Tue Nov 15, 2011 3:11 pm
Location: Nagios Enterprises
Contact:

Re: Several Database and Permission issues.

Post by scottwilkerson »

argentra wrote: I have been aware of some of these issues before, but didn't expect there is so many.
For example the commands that were on the list weren't listed in the service commands.. I have tried everything to resolve this including writing the files etc.
Still didn't help and had to completely remove it and re-add.
I am a little confused where in the Nagios XI interface you are referring to. Are you adding commands in Configure -> CCM -> Commands and they aren't showing up in Configure -> CCM -> Services -> Modify -> Check Command ??
Former Nagios employee
Creator:
Human Design Website
Get Your Human Design Chart
argentra
Posts: 16
Joined: Wed Dec 19, 2012 10:41 pm
Location: Singapore
Contact:

Re: Several Database and Permission issues.

Post by argentra »

When I am in CCM editing services the commands are not listed. When I check on command list (interface where you add commands) its there... activated.

It is getting corrupted quite frequently.
scottwilkerson
DevOps Engineer
Posts: 19396
Joined: Tue Nov 15, 2011 3:11 pm
Location: Nagios Enterprises
Contact:

Re: Several Database and Permission issues.

Post by scottwilkerson »

What version of XI are you running?
Former Nagios employee
Creator:
Human Design Website
Get Your Human Design Chart
argentra
Posts: 16
Joined: Wed Dec 19, 2012 10:41 pm
Location: Singapore
Contact:

Re: Several Database and Permission issues.

Post by argentra »

Latest one R1.3.

*Today my Nagios suddenly lost Postgres config (perms) and I had to manually fix it.
scottwilkerson
DevOps Engineer
Posts: 19396
Joined: Tue Nov 15, 2011 3:11 pm
Location: Nagios Enterprises
Contact:

Re: Several Database and Permission issues.

Post by scottwilkerson »

Can you outline the problem you experienced?
Former Nagios employee
Creator:
Human Design Website
Get Your Human Design Chart
Locked