Page 1 of 1

Nagios Engine not Starting

Posted: Mon Feb 20, 2017 6:37 am
by venturegardengroup
Hello,

My Nagios Engine refused to start... I think I started having this issue once I tried to add a new nagios user. See error screenshots attached:

Regards,

Re: Nagios Engine not Starting

Posted: Mon Feb 20, 2017 12:01 pm
by avandemore
Do you have that contact defined?

Please try this:
XI > Configure > Core Config Manager > Config File Management > Delete Files > Write Configs > Verify Files > Restart Nagios Core
Older Nagios Versions:
XI > Configure > Core Config Manager > Write Config Files > Delete > Write > Verify > Restart

Re: Nagios Engine not Starting

Posted: Wed Feb 22, 2017 8:14 am
by venturegardengroup
Please see output attached from following the steps you recommended.

Regards.


NagiosXI version 5.2.3
CentOS 7, 64bits
32 or 64bit?
Manual Install

Re: Nagios Engine not Starting

Posted: Wed Feb 22, 2017 8:15 am
by venturegardengroup
Missed the attachment. Please find.

Re: Nagios Engine not Starting

Posted: Wed Feb 22, 2017 11:08 am
by avandemore
It's saying you have a host group referencing a non-existent host called Bandwidth_Mgr_Svr. Is that true? If so you should remove the reference. Otherwise you can revert to a previous known good snapshot if you wish.

Re: Nagios Engine not Starting

Posted: Thu Feb 23, 2017 4:17 am
by venturegardengroup
Thanks,

Please can you give me a guide on how to remove the reference?

Thanks.

Re: Nagios Engine not Starting

Posted: Thu Feb 23, 2017 11:06 am
by avandemore
XI > Configure > CCM > Host Groups > [Host group] ] > Manage Hosts. One of the hosts listed is called Bandwidth_Mgr_Svr, it should be removed.

Also you could attach or PM XI > Configure > CCM > Configuration Snapshots > most recent Config Error snapshot if you want confirmation of exactly where the change needs to happen.

Re: Nagios Engine not Starting

Posted: Thu Feb 23, 2017 11:15 am
by ssax
First, go to Configure > Core Config Manager > Hosts and validate whether that host actually exists in the CCM.

If it's disabled, activate it, and then try to apply configuration.

If it's disabled and/or missing and you don't want to add it back in, please go to Configure > Core Config Manager > Tools > Write Config Files
- Click Delete (don't worry, it's safe, they get re-written)
- Click Write
- Now go edit your /usr/local/nagios/etc/hostgroups.cfg file and search for the host Bandwidth_Mgr_Svr and note the name(s) of the hostgroups that it's in.
- Go to Configure > Core Config Manager > Host Groups
- Edit each one of the hostgroups that you noted above, click the Manage Hosts button, either remove the host from the right hand side OR if it's not listed on the right, just click the Close button, then click the Save button.
- This will force the backend DB info to be rewritten for those hostgroups
- Now try to apply configuration again and let us know the results.

If you would like help determining it, we need to get the files in a bad state so that we can find all of the issues, please perform these steps without any additional steps (like applying config, etc) in between.

Please go to Configure > Core Config Manager > Tools > Write Config Files
- Click Delete (don't worry, it's safe, they get re-written)
- Click Write
- Now zip up your entire /usr/local/nagios/etc directory and PM it to one of us so that we can review it.

Code: Select all

zip -r /tmp/NAGIOSCONFIGS.zip /usr/local/nagios/etc/

Thank you

Re: Nagios Engine not Starting

Posted: Fri Feb 24, 2017 4:11 am
by venturegardengroup
I couldn't find Bandwidth_Mgr_Svr in any of the hostgroups, however, I have attached the most recent config snapshot and the zip file for your analysis and help.

Thanks.

Mod Edit: Files received and placed on shared drive

Re: Nagios Engine not Starting

Posted: Fri Feb 24, 2017 12:31 pm
by ssax
If it's disabled and/or missing and you don't want to add it back in, please go to Configure > Core Config Manager > Tools > Write Config Files
- Click Delete (don't worry, it's safe, they get re-written)
- Click Write
- Click the Verify button and post the entire output

Then run this tail command (and let it run):

Code: Select all

tail -F /usr/local/nagiosxi/var/cmdsubsys.log
Then apply configuration and send the full output of the tail command after it fails.


Thank you