Monitoring Wizard - Configuration verification failed

Posted: Fri Aug 26, 2016 6:13 pm
by gbnag
Hello,

We installed the Nagios XI Linux agent on Ubuntu 14.04. We get a 'Configuration verification failed' message after we finish going through the Linux Server Monitoring Wizard.

I verified that the Nagios server can communicate with the target Ubuntu system.
-------------------------------------------------------------------------------------------

[root@localhost-010049098179 ~]# nmap -p5666 gberger-ubuntu.qualcomm.com

Starting Nmap 6.47 ( http://nmap.org ) at 2016-08-26 16:05 PDT
Nmap scan report for gberger-ubuntu.qualcomm.com (x.x.x.x)
Host is up (0.011s latency).
PORT STATE SERVICE
5666/tcp open nrpe

Nmap done: 1 IP address (1 host up) scanned in 0.09 seconds
-----------------------------------------------------------------------------------

I also verified that CCM > Tools > Write Config Files reports no errors:

Write host configurations ...
Configuration file: gberger-ubuntu.qualcomm.com.cfg successfully written!
WARNING! Ghost host config detected! Removing the file oum-master.qualcomm.com.cfg
Host configuration files successfully written!

Write service configurations ...
Configuration file: gberger-ubuntu.qualcomm.com.cfg successfully written!
Service configuration files successfully written!


Please advise

Thank you,

Re: Monitoring Wizard - Configuration verification failed

Posted: Sun Aug 28, 2016 8:28 pm
by Box293
Can you run the following in an SSH session on your Nagios XI host?

Code:

tail -f /usr/local/nagiosxi/var/cmdsubsys.log

Then go into Nagios XI and click Apply Configuration.

What output is produced in the SSH session?

Re: Monitoring Wizard - Configuration verification failed

Posted: Mon Aug 29, 2016 1:55 pm
by gbnag
I reran the Linux wizard and recorded cmdsubsys.log; the log file is attached.

Thank you.

Re: Monitoring Wizard - Configuration verification failed

Posted: Mon Aug 29, 2016 4:28 pm
by tgriep
Below are the errors that are causing the verification failure.

Code:

Error: Could not find any host matching 'oum-master.qualcomm.com' (config file '/usr/local/nagios/etc/services/oum-master.qualcomm.com.cfg', starting on line 32)
Error: Failed to expand host list 'oum-master.qualcomm.com' for service '_um_ URL Status' (/usr/local/nagios/etc/services/oum-master.qualcomm.com.cfg:32)

If you go into Core Config Manager, edit the service called '_um_ URL Status', and add a host to it, that should fix it for you.
If you don't need that service check, you can delete it instead, which would fix the error as well.
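
To see which config files still mention the missing host before editing in CCM, something like the following could be run on the Nagios XI server. This is only a minimal sketch: find_host_refs is a hypothetical helper name, and /usr/local/nagios/etc is assumed as the config root because that is where the error messages above point.

```shell
#!/bin/sh
# Hypothetical helper: list config files under a directory that mention a host name.
# Usage: find_host_refs HOSTNAME [CONFIG_DIR]
find_host_refs() {
    host="$1"
    # Assumed default config root, taken from the paths in the error output.
    dir="${2:-/usr/local/nagios/etc}"
    # -r recurse, -l print file names only, -F match the host name as a fixed string
    grep -rlF -- "$host" "$dir"
}
```

For example, `find_host_refs oum-master.qualcomm.com` would be expected to print the services file named in the errors if it still exists. After fixing or deleting the service in CCM, Apply Configuration runs the verification again.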

Re: Monitoring Wizard - Configuration verification failed

Posted: Mon Aug 29, 2016 7:26 pm
by gbnag
That worked, thank you.

Now that the Ubuntu system is showing in Nagios XI, we would like to add some process (daemon) monitors. The processes have the same name but run with different arguments.
I remember that during our quickstart session we could not get it to discover them, due to the issue we just resolved.
What would be the correct sequence of steps to add these process monitors so that they capture availability, CPU, and memory utilization stats per process?

Thanks in advance.

Re: Monitoring Wizard - Configuration verification failed

Posted: Mon Aug 29, 2016 7:54 pm
by Box293
Can you give us an example? What are you doing in the wizard? Does the wizard fail?

Re: Monitoring Wizard - Configuration verification failed

Posted: Mon Aug 29, 2016 8:03 pm
by gbnag
Actually, we can now see the Ubuntu devices in Nagios XI, but all checks are red; see the attached CAPTURE.PNG.

Re: Monitoring Wizard - Configuration verification failed

Posted: Mon Aug 29, 2016 8:19 pm
by Box293
Can you show us the service details of one of the services?

Also, go into CCM > Monitoring > Services.
Click the disk icon and paste that service's text config here.

Re: Monitoring Wizard - Configuration verification failed

Posted: Tue Aug 30, 2016 11:43 am
by gbnag
Attached is the services screenshot. Below is the config from the disk icon for the Ubuntu system:

Code:

###############################################################################
#
# Service configuration file
#
# Created by: Nagios Core Config Manager 2.5.2
# Date:	      2016-08-30 09:41:59
# Version:    Nagios 3.x config file
#
# --- DO NOT EDIT THIS FILE BY HAND --- 
# Nagios CCM will overwrite all manual settings during the next update if you 
# would like to edit files manually, place them in the 'static' directory or 
# import your configs into the CCM by placing them in the 'import' directory.
#
###############################################################################

define service {
	host_name			dmg-dev.qualcomm.com
	service_description		/ Disk Usage
	use				xiwizard_nrpe_service
	check_command			check_nrpe!check_disk!-a '-w 20% -c 10% -p /'
	max_check_attempts		5
	check_interval			5
	retry_interval			1
	check_period			xi_timeperiod_24x7
	notification_interval		60
	notification_period		xi_timeperiod_24x7
	contacts			nagiosadmin
	_xiwizard			linux-server
	register			1
	}	

define service {
	host_name			dmg-dev.qualcomm.com
	service_description		APT Updates
	use				xiwizard_nrpe_service
	check_command			check_nrpe!check_apt!-a '-U'
	max_check_attempts		5
	check_interval			5
	retry_interval			1
	check_period			xi_timeperiod_24x7
	notification_interval		60
	notification_period		xi_timeperiod_24x7
	contacts			nagiosadmin
	_xiwizard			linux-server
	register			1
	}	

define service {
	host_name			dmg-dev.qualcomm.com
	service_description		CPU Stats
	use				xiwizard_nrpe_service
	check_command			check_nrpe!check_cpu_stats!-a '-w 85 -c 95'
	max_check_attempts		5
	check_interval			5
	retry_interval			1
	check_period			xi_timeperiod_24x7
	notification_interval		60
	notification_period		xi_timeperiod_24x7
	contacts			nagiosadmin
	_xiwizard			linux-server
	register			1
	}	

define service {
	host_name			dmg-dev.qualcomm.com
	service_description		Cron Scheduling Daemon
	use				xiwizard_nrpe_service
	check_command			check_nrpe!check_init_service!-a 'cron'
	max_check_attempts		5
	check_interval			5
	retry_interval			1
	check_period			xi_timeperiod_24x7
	notification_interval		60
	notification_period		xi_timeperiod_24x7
	contacts			nagiosadmin
	_xiwizard			linux-server
	register			1
	}	

define service {
	host_name			dmg-dev.qualcomm.com
	service_description		Load
	use				xiwizard_nrpe_service
	check_command			check_nrpe!check_load!-a '-w 15,10,5 -c 30,20,10'
	max_check_attempts		5
	check_interval			5
	retry_interval			1
	check_period			xi_timeperiod_24x7
	notification_interval		60
	notification_period		xi_timeperiod_24x7
	contacts			nagiosadmin
	_xiwizard			linux-server
	register			1
	}	

define service {
	host_name			dmg-dev.qualcomm.com
	service_description		Memory Usage
	use				xiwizard_nrpe_service
	check_command			check_nrpe!check_mem!-a '-w 20 -c 10'
	max_check_attempts		5
	check_interval			5
	retry_interval			1
	check_period			xi_timeperiod_24x7
	notification_interval		60
	notification_period		xi_timeperiod_24x7
	contacts			nagiosadmin
	_xiwizard			linux-server
	register			1
	}	

define service {
	host_name			dmg-dev.qualcomm.com
	service_description		Node Javascript process
	use				xiwizard_nrpe_service
	check_command			check_nrpe!check_services!-a 'node'
	max_check_attempts		5
	check_interval			5
	retry_interval			1
	check_period			xi_timeperiod_24x7
	notification_interval		60
	notification_period		xi_timeperiod_24x7
	contacts			nagiosadmin
	_xiwizard			linux-server
	register			1
	}	

define service {
	host_name			dmg-dev.qualcomm.com
	service_description		Open Files
	use				xiwizard_nrpe_service
	check_command			check_nrpe!check_open_files!-a '-w 30 -c 50'
	max_check_attempts		5
	check_interval			5
	retry_interval			1
	check_period			xi_timeperiod_24x7
	notification_interval		60
	notification_period		xi_timeperiod_24x7
	contacts			nagiosadmin
	_xiwizard			linux-server
	register			1
	}	

define service {
	host_name			dmg-dev.qualcomm.com
	service_description		Ping
	use				xiwizard_linuxserver_ping_service
	max_check_attempts		5
	check_interval			5
	retry_interval			1
	check_period			xi_timeperiod_24x7
	notification_interval		60
	notification_period		xi_timeperiod_24x7
	contacts			nagiosadmin
	_xiwizard			linux-server
	register			1
	}	

define service {
	host_name			dmg-dev.qualcomm.com
	service_description		SSH Server
	use				xiwizard_nrpe_service
	check_command			check_nrpe!check_init_service!-a 'ssh'
	max_check_attempts		5
	check_interval			5
	retry_interval			1
	check_period			xi_timeperiod_24x7
	notification_interval		60
	notification_period		xi_timeperiod_24x7
	contacts			nagiosadmin
	_xiwizard			linux-server
	register			1
	}	

define service {
	host_name			dmg-dev.qualcomm.com
	service_description		Swap Usage
	use				xiwizard_nrpe_service
	check_command			check_nrpe!check_swap!-a '-w 50 -c 20'
	max_check_attempts		5
	check_interval			5
	retry_interval			1
	check_period			xi_timeperiod_24x7
	notification_interval		60
	notification_period		xi_timeperiod_24x7
	contacts			nagiosadmin
	_xiwizard			linux-server
	register			1
	}	

define service {
	host_name			dmg-dev.qualcomm.com
	service_description		Total Processes
	use				xiwizard_nrpe_service
	check_command			check_nrpe!check_procs!-a '-w 150 -c 250'
	max_check_attempts		5
	check_interval			5
	retry_interval			1
	check_period			xi_timeperiod_24x7
	notification_interval		60
	notification_period		xi_timeperiod_24x7
	contacts			nagiosadmin
	_xiwizard			linux-server
	register			1
	}	

define service {
	host_name			dmg-dev.qualcomm.com
	service_description		Users
	use				xiwizard_nrpe_service
	check_command			check_nrpe!check_users!-a '-w 5 -c 10'
	max_check_attempts		5
	check_interval			5
	retry_interval			1
	check_period			xi_timeperiod_24x7
	notification_interval		60
	notification_period		xi_timeperiod_24x7
	contacts			nagiosadmin
	_xiwizard			linux-server
	register			1
	}	

###############################################################################
#
# Service configuration file
#
# END OF FILE

Re: Monitoring Wizard - Configuration verification failed

Posted: Tue Aug 30, 2016 2:41 pm
by rkennedy
Usually this is related to permissions. Please manually run a few checks against the client machine from the CLI, and while doing so, look at /var/log/messages on the client machine. What error do you see returned?
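
As a rough illustration of running one of the checks above by hand from the Nagios XI server, here is a minimal sketch. It assumes check_nrpe is installed at /usr/local/nagios/libexec/check_nrpe (adjust CHECK_NRPE if yours lives elsewhere); run_nrpe_check is a hypothetical helper name, and the arguments mirror the check_command lines in the posted config.

```shell
#!/bin/sh
# Assumed default location of check_nrpe on the Nagios XI server; override if needed.
CHECK_NRPE="${CHECK_NRPE:-/usr/local/nagios/libexec/check_nrpe}"

# Hypothetical helper: run one NRPE check by hand and report the plugin exit code.
# Usage: run_nrpe_check HOST COMMAND [ARGS]
run_nrpe_check() {
    host="$1"
    cmd="$2"
    args="$3"
    if [ -n "$args" ]; then
        # ARGS is passed as a single quoted string after -a, as in the service definitions.
        "$CHECK_NRPE" -H "$host" -c "$cmd" -a "$args"
    else
        "$CHECK_NRPE" -H "$host" -c "$cmd"
    fi
    rc=$?
    # Plugin exit codes: 0 OK, 1 WARNING, 2 CRITICAL, 3 UNKNOWN
    echo "exit code: $rc"
    return "$rc"
}
```

For instance, `run_nrpe_check dmg-dev.qualcomm.com check_disk '-w 20% -c 10% -p /'` corresponds to the "/ Disk Usage" service. On Ubuntu the NRPE daemon usually logs to /var/log/syslog rather than /var/log/messages, so that may be the file to watch on the client.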