Page 1 of 2

Service check alerts not going out

Posted: Fri Feb 07, 2014 10:46 am
by micropact
Hi Everyone,

I've added about 60 hosts to our environment with several service checks on each host. We're getting alerts when a host check doesn't return OK but we're not getting any warning or error alerts on our service checks. The settings for the service checks are the same as for the host checks yet the alerts are just not being sent.

This specifically became a problem today when the transaction log drive on our primary Exchange server filled up completely and the server stopped sending emails. When I go into the configuration settings for the service (monitoring the transaction log drive for disk space) it says that there is a warning and it's set to send out emails. The alerts are just not going out.

Any help would be greatly appreciated because we purchased Nagios to eliminate problems like this specifically.

Thanks!

Nate

Re: Service check alerts not going out

Posted: Fri Feb 07, 2014 3:29 pm
by tmcdonald
Can you post the config of the misbehaving service? Find the service in your CCM and click the icon to the right for "View Text Config", then post here. Be sure to censor any sensitive information.

Re: Service check alerts not going out

Posted: Mon Feb 10, 2014 8:27 am
by micropact
define service {
host_name XX-XX-XX-XX
service_description Drive E: Disk Usage
use xiwizard_windowsserver_nsclient_service
check_command check_xi_service_nsclient!worldgate!USEDDISKSPACE!-l E -w 80 -c 95
max_check_attempts 5
check_interval 5
retry_interval 1
check_period xi_timeperiod_24x7
notification_interval 60
notification_period xi_timeperiod_24x7
contacts nagiosadmin
contact_groups IT Operations
_xiwizard windowsserver
register 1
}

Re: Service check alerts not going out

Posted: Mon Feb 10, 2014 10:42 am
by slansing
Alright, we will also need to see the configuration of one of your example services that is not alerting properly, and the templates/contacts configurations which are attached to it, there are a few places that could be causing problems.

Re: Service check alerts not going out

Posted: Fri Feb 21, 2014 9:02 am
by micropact
Could you please tell me where these files are located on the server?

Re: Service check alerts not going out

Posted: Fri Feb 21, 2014 10:28 am
by tmcdonald
Typically they will be in /usr/local/nagios/etc/ and there are a few directories in their for hosts, services, etc named appropriately.

Re: Service check alerts not going out

Posted: Fri Feb 21, 2014 4:19 pm
by micropact

Code: Select all

###############################################################################
#
# Service configuration file
#
# Created by: Nagios QL Version 3.0.3
# Date:	      2014-02-21 09:52:36
# Version:    Nagios 3.x config file
#
# --- DO NOT EDIT THIS FILE BY HAND --- 
# Nagios QL will overwite all manual settings during the next update
#
###############################################################################

define service {
	host_name			HQ-MP-MAIL-01
	service_description		CPU Usage
	use				xiwizard_windowsserver_nsclient_service
	check_command			check_xi_service_nsclient!worldgate!CPULOAD!-l 5,80,90
	max_check_attempts		5
	check_interval			5
	retry_interval			1
	check_period			xi_timeperiod_24x7
	notification_interval		60
	notification_period		xi_timeperiod_24x7
	contacts			nagiosadmin
	contact_groups			IT Operations
	_xiwizard			windowsserver
	register			1
	}	

define service {
	host_name			HQ-MP-MAIL-01
	service_description		CPU Usage
	use				xiwizard_windowsserver_nsclient_service
	check_command			check_xi_service_nsclient!worldgate!CPULOAD!-l 5,80,90
	max_check_attempts		5
	check_interval			5
	retry_interval			1
	check_period			xi_timeperiod_24x7
	notification_interval		60
	notification_period		xi_timeperiod_24x7
	contacts			nagiosadmin
	contact_groups			IT Operations
	_xiwizard			windowsserver
	register			1
	}	

define service {
	host_name			HQ-MP-MAIL-01
	service_description		Drive C: Disk Usage
	use				xiwizard_windowsserver_nsclient_service
	check_command			check_xi_service_nsclient!worldgate!USEDDISKSPACE!-l C -w 80 -c 95
	max_check_attempts		5
	check_interval			5
	retry_interval			1
	check_period			xi_timeperiod_24x7
	notification_interval		60
	notification_period		xi_timeperiod_24x7
	contacts			nagiosadmin
	contact_groups			IT Operations
	_xiwizard			windowsserver
	register			1
	}	

define service {
	host_name			HQ-MP-MAIL-01
	service_description		Drive C: Disk Usage
	use				xiwizard_windowsserver_nsclient_service
	check_command			check_xi_service_nsclient!worldgate!USEDDISKSPACE!-l C -w 80 -c 95
	max_check_attempts		5
	check_interval			5
	retry_interval			1
	check_period			xi_timeperiod_24x7
	notification_interval		60
	notification_period		xi_timeperiod_24x7
	contacts			nagiosadmin
	contact_groups			IT Operations
	_xiwizard			windowsserver
	register			1
	}	

define service {
	host_name			HQ-MP-MAIL-01
	service_description		Drive D: Disk Usage
	use				xiwizard_windowsserver_nsclient_service
	check_command			check_xi_service_nsclient!worldgate!USEDDISKSPACE!-l D -w 80 -c 95
	max_check_attempts		5
	check_interval			5
	retry_interval			1
	check_period			xi_timeperiod_24x7
	notification_interval		60
	notification_period		xi_timeperiod_24x7
	contacts			nagiosadmin
	contact_groups			IT Operations
	_xiwizard			windowsserver
	register			1
	}	

define service {
	host_name			HQ-MP-MAIL-01
	service_description		Drive E: Disk Usage
	use				xiwizard_windowsserver_nsclient_service
	check_command			check_xi_service_nsclient!worldgate!USEDDISKSPACE!-l E -w 80 -c 95
	max_check_attempts		5
	check_interval			5
	retry_interval			1
	check_period			xi_timeperiod_24x7
	notification_interval		60
	notification_period		xi_timeperiod_24x7
	contacts			nagiosadmin
	contact_groups			IT Operations
	_xiwizard			windowsserver
	register			1
	}	

define service {
	host_name			HQ-MP-MAIL-01
	service_description		Drive G: Disk Usage
	use				xiwizard_windowsserver_nsclient_service
	check_command			check_xi_service_nsclient!worldgate!USEDDISKSPACE!-l G -w 80 -c 95
	max_check_attempts		5
	check_interval			5
	retry_interval			1
	check_period			xi_timeperiod_24x7
	notification_interval		60
	notification_period		xi_timeperiod_24x7
	contacts			nagiosadmin
	contact_groups			IT Operations
	_xiwizard			windowsserver
	register			1
	}	

define service {
	host_name			HQ-MP-MAIL-01
	service_description		Exchange Core Services
	use				xiwizard_exchange_service
	check_command			check_xi_service_nsclient!worldgate!SERVICESTATE!-l MSExchangeADTopology,MSExchangeAntispamUpdate,MSExchangeEdgeSync,MSExchangeFDS,MSExchangeImap4,MSExchangeIS,MSExchangeMailboxAssistants,MSExchangeMailSubmission,MSExchangeMonangePop3,MSExchangeRepl,MSExchangeSA,MSExchangeSearch,MSExchangeServiceHost,MSExchangeTransport,MSExchangeTransportLogSearch,msftesql-Exchange -d SHOWALL
	max_check_attempts		5
	check_interval			5
	retry_interval			1
	check_period			xi_timeperiod_24x7
	notification_interval		60
	notification_period		xi_timeperiod_24x7
	contacts			nagiosadmin
	contact_groups			IT Operations
	_xiwizard			exchange
	register			1
	}	

define service {
	host_name			HQ-MP-MAIL-01
	service_description		Exchange Web Services
	use				xiwizard_exchange_service
	check_command			check_xi_service_nsclient!worldgate!SERVICESTATE!-l W3SVC -d SHOWALL
	max_check_attempts		5
	check_interval			5
	retry_interval			1
	check_period			xi_timeperiod_24x7
	notification_interval		60
	notification_period		xi_timeperiod_24x7
	contacts			nagiosadmin
	contact_groups			IT Operations
	_xiwizard			exchange
	register			1
	}	

define service {
	host_name			HQ-MP-MAIL-01
	service_description		Memory Usage
	use				xiwizard_windowsserver_nsclient_service
	check_command			check_xi_service_nsclient!worldgate!MEMUSE!-w 80 -c 90
	max_check_attempts		5
	check_interval			5
	retry_interval			1
	check_period			xi_timeperiod_24x7
	notification_interval		60
	notification_period		xi_timeperiod_24x7
	contacts			nagiosadmin
	contact_groups			IT Operations
	_xiwizard			windowsserver
	register			1
	}	

define service {
	host_name			HQ-MP-MAIL-01
	service_description		Memory Usage
	use				xiwizard_windowsserver_nsclient_service
	check_command			check_xi_service_nsclient!worldgate!MEMUSE!-w 80 -c 90
	max_check_attempts		5
	check_interval			5
	retry_interval			1
	check_period			xi_timeperiod_24x7
	notification_interval		60
	notification_period		xi_timeperiod_24x7
	contacts			nagiosadmin
	contact_groups			IT Operations
	_xiwizard			windowsserver
	register			1
	}	

define service {
	host_name			HQ-MP-MAIL-01
	service_description		Messages Pending Routing
	use				xiwizard_windowsserver_nsclient_service
	check_command			check_xi_service_nsclient!worldgate!COUNTER!-l "\\SMTP Server(_Total)\\Messages Pending Routing" -w 25 -c 100
	max_check_attempts		5
	check_interval			5
	retry_interval			1
	check_period			xi_timeperiod_24x7
	notification_interval		60
	notification_period		xi_timeperiod_24x7
	contacts			nagiosadmin
	contact_groups			IT Operations
	_xiwizard			exchange
	register			1
	}	

define service {
	host_name			HQ-MP-MAIL-01
	service_description		OWA HTTPS
	use				xiwizard_exchange_service
	check_command			check_xi_service_http! -H mail.micropact.com -f ok -I hq-mp-mail-01 -u "/owa/" -S
	max_check_attempts		5
	check_interval			5
	retry_interval			1
	check_period			xi_timeperiod_24x7
	notification_interval		60
	notification_period		xi_timeperiod_24x7
	contacts			nagiosadmin
	contact_groups			IT Operations
	_xiwizard			exchange
	register			1
	}	

define service {
	host_name			HQ-MP-MAIL-01
	service_description		Ping
	use				xiwizard_windowsserver_ping_service
	max_check_attempts		5
	check_interval			5
	retry_interval			1
	check_period			xi_timeperiod_24x7
	notification_interval		60
	notification_period		xi_timeperiod_24x7
	contacts			nagiosadmin
	contact_groups			IT Operations
	_xiwizard			windowsserver
	register			1
	}	

define service {
	host_name			HQ-MP-MAIL-01
	service_description		Ping
	use				xiwizard_windowsserver_ping_service
	max_check_attempts		5
	check_interval			5
	retry_interval			1
	check_period			xi_timeperiod_24x7
	notification_interval		60
	notification_period		xi_timeperiod_24x7
	contacts			nagiosadmin
	contact_groups			IT Operations
	_xiwizard			windowsserver
	register			1
	}	

define service {
	host_name			HQ-MP-MAIL-01
	service_description		Remote Queue Length
	use				xiwizard_windowsserver_nsclient_service
	check_command			check_xi_service_nsclient!worldgate!COUNTER!-l "\\SMTP Server(_Total)\\Remote Queue Length" -w 25 -c 50
	max_check_attempts		5
	check_interval			5
	retry_interval			1
	check_period			xi_timeperiod_24x7
	notification_interval		60
	notification_period		xi_timeperiod_24x7
	contacts			nagiosadmin
	contact_groups			IT Operations
	_xiwizard			exchange
	register			1
	}	

define service {
	host_name			HQ-MP-MAIL-01
	service_description		SMTP
	use				xiwizard_smtp_service
	max_check_attempts		5
	check_interval			5
	retry_interval			1
	check_period			xi_timeperiod_24x7
	notification_interval		60
	notification_period		xi_timeperiod_24x7
	contacts			nagiosadmin
	contact_groups			IT Operations
	_xiwizard			exchange
	register			1
	}	

define service {
	host_name			HQ-MP-MAIL-01
	service_description		Uptime
	use				xiwizard_windowsserver_nsclient_service
	check_command			check_xi_service_nsclient!worldgate!UPTIME
	max_check_attempts		5
	check_interval			5
	retry_interval			1
	check_period			xi_timeperiod_24x7
	notification_interval		60
	notification_period		xi_timeperiod_24x7
	contacts			nagiosadmin
	contact_groups			IT Operations
	_xiwizard			windowsserver
	register			1
	}	

define service {
	host_name			HQ-MP-MAIL-01
	service_description		Uptime
	use				xiwizard_windowsserver_nsclient_service
	check_command			check_xi_service_nsclient!worldgate!UPTIME
	max_check_attempts		5
	check_interval			5
	retry_interval			1
	check_period			xi_timeperiod_24x7
	notification_interval		60
	notification_period		xi_timeperiod_24x7
	contacts			nagiosadmin
	contact_groups			IT Operations
	_xiwizard			windowsserver
	register			1
	}	

###############################################################################
#
# Service configuration file
#
# END OF FILE
#
###############################################################################
Moderator note: Please put code tags around your long output like I have done to your post

Re: Service check alerts not going out

Posted: Mon Feb 24, 2014 11:13 am
by lmiltchev
These are only service definitions... Please, post the servicetemplates.cfg and the definitions of "nagiosadmin" and "IT Operations".

Re: Service check alerts not going out

Posted: Mon Feb 24, 2014 11:19 am
by micropact
OK, where do I get the configuration for nagiosadmin and IT Operations?

Re: Service check alerts not going out

Posted: Mon Feb 24, 2014 11:20 am
by micropact
nagiosadmin:
define contact {
contact_name nagiosadmin
alias Nagios Administrator
host_notification_period nagiosadmin_notification_times
service_notification_period nagiosadmin_notification_times
host_notification_options d,u,r,f,s
service_notification_options w,u,c,r,f,s
host_notification_commands xi_host_notification_handler
service_notification_commands xi_service_notification_handler
email nagios@localhost
use xi_contact_generic
}