Page 1 of 2

fs_/usr service escalation issue

Posted: Sat Feb 02, 2013 8:17 pm
by nunezric
Hi,

Our Nagios service escalation for fs_/usr service is not working. We believe it should be related to the “/” character.

mtl1log02 server belongs to “linux-servers” host group.

Parts of /etc/check_mk/main.mk:

all_hosts = [
(…)
"mtl1log02|linux|linux-servers",
(..)
]
(…)
host_groups = [
(…)
( 'linux-servers', [ 'linux-servers' ], ALL_HOSTS ),
(…)
]
(…)

I defined a service escalation for this host group (file /usr/local/nagios/etc/serviceescalations.cfg):

define serviceescalation {
hostgroup_name linux-servers
service_description adinfo,CPU load,fs_/,fs_/boot,fs_/tmp,fs_/usr,fs_/usr/export/home,fs_/var,Memory used,ypwhich
contacts service-now
first_notification 1
last_notification 1
notification_interval 1440
escalation_period 24x7
escalation_options c,
}

These services were added by hand as using the Nagios Core in order to define the escalation.

/usr/local/nagios/etc/services/adinfo.cfg
define service {
service_description adinfo
display_name adinfo
register 1
}

/usr/local/nagios/etc/services/fs_usr.cfg
define service {
service_description fs_/usr
register 1
}


The critical adinfo problem was escalated, but not the fs_/usr issue. We believe it should be related to the “/” character.

Ricardo


From: Nagios XI MTL [mailto:[email protected]]
Sent: Friday, February 01, 2013 3:52 PM
To: Ricardo Nunez
Subject: PROBLEM Service Alert - mtl1log02/adinfo is CRITICAL

***** Nagios XI Alert *****

** This is an escalated notification ** Nagios has detected a problem with this service.

Notification Type: PROBLEM

Service: adinfo
Host: mtl1log02
Address: 134.87.177.115
State: CRITICAL
Info:
Critical. Connected to nombre
Date/Time: 2013-02-01 20:52:22
Location: Montreal

Respond: http://nagiosxi-mtl.pmc-sierra.bc.ca/na ... 668af3fb09
Nagios URL: http://nagiosxi-mtl.pmc-sierra.bc.ca/nagiosxi/




From: Nagios XI MTL [mailto:[email protected]]
Sent: Friday, February 01, 2013 3:34 PM
To: Ricardo Nunez
Subject: PROBLEM Service Alert - mtl1log02/fs_/usr is CRITICAL

***** Nagios XI Alert *****

Nagios has detected a problem with this service.

Notification Type: PROBLEM

Service: fs_/usr
Host: mtl1log02
Address: 134.87.177.115
State: CRITICAL
Info:
CRIT - 97.0% used (8.25 of 8.5 GB), (levels at 80.0/90.0%), trend: 0.00B / 24 hours
Date/Time: 2013-02-01 20:34:12
Location: Montreal

Respond: http://nagiosxi-mtl.pmc-sierra.bc.ca/na ... 668af3fb09
Nagios URL: http://nagiosxi-mtl.pmc-sierra.bc.ca/nagiosxi/

Re: fs_/usr service escalation issue

Posted: Mon Feb 04, 2013 10:03 am
by scottwilkerson
nunezric wrote:These services were added by hand as using the Nagios Core in order to define the escalation.

/usr/local/nagios/etc/services/adinfo.cfg
define service {
service_description adinfo
display_name adinfo
register 1
}

/usr/local/nagios/etc/services/fs_usr.cfg
define service {
service_description fs_/usr
register 1
}
these don't appear to be complete definitions. In addition you should not edit the files directly in /usr/local/nagios/etc/services it is a bad idea. If you want to manually edit configuration items they should be placed in /usr/local/nagios/etc/static otherwise, all items should be managed through Configure -> CCM

Re: fs_/usr service escalation issue

Posted: Mon Feb 04, 2013 12:48 pm
by nunezric
Yes, I used Nagios CCM to define those services. I was just avoiding to copy/paste too much garbage. I'll do that now:

In /usr/local/nagios/etc/services we have chosen these two files for this issue:

adinfo.cfg:
###############################################################################
#
# Service configuration file
#
# Created by: Nagios QL Version 3.0.3
# Date: 2013-02-01 23:38:56
# Version: Nagios 3.x config file
#
# --- DO NOT EDIT THIS FILE BY HAND ---
# Nagios QL will overwite all manual settings during the next update
#
###############################################################################

define service {
service_description adinfo
display_name adinfo
register 1
}

###############################################################################
#
# Service configuration file
#
# END OF FILE
#
###############################################################################

fs_usr.cfg:
###############################################################################
#
# Service configuration file
#
# Created by: Nagios QL Version 3.0.3
# Date: 2013-02-01 23:38:56
# Version: Nagios 3.x config file
#
# --- DO NOT EDIT THIS FILE BY HAND ---
# Nagios QL will overwite all manual settings during the next update
#
###############################################################################

define service {
service_description fs_/usr
register 1
}

###############################################################################
#
# Service configuration file
#
# END OF FILE
#
###############################################################################

I do believe the issue is related to the character "/"


More information:

Nagios XI 2012R1.4 after at least two updates.

Linux Distribution and version?
rpm -qa redhat-release
redhat-release-5Server-5.8.0.3


32 or 64bit?
uname -a
Linux mtlems02.pmc-sierra.bc.ca 2.6.18-308.11.1.el5 #1 SMP Fri Jun 15 15:41:53 EDT 2012 x86_64 x86_64 x86_64 GNU/Linux


VMware Image or Manual Install of XI?
Manual install of XI

Are there specials configurations on your system, ie; is Gnome installed? Are you using a proxy? Are you using SSL? No, No, No

Re: fs_/usr service escalation issue

Posted: Mon Feb 04, 2013 3:24 pm
by scottwilkerson
Neither of the services you defined have hosts or hostgroups associated with them. They also are missing check commands as well as other settings..

Every service must have a host or will not perform any checks..

Am I missing something in your configuration?

Re: fs_/usr service escalation issue

Posted: Mon Feb 04, 2013 4:18 pm
by nunezric
You were right.

I've attached a zip file which has the actual /usr/local/nagios/etc/static/check_mk_objects.cfg file.

You will see that a lot of devices have these services.


Ricardo

Re: fs_/usr service escalation issue

Posted: Mon Feb 04, 2013 5:17 pm
by scottwilkerson
I didn't realize you had items in the static directory for these services....

Let me re-look at the thread...

Re: fs_/usr service escalation issue

Posted: Mon Feb 04, 2013 5:21 pm
by scottwilkerson
Can you verify that EVERY host in linux-servers hostgroup has ALL of these services

Code: Select all

adinfo,CPU load,fs_/,fs_/boot,fs_/tmp,fs_/usr,fs_/usr/export/home,fs_/var,Memory used,ypwhich

Re: fs_/usr service escalation issue

Posted: Tue Feb 05, 2013 9:22 am
by nunezric
Yes, every host which belongs to linux-servers has all of those services.

If not, Nagios CCM won't let me "Apply Configuration" when I was adding the service escalation.


Ricardo

Re: fs_/usr service escalation issue

Posted: Wed Feb 06, 2013 11:58 am
by scottwilkerson
Is Ricardo Nunez part of the admins contact group which is added as the contacts for these checks?

Re: fs_/usr service escalation issue

Posted: Wed Feb 06, 2013 4:52 pm
by nunezric
Attaching a ZIP file which has:

/usr/local/nagios/etc/contactgroups.cfg
/usr/local/nagios/etc/contacts.cfg

By the way, "Ricardo Nunez" does receive the e-mails regarding fs_/usr service, the problem is that is doesn't escalate. The alerts related to "adinfo", for example, do escalate. That's the issue I'm having.

In other words, "Ricardo Nunez" always receives the alerts, but the “service-now” contact does receive the adinfo alerts but NOT the fs_/usr alerts.

Attaching a another ZIP file which has:

/usr/local/nagios/etc/serviceescalations.cfg


Ricardo