fs_/usr service escalation issue
fs_/usr service escalation issue
Hi,
Our Nagios service escalation for fs_/usr service is not working. We believe it should be related to the “/” character.
mtl1log02 server belongs to “linux-servers” host group.
Parts of /etc/check_mk/main.mk:
all_hosts = [
(…)
"mtl1log02|linux|linux-servers",
(..)
]
(…)
host_groups = [
(…)
( 'linux-servers', [ 'linux-servers' ], ALL_HOSTS ),
(…)
]
(…)
I defined a service escalation for this host group (file /usr/local/nagios/etc/serviceescalations.cfg):
define serviceescalation {
hostgroup_name linux-servers
service_description adinfo,CPU load,fs_/,fs_/boot,fs_/tmp,fs_/usr,fs_/usr/export/home,fs_/var,Memory used,ypwhich
contacts service-now
first_notification 1
last_notification 1
notification_interval 1440
escalation_period 24x7
escalation_options c,
}
These services were added by hand as using the Nagios Core in order to define the escalation.
/usr/local/nagios/etc/services/adinfo.cfg
define service {
service_description adinfo
display_name adinfo
register 1
}
/usr/local/nagios/etc/services/fs_usr.cfg
define service {
service_description fs_/usr
register 1
}
The critical adinfo problem was escalated, but not the fs_/usr issue. We believe it should be related to the “/” character.
Ricardo
From: Nagios XI MTL [mailto:nagios-admin@pmc-sierra.com]
Sent: Friday, February 01, 2013 3:52 PM
To: Ricardo Nunez
Subject: PROBLEM Service Alert - mtl1log02/adinfo is CRITICAL
***** Nagios XI Alert *****
** This is an escalated notification ** Nagios has detected a problem with this service.
Notification Type: PROBLEM
Service: adinfo
Host: mtl1log02
Address: 134.87.177.115
State: CRITICAL
Info:
Critical. Connected to nombre
Date/Time: 2013-02-01 20:52:22
Location: Montreal
Respond: http://nagiosxi-mtl.pmc-sierra.bc.ca/na ... 668af3fb09
Nagios URL: http://nagiosxi-mtl.pmc-sierra.bc.ca/nagiosxi/
From: Nagios XI MTL [mailto:nagios-admin@pmc-sierra.com]
Sent: Friday, February 01, 2013 3:34 PM
To: Ricardo Nunez
Subject: PROBLEM Service Alert - mtl1log02/fs_/usr is CRITICAL
***** Nagios XI Alert *****
Nagios has detected a problem with this service.
Notification Type: PROBLEM
Service: fs_/usr
Host: mtl1log02
Address: 134.87.177.115
State: CRITICAL
Info:
CRIT - 97.0% used (8.25 of 8.5 GB), (levels at 80.0/90.0%), trend: 0.00B / 24 hours
Date/Time: 2013-02-01 20:34:12
Location: Montreal
Respond: http://nagiosxi-mtl.pmc-sierra.bc.ca/na ... 668af3fb09
Nagios URL: http://nagiosxi-mtl.pmc-sierra.bc.ca/nagiosxi/
Our Nagios service escalation for fs_/usr service is not working. We believe it should be related to the “/” character.
mtl1log02 server belongs to “linux-servers” host group.
Parts of /etc/check_mk/main.mk:
all_hosts = [
(…)
"mtl1log02|linux|linux-servers",
(..)
]
(…)
host_groups = [
(…)
( 'linux-servers', [ 'linux-servers' ], ALL_HOSTS ),
(…)
]
(…)
I defined a service escalation for this host group (file /usr/local/nagios/etc/serviceescalations.cfg):
define serviceescalation {
hostgroup_name linux-servers
service_description adinfo,CPU load,fs_/,fs_/boot,fs_/tmp,fs_/usr,fs_/usr/export/home,fs_/var,Memory used,ypwhich
contacts service-now
first_notification 1
last_notification 1
notification_interval 1440
escalation_period 24x7
escalation_options c,
}
These services were added by hand as using the Nagios Core in order to define the escalation.
/usr/local/nagios/etc/services/adinfo.cfg
define service {
service_description adinfo
display_name adinfo
register 1
}
/usr/local/nagios/etc/services/fs_usr.cfg
define service {
service_description fs_/usr
register 1
}
The critical adinfo problem was escalated, but not the fs_/usr issue. We believe it should be related to the “/” character.
Ricardo
From: Nagios XI MTL [mailto:nagios-admin@pmc-sierra.com]
Sent: Friday, February 01, 2013 3:52 PM
To: Ricardo Nunez
Subject: PROBLEM Service Alert - mtl1log02/adinfo is CRITICAL
***** Nagios XI Alert *****
** This is an escalated notification ** Nagios has detected a problem with this service.
Notification Type: PROBLEM
Service: adinfo
Host: mtl1log02
Address: 134.87.177.115
State: CRITICAL
Info:
Critical. Connected to nombre
Date/Time: 2013-02-01 20:52:22
Location: Montreal
Respond: http://nagiosxi-mtl.pmc-sierra.bc.ca/na ... 668af3fb09
Nagios URL: http://nagiosxi-mtl.pmc-sierra.bc.ca/nagiosxi/
From: Nagios XI MTL [mailto:nagios-admin@pmc-sierra.com]
Sent: Friday, February 01, 2013 3:34 PM
To: Ricardo Nunez
Subject: PROBLEM Service Alert - mtl1log02/fs_/usr is CRITICAL
***** Nagios XI Alert *****
Nagios has detected a problem with this service.
Notification Type: PROBLEM
Service: fs_/usr
Host: mtl1log02
Address: 134.87.177.115
State: CRITICAL
Info:
CRIT - 97.0% used (8.25 of 8.5 GB), (levels at 80.0/90.0%), trend: 0.00B / 24 hours
Date/Time: 2013-02-01 20:34:12
Location: Montreal
Respond: http://nagiosxi-mtl.pmc-sierra.bc.ca/na ... 668af3fb09
Nagios URL: http://nagiosxi-mtl.pmc-sierra.bc.ca/nagiosxi/
-
- DevOps Engineer
- Posts: 19396
- Joined: Tue Nov 15, 2011 3:11 pm
- Location: Nagios Enterprises
- Contact:
Re: fs_/usr service escalation issue
these don't appear to be complete definitions. In addition you should not edit the files directly in /usr/local/nagios/etc/services it is a bad idea. If you want to manually edit configuration items they should be placed in /usr/local/nagios/etc/static otherwise, all items should be managed through Configure -> CCMnunezric wrote:These services were added by hand as using the Nagios Core in order to define the escalation.
/usr/local/nagios/etc/services/adinfo.cfg
define service {
service_description adinfo
display_name adinfo
register 1
}
/usr/local/nagios/etc/services/fs_usr.cfg
define service {
service_description fs_/usr
register 1
}
Re: fs_/usr service escalation issue
Yes, I used Nagios CCM to define those services. I was just avoiding to copy/paste too much garbage. I'll do that now:
In /usr/local/nagios/etc/services we have chosen these two files for this issue:
adinfo.cfg:
###############################################################################
#
# Service configuration file
#
# Created by: Nagios QL Version 3.0.3
# Date: 2013-02-01 23:38:56
# Version: Nagios 3.x config file
#
# --- DO NOT EDIT THIS FILE BY HAND ---
# Nagios QL will overwite all manual settings during the next update
#
###############################################################################
define service {
service_description adinfo
display_name adinfo
register 1
}
###############################################################################
#
# Service configuration file
#
# END OF FILE
#
###############################################################################
fs_usr.cfg:
###############################################################################
#
# Service configuration file
#
# Created by: Nagios QL Version 3.0.3
# Date: 2013-02-01 23:38:56
# Version: Nagios 3.x config file
#
# --- DO NOT EDIT THIS FILE BY HAND ---
# Nagios QL will overwite all manual settings during the next update
#
###############################################################################
define service {
service_description fs_/usr
register 1
}
###############################################################################
#
# Service configuration file
#
# END OF FILE
#
###############################################################################
I do believe the issue is related to the character "/"
More information:
Nagios XI 2012R1.4 after at least two updates.
Linux Distribution and version?
rpm -qa redhat-release
redhat-release-5Server-5.8.0.3
32 or 64bit?
uname -a
Linux mtlems02.pmc-sierra.bc.ca 2.6.18-308.11.1.el5 #1 SMP Fri Jun 15 15:41:53 EDT 2012 x86_64 x86_64 x86_64 GNU/Linux
VMware Image or Manual Install of XI?
Manual install of XI
Are there specials configurations on your system, ie; is Gnome installed? Are you using a proxy? Are you using SSL? No, No, No
In /usr/local/nagios/etc/services we have chosen these two files for this issue:
adinfo.cfg:
###############################################################################
#
# Service configuration file
#
# Created by: Nagios QL Version 3.0.3
# Date: 2013-02-01 23:38:56
# Version: Nagios 3.x config file
#
# --- DO NOT EDIT THIS FILE BY HAND ---
# Nagios QL will overwite all manual settings during the next update
#
###############################################################################
define service {
service_description adinfo
display_name adinfo
register 1
}
###############################################################################
#
# Service configuration file
#
# END OF FILE
#
###############################################################################
fs_usr.cfg:
###############################################################################
#
# Service configuration file
#
# Created by: Nagios QL Version 3.0.3
# Date: 2013-02-01 23:38:56
# Version: Nagios 3.x config file
#
# --- DO NOT EDIT THIS FILE BY HAND ---
# Nagios QL will overwite all manual settings during the next update
#
###############################################################################
define service {
service_description fs_/usr
register 1
}
###############################################################################
#
# Service configuration file
#
# END OF FILE
#
###############################################################################
I do believe the issue is related to the character "/"
More information:
Nagios XI 2012R1.4 after at least two updates.
Linux Distribution and version?
rpm -qa redhat-release
redhat-release-5Server-5.8.0.3
32 or 64bit?
uname -a
Linux mtlems02.pmc-sierra.bc.ca 2.6.18-308.11.1.el5 #1 SMP Fri Jun 15 15:41:53 EDT 2012 x86_64 x86_64 x86_64 GNU/Linux
VMware Image or Manual Install of XI?
Manual install of XI
Are there specials configurations on your system, ie; is Gnome installed? Are you using a proxy? Are you using SSL? No, No, No
-
- DevOps Engineer
- Posts: 19396
- Joined: Tue Nov 15, 2011 3:11 pm
- Location: Nagios Enterprises
- Contact:
Re: fs_/usr service escalation issue
Neither of the services you defined have hosts or hostgroups associated with them. They also are missing check commands as well as other settings..
Every service must have a host or will not perform any checks..
Am I missing something in your configuration?
Every service must have a host or will not perform any checks..
Am I missing something in your configuration?
Re: fs_/usr service escalation issue
You were right.
I've attached a zip file which has the actual /usr/local/nagios/etc/static/check_mk_objects.cfg file.
You will see that a lot of devices have these services.
Ricardo
I've attached a zip file which has the actual /usr/local/nagios/etc/static/check_mk_objects.cfg file.
You will see that a lot of devices have these services.
Ricardo
You do not have the required permissions to view the files attached to this post.
-
- DevOps Engineer
- Posts: 19396
- Joined: Tue Nov 15, 2011 3:11 pm
- Location: Nagios Enterprises
- Contact:
Re: fs_/usr service escalation issue
I didn't realize you had items in the static directory for these services....
Let me re-look at the thread...
Let me re-look at the thread...
-
- DevOps Engineer
- Posts: 19396
- Joined: Tue Nov 15, 2011 3:11 pm
- Location: Nagios Enterprises
- Contact:
Re: fs_/usr service escalation issue
Can you verify that EVERY host in linux-servers hostgroup has ALL of these services
Code: Select all
adinfo,CPU load,fs_/,fs_/boot,fs_/tmp,fs_/usr,fs_/usr/export/home,fs_/var,Memory used,ypwhich
Re: fs_/usr service escalation issue
Yes, every host which belongs to linux-servers has all of those services.
If not, Nagios CCM won't let me "Apply Configuration" when I was adding the service escalation.
Ricardo
If not, Nagios CCM won't let me "Apply Configuration" when I was adding the service escalation.
Ricardo
-
- DevOps Engineer
- Posts: 19396
- Joined: Tue Nov 15, 2011 3:11 pm
- Location: Nagios Enterprises
- Contact:
Re: fs_/usr service escalation issue
Is Ricardo Nunez part of the admins contact group which is added as the contacts for these checks?
Re: fs_/usr service escalation issue
Attaching a ZIP file which has:
/usr/local/nagios/etc/contactgroups.cfg
/usr/local/nagios/etc/contacts.cfg
By the way, "Ricardo Nunez" does receive the e-mails regarding fs_/usr service, the problem is that is doesn't escalate. The alerts related to "adinfo", for example, do escalate. That's the issue I'm having.
In other words, "Ricardo Nunez" always receives the alerts, but the “service-now” contact does receive the adinfo alerts but NOT the fs_/usr alerts.
Attaching a another ZIP file which has:
/usr/local/nagios/etc/serviceescalations.cfg
Ricardo
/usr/local/nagios/etc/contactgroups.cfg
/usr/local/nagios/etc/contacts.cfg
By the way, "Ricardo Nunez" does receive the e-mails regarding fs_/usr service, the problem is that is doesn't escalate. The alerts related to "adinfo", for example, do escalate. That's the issue I'm having.
In other words, "Ricardo Nunez" always receives the alerts, but the “service-now” contact does receive the adinfo alerts but NOT the fs_/usr alerts.
Attaching a another ZIP file which has:
/usr/local/nagios/etc/serviceescalations.cfg
Ricardo
You do not have the required permissions to view the files attached to this post.