Hey All,
I'm using Nagios v3.3.1 linked to 15 Linux clients.
I'm having an issue while one of the client servers, receives a warning regarding 'Current Load' service - I get emails from ALL clients as they were part of the problem also. which means, instead of getting 1 email from Client1 ONLY, I get 15 emails.
Would appreciate your help.
Regards,
Nir.
'Current load' flood
Re: 'Current load' flood
I'm not quite sure what you mean by "linked," would you mind elaborating a little bit? Do you just mean that you are running checks against them? Are any of them dependent on the others?
Re: 'Current load' flood
Thanks for your reply, agriffin.agriffin wrote:I'm not quite sure what you mean by "linked," would you mind elaborating a little bit? Do you just mean that you are running checks against them? Are any of them dependent on the others?
Yes, My meaning was that I just run checks on various clients. none of them are depended on each other but yet, whenever one of them happens to get 'Current Load' warning - I get emails from all clients. as they were part of the problem as well.
Re: 'Current load' flood
I'm not sure what could cause this. Would you mind posting your service definitions?
Re: 'Current load' flood
Sure.agriffin wrote:I'm not sure what could cause this. Would you mind posting your service definitions?
This is Nagios's server .cfg file - localhost.cfg
Code: Select all
###############################################################################
# LOCALHOST.CFG - SAMPLE OBJECT CONFIG FILE FOR MONITORING THIS MACHINE
#
# Last Modified: 05-31-2007
#
# NOTE: This config file is intended to serve as an *extremely* simple
# example of how you can create configuration entries to monitor
# the local (Linux) machine.
#
###############################################################################
###############################################################################
###############################################################################
#
# HOST DEFINITION
#
###############################################################################
###############################################################################
# Define a host for the local machine
define host{
use linux-server ; Name of host template to use
; This host definition will inherit all variables that are defined
; in (or inherited by) the linux-server host template definition.
host_name n-nis02
alias n-nis02
address 127.0.0.1
}
###############################################################################
###############################################################################
#
# HOST GROUP DEFINITION
#
###############################################################################
###############################################################################
# Define an optional hostgroup for Linux machines
define hostgroup{
hostgroup_name linux-servers ; The name of the hostgroup
alias Linux Servers ; Long name of the group
members n-nis02,lx101,lx102,lx103,lx104,lx105,lx106,n-nis01 ; Comma separated list of hosts that belong to this group
}
###############################################################################
###############################################################################
#
# SERVICE DEFINITIONS
#
###############################################################################
###############################################################################
# Define a service to "ping" the local machine
define service{
use local-service ; Name of service template to use
host_name n-nis02
service_description PING
check_command check_ping!100.0,20%!500.0,60%
}
# Define a service to check the disk space of the root partition
# on the local machine. Warning if < 20% free, critical if
# < 10% free space on partition.
define service{
use local-service ; Name of service template to use
host_name n-nis02
service_description Root Partition
check_command check_local_disk!10%!5%!/
}
# Define a service to check the disk space of a mounted partition
# on the local machine. Warning if < 10% free, critical if
# < 5% free space on partition.
define service{
use local-service ; Name of service template to use
host_name n-nis02
service_description /DATA Disk Space
check_command check_nrpe!check_disk!3%!5%!/
}
# Define a service to check the number of currently logged in
# users on the local machine. Warning if > 20 users, critical
# if > 50 users.
define service{
use local-service ; Name of service template to use
host_name n-nis02
service_description Current Users
check_command check_local_users!20!50
}
# Define a service to check the number of currently running procs
# on the local machine. Warning if > 250 processes, critical if
# > 400 users.
define service{
use local-service ; Name of service template to use
host_name n-nis02
service_description Total Processes
check_command check_local_procs!250!400!RSZDT
}
# Define a service to check the load on the local machine.
define service{
use local-service ; Name of service template to use
host_name n-nis02
service_description Current Load
check_command check_local_load!5.0,4.0,3.0!10.0,6.0,4.0
}
# Define a service to check the swap usage the local machine.
# Critical if less than 10% of swap is free, warning if less than 20% is free
define service{
use local-service ; Name of service template to use
host_name n-nis02
service_description Swap Usage
check_command check_local_swap!20!10
}
# Define a service to check SSH on the local machine.
# Disable notifications for this service by default, as not all users may have SSH enabled.
define service{
use local-service ; Name of service template to use
host_name n-nis02
service_description SSH
check_command check_ssh
notifications_enabled 0
}
# Define a service to check HTTP on the local machine.
# Disable notifications for this service by default, as not all users may have HTTP enabled.
#define service{
# use local-service ; Name of service template to use
# host_name n-nis02
# service_description HTTP
# check_command check_http
# notifications_enabled 0
# }