Nagios Event Handler is not triggering

Support forum for Nagios Core, Nagios Plugins, NCPA, NRPE, NSCA, NDOUtils and more. Engage with the community of users including those using the open source solutions.
Locked
dreamboy_sivak
Posts: 18
Joined: Tue Mar 25, 2014 2:13 am

Nagios Event Handler is not triggering

Post by dreamboy_sivak »

Nagios Event Handler is not triggering when the service is taking more time to response or down.

My configuration in below

nagios.cfg

enable_event_handlers=1

localhost.cfg

define service {

use generic-service
host_name Server
service_description test-server
servicegroups test-service
check_command check-service
is_volatile 0
check_period 24x7
max_check_attempts 4
normal_check_interval 2
retry_check_interval 2
contact_groups testcontacts
notification_period 24x7
notification_options w,u,c,r
notifications_enabled 1
event_handler_enabled 1
event_handler recheck-service
}

command.cfg

define command{
command_name recheck-service
command_line /usr/local/nagios/libexec/alert.sh $SERVICESTATE$ $SERVICESTATETYPE$ $SERVICEATTEMPT$
}

alert.sh file

!/bin/sh
set -x

case "$1" in

OK)

# The service just came back up, so don't do anything...

;;

WARNING)

# We don't really care about warning states, since the service is probably still running...

;;
UNKNOWN)
# We don't know what might be causing an unknown error, so don't do anything...
;;
CRITICAL)
Aha! The HTTP service appears to have a problem - perhaps we should restart the server...

Is this a "soft" or a "hard" state?
case "$2" in

We're in a "soft" state, meaning that Nagios is in the middle of retrying the
check before it turns into a "hard" state and contacts get notified...
SOFT)

# What check attempt are we on? We don't want to restart the web server on the first
check, because it may just be a fluke!
case "$3" in

Wait until the check has been tried 3 times before restarting the web server.
If the check fails on the 4th time (after we restart the web server), the state
type will turn to "hard" and contacts will be notified of the problem.
Hopefully this will restart the web server successfully, so the 4th check will
result in a "soft" recovery. If that happens no one gets notified because we
fixed the problem!
3)
echo -n "Going To Ping the Virtual Machine (3rd soft critical state)..."
# Call the init script to restart the HTTPD server
myresult=`/usr/local/nagios/libexec/check_http xyz.com -t 100 | grep 'time'| awk '{print $10}'`
echo "Your Service Is taking the following time Delay" "$myresult Seconds" |mail -s "WARNING : Service Taken More Time To Response" xyz@gmail.com
;;
esac
;;

# The HTTP service somehow managed to turn into a hard error without getting fixed.
# It should have been restarted by the code above, but for some reason it didn't.
# Let's give it one last try, shall we?
# Note: Contacts have already been notified of a problem with the service
scottwilkerson
DevOps Engineer
Posts: 19396
Joined: Tue Nov 15, 2011 3:11 pm
Location: Nagios Enterprises
Contact:

Re: Nagios Event Handler is not triggering

Post by scottwilkerson »

At which stage in alert.sh is it not functioning?

I'm assuming what you have here is pseudo-code and you do not have the un-commented comments actually in the file, correct?
Former Nagios employee
Creator:
ahumandesign.com
enneagrams.com
dreamboy_sivak
Posts: 18
Joined: Tue Mar 25, 2014 2:13 am

Re: Nagios Event Handler is not triggering

Post by dreamboy_sivak »

Thanks scott for your reply.

Fixed this issue. Forget to compile the nagios and found file accessing permission issue to nagios user.

so corrected this all mistakes now event works fine
User avatar
lmiltchev
Former Nagios Staff
Posts: 13587
Joined: Mon May 23, 2011 12:15 pm

Re: Nagios Event Handler is not triggering

Post by lmiltchev »

I am glad your issue has been resolved! I am locking this topic.
Be sure to check out our Knowledgebase for helpful articles and solutions!
Locked