I have a most perplexing couple of issues:
-----how to verify that a host is really up
1st, I'm using host checks (set up as described in my previous email).
2nd, after an erroneous "system up" result, the system is marked up.
Since I can set the number and interval of checks used to decide that a
system is down, I would assume that the same criteria would be used to
decide that a system is up (i.e. in this case I
would expect
1) I probably missed it, but I can find no way of counting, say, 3
successful pings in a row to indicate that the system is up.
2) Can an event handler be run before Nagios sends a
notification (to do further testing)?
What I would really like to do to verify that a system is up is:
1) get 3 OK pings
2) then run an ssh to the system that just does (say) an uptime, and
if that works, declare the system up
but if the ssh fails, stay in a loop reissuing it until the correct
response is received
AND
Can I get there using HOST checks? It seemed like it was going to be
simpler when I switched a while back, but
now I'm not sure I can get them to work correctly.
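
To make the idea concrete, here is a rough Python sketch of the kind of
host check command I have in mind: it only reports OK after 3 successful
pings in a row followed by a working "ssh host uptime". Everything in it
is an assumption for illustration, not something Nagios ships: the
function names, the Linux-style ping flags (-c, -W), the ssh options, and
the timeout values. The endless "loop until it works" is replaced by a
bounded number of ssh retries, since the check has to finish inside the
plugin timeout.

```python
import subprocess

OK, CRITICAL, UNKNOWN = 0, 2, 3  # standard Nagios plugin exit codes


def run_ok(cmd, timeout):
    """Return True if cmd exits with status 0 within timeout seconds."""
    try:
        result = subprocess.run(
            cmd,
            stdout=subprocess.DEVNULL,
            stderr=subprocess.DEVNULL,
            timeout=timeout,
        )
    except (subprocess.TimeoutExpired, OSError):
        return False
    return result.returncode == 0


def ping_once(host):
    # Assumption: Linux iputils ping (-c 1 = one request, -W 1 = wait 1 s).
    return run_ok(["ping", "-c", "1", "-W", "1", host], timeout=2)


def ssh_uptime(host):
    # BatchMode=yes makes ssh fail instead of prompting for a password.
    cmd = ["ssh", "-o", "BatchMode=yes", "-o", "ConnectTimeout=3",
           host, "uptime"]
    return run_ok(cmd, timeout=4)


def check_host(host, pings_needed=3, ssh_tries=3,
               ping=ping_once, ssh=ssh_uptime):
    """Return (exit_code, status_line) for one host check."""
    # Step 1: every ping must succeed; one failure means DOWN.
    for n in range(1, pings_needed + 1):
        if not ping(host):
            return CRITICAL, f"CRITICAL - ping {n} of {pings_needed} failed"
    # Step 2: retry ssh a bounded number of times instead of forever.
    for _ in range(ssh_tries):
        if ssh(host):
            return OK, f"OK - {pings_needed} pings and ssh uptime succeeded"
    return CRITICAL, f"CRITICAL - pings OK but ssh uptime failed {ssh_tries} times"


def main(argv):
    """Print exactly one status line and return the plugin exit code."""
    if len(argv) != 2:
        print("UNKNOWN - usage: check_really_up.py <host>")
        return UNKNOWN
    code, message = check_host(argv[1])
    print(message)
    return code
```

Used as a plugin, the script would end with "sys.exit(main(sys.argv))"
(after "import sys") and be wired in as the host's check command. With
the values above the worst case should stay under the 20 second plugin
timeout seen in my log, but that would need checking on a real system.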
NOTE: is the (No output!) result what is marking the host up?
date
mmdd hhmm ss
0203 0332 43 [1138966363] HOST ALERT: system14;DOWN;SOFT;1;CRITICAL - Plugin timed out after 20 seconds
0203 0333 03 [1138966383] HOST ALERT: system14;DOWN;SOFT;2;CRITICAL - Plugin timed out after 20 seconds
0203 0333 23 [1138966403] HOST ALERT: system14;DOWN;HARD;3;CRITICAL - Plugin timed out after 20 seconds
0203 0333 23 [1138966403] HOST NOTIFICATION: mtools;system14;DOWN;host-notify-by-mknotify;CRITICAL - Plugin timed out after 20 seconds
0203 0426 05 [1138969565] HOST ALERT: system14;UP;HARD;1;(No output!)
0203 0426 05 [1138969565] HOST NOTIFICATION: mtools;system14;UP;host-notify-by-mknotify;(No output!)
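
For anyone who wants to pick the log apart, here is a small Python sketch
that parses the HOST ALERT lines above and shows the asymmetry I am
complaining about. The field layout (host;state;state type;attempt;output)
is read off the log lines themselves, and the helper name
parse_host_alert is just something I made up for the example.

```python
import re

# [epoch] HOST ALERT: host;state;state_type;attempt;plugin output
ALERT = re.compile(r"\[(\d+)\] HOST ALERT: ([^;]+);([^;]+);([^;]+);(\d+);(.*)")


def parse_host_alert(line):
    """Return a dict for a HOST ALERT line, or None for anything else."""
    match = ALERT.search(line)
    if match is None:
        return None
    stamp, host, state, state_type, attempt, output = match.groups()
    return {
        "time": int(stamp),
        "host": host,
        "state": state,
        "state_type": state_type,
        "attempt": int(attempt),
        "output": output,
    }


log = [
    "[1138966363] HOST ALERT: system14;DOWN;SOFT;1;CRITICAL - Plugin timed out after 20 seconds",
    "[1138966383] HOST ALERT: system14;DOWN;SOFT;2;CRITICAL - Plugin timed out after 20 seconds",
    "[1138966403] HOST ALERT: system14;DOWN;HARD;3;CRITICAL - Plugin timed out after 20 seconds",
    "[1138969565] HOST ALERT: system14;UP;HARD;1;(No output!)",
]

alerts = [a for a in (parse_host_alert(line) for line in log) if a]

# First HARD DOWN alert and the alert that flipped the host back to UP.
hard_down = next(a for a in alerts
                 if a["state"] == "DOWN" and a["state_type"] == "HARD")
recovery = next(a for a in alerts if a["state"] == "UP")

outage_seconds = recovery["time"] - hard_down["time"]
print("attempts before HARD DOWN:", hard_down["attempt"])
print("attempts before HARD UP:  ", recovery["attempt"])
print("seconds between the two:  ", outage_seconds)
```

It took three failed attempts to reach HARD DOWN, but the host went to
HARD UP on attempt 1, and that one result carried no plugin output at all.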
DEFINITELY open to suggestions
thanks
-Lee
This post was automatically imported from historical nagios-devel mailing list archives
Original poster: [email protected]