Been a while since I've been on Nagios, have had some major projects taking up too much time =(
Setup has been running fine for quite a while now, but yesterday we needed to add http checks to 4 of our websites. This sounded simple enough, but for some reason the services are constantly flapping.
Below is the command definition (I've removed the -w and -c to keep it simple) and the service definitions:
define command {
    command_name    check_http_hosts
    command_line    /usr/local/nagios/libexec/check_http -H $ARG1
}
define service {
    use                  http-service
    host_name            localhost
    service_description  HTTP - www.globility.co.uk
    check_command        check_http_hosts!www.globility.co.uk
}
.... (x3 more)
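For comparison, Nagios argument macros are written with a $ on both sides ($ARG1$), and the command definition as pasted above has no closing $. A minimal sketch of the same command with the macro closed, assuming the missing $ is in the real config and not just a slip when pasting it here:

```
# Assumption: same command as above, only the closing $ on the macro is added.
# $ARG1$ is replaced by the text after the first ! in the service's check_command.
define command {
    command_name    check_http_hosts
    command_line    /usr/local/nagios/libexec/check_http -H $ARG1$
}
```

With the macro closed, check_http_hosts!www.globility.co.uk should expand to check_http -H www.globility.co.uk, which is the same invocation as running the plugin by hand.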
nagios@nagios:/usr/local/nagios/libexec# ./check_http -H www.globility.co.uk
HTTP OK: HTTP/1.0 200 OK - 10020 bytes in 0.034 second response time |time=0.033887s;;;0.000000 size=10020B;;;0
In the past we had one HTTP check running non-stop which never had trouble. We removed that one and added these 4 new ones and are seeing this behaviour on all of them. We are unable to add the old one back to check, as that domain is no longer in service.
Name or service not known
HTTP CRITICAL - Unable to open TCP socket
Thanks for any help, and I'm happy to provide more information if needed.
Kind Regards,
Gary Shergill