CRITICAL - Socket timeout after 10 seconds
Posted: Wed Aug 08, 2012 5:07 am
Hi everyone,
Need a little help.
Getting a CRITICAL - Socket timeout after 10 seconds error on all my services intermittently. Seems to happen every 10-12 hours or so.
Nagios core installed on CentOS 5.8 - using only passive checks on Windows servers and services.
Think the problem is probably related to NSCA on the CentOS machine as the fix for this is:
killall nsca
killall nsca
/usr/local/nagios/bin/nsca -c /usr/local/nagios/etc/nsca.cfg
Which kills 2 nsca processes then re-registers it (I think) I've discovered this through trial and error messing around trying to get it sorted
Not sure if it's related to the fact that nsca is running twice possibly?
My 2 questions are:
1). anyone have any ideas on a permanent fix for this?
2). Could I have a cronjob that just does killall nsca x 2 and then /usr/local/nagios/bin/nsca -c /usr/local/nagios/etc/nsca.cfg every 10 minutes or so - this should keep things running as doing that command when its working doesn't seem have any affect on Nagios
I tried putting the following in my etc/crontab:
# kill duplicate NSCA pid's
*/10 * * * * killall nsca
*/10 * * * * killall nsca
*/10 * * * * /usr/local/nagios/bin/nsca -c /usr/local/nagios/etc/nsca.cfg
Which I think should run those commands every 10 mins, but not sure if that will work?
Need a little help.
Getting a CRITICAL - Socket timeout after 10 seconds error on all my services intermittently. Seems to happen every 10-12 hours or so.
Nagios core installed on CentOS 5.8 - using only passive checks on Windows servers and services.
Think the problem is probably related to NSCA on the CentOS machine as the fix for this is:
killall nsca
killall nsca
/usr/local/nagios/bin/nsca -c /usr/local/nagios/etc/nsca.cfg
Which kills 2 nsca processes then re-registers it (I think) I've discovered this through trial and error messing around trying to get it sorted
Not sure if it's related to the fact that nsca is running twice possibly?
My 2 questions are:
1). anyone have any ideas on a permanent fix for this?
2). Could I have a cronjob that just does killall nsca x 2 and then /usr/local/nagios/bin/nsca -c /usr/local/nagios/etc/nsca.cfg every 10 minutes or so - this should keep things running as doing that command when its working doesn't seem have any affect on Nagios
I tried putting the following in my etc/crontab:
# kill duplicate NSCA pid's
*/10 * * * * killall nsca
*/10 * * * * killall nsca
*/10 * * * * /usr/local/nagios/bin/nsca -c /usr/local/nagios/etc/nsca.cfg
Which I think should run those commands every 10 mins, but not sure if that will work?