Page 1 of 1

Socket timeout after 10 seconds

Posted: Mon Jan 14, 2013 9:26 am
by lafargeuser
Now days we are getting no of servers with below alert,

***** Nagios XI Alert *****

Nagios has detected a problem with this service.

Notification Type: PROBLEM

Service: Memory Usage
Host: wfsvhtx003
Address: 10.10.26.89
State: CRITICAL
Info:
CRITICAL - Socket timeout after 10 seconds
Date/Time: 2013-01-14 12:45:08


Is this because of time out value in nagios.cfg is not suffient to execute that particular command ?
service_check_timeout=60
host_check_timeout=30
event_handler_timeout=30
notification_timeout=30
ocsp_timeout=5
perfdata_timeout=5

Re: Socket timeout after 10 seconds

Posted: Mon Jan 14, 2013 10:36 am
by slansing
This is most likely due to nrpe being unable to contact an agent to retrieve check information if you are indeed using NRPE for this check, can you show us your nrpe.cfg file on both the Nagios XI server, and your remote machine?

Re: Socket timeout after 10 seconds

Posted: Tue Jan 15, 2013 6:56 am
by lafargeuser
Attaching..........

Re: Socket timeout after 10 seconds

Posted: Tue Jan 15, 2013 10:52 am
by slansing
In both NRPE.CFG files you must have the other server's IP placed in the allowed_hosts section... This allows them to communicate:

Code: Select all

# ALLOWED HOST ADDRESSES
# This is an optional comma-delimited list of IP address or hostnames 
# that are allowed to talk to the NRPE daemon.
After you make this change restart the daemon, or re init the file if using the inet.d/xinet.d method, and also make sure that port 5666 tcp/udp is open on both systems.