how can solve Socket timeout problem
Dear all,
I have a Windows server, but sometimes 1 or 2 of its sensors show this message on the Nagios server: (CRITICAL - Socket timeout after 10 seconds).
The other services are OK. The message usually appears about every 10 minutes and then clears again.
How can I solve this problem?
Please see the attached pictures. At first pic1 is shown; after a few minutes pic2 appears and everything is OK, but a few minutes later pic1 appears again.
This problem only appears on 2 or 3 of my 110 servers.
What is the problem?
Re: how can solve Socket timeout problem
If this is using check_nt, you could try altering your Service/Command definitions to have a higher timeout threshold. Something like -t 30 might be sufficient.
If you're not sure how to go about that, can you show us the associated Service and Command definitions that are having problems?
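Before settling on a -t value, it can help to run check_nt by hand from the Nagios server and see how long it really takes. Here is a minimal Python sketch of that. The plugin path /usr/local/nagios/libexec/check_nt and the helper names are assumptions for illustration, not something taken from this thread, so adjust them for your install.

```python
import subprocess
import time

# Assumption: plugin location of a typical source install; change as needed.
CHECK_NT = "/usr/local/nagios/libexec/check_nt"


def build_check_nt_args(host, variable, timeout=30, port=12489, extra=()):
    """Build the argument list for one manual check_nt run.

    -H, -p, -v and -t are standard check_nt options; `extra` carries any
    further arguments such as ["-d", "SHOWALL", "-l", "SNMP"].
    """
    return [CHECK_NT, "-H", host, "-p", str(port), "-v", variable,
            "-t", str(timeout), *extra]


def run_timed(args):
    """Run a command and return (exit_code, seconds_elapsed, output)."""
    start = time.monotonic()
    proc = subprocess.run(args, capture_output=True, text=True)
    elapsed = time.monotonic() - start
    return proc.returncode, elapsed, (proc.stdout + proc.stderr).strip()


# Example use (replace x.x.x.x with the address of the Windows host):
#   code, secs, out = run_timed(build_check_nt_args("x.x.x.x", "UPTIME", 30))
#   print(f"exit={code} took={secs:.1f}s output={out}")
```

If the manual run returns quickly most of the time and only occasionally hangs until the timeout, a larger -t will probably not help and the network path is the more likely suspect.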
Former Nagios employee
https://www.mcapra.com/
Re: how can solve Socket timeout problem
mcapra wrote: If this is using check_nt, you could try altering your Service/Command definitions to have a higher timeout threshold. Something like -t 30 might be sufficient. If you're not sure how to go about that, can you show us the associated Service and Command definitions that are having problems?
Thanks.
I used -t 240, but that sensor now shows this message:
connect to address x.x.x.x and port 12489: Connection timed out
Re: how can solve Socket timeout problem
Can you attach the nsclient.log file from the client machine, and also show us your entire service definition?
Former Nagios Employee
Re: how can solve Socket timeout problem
rkennedy wrote: Can you attach the nsclient.log file from the client machine, and also show us your entire service definition?
I have attached the critical error as it appears, and I have also attached the nsclient.log and server.cfg files.
Code: Select all
define host{
    use                   windows-server    ; Inherit default values from a template
    host_name             onlinecard_cdc1   ; The name we're giving to this host
    alias                 server onlinecard_cdc1 ; A longer name associated with the host
    address               x.x.x.x           ; IP address of the host
}
define service{
    use                   local-service
    host_name             onlinecard_cdc1
    service_description   Memory Usage
    check_command         check_nt!MEMUSE!-w 90 -c 95
}
define service{
    use                   local-service
    host_name             onlinecard_cdc1
    service_description   Service
    check_command         check_nt!SERVICESTATE!-d SHOWALL -l SNMP -t 240
}
define service{
    use                   local-service
    host_name             onlinecard_cdc1
    service_description   Cpu Usage
    check_command         check_nrpe!alias_cpu
}
define service{
    use                   local-service
    host_name             onlinecard_cdc1
    service_description   Disk Space
    check_command         check_nrpe!alias_volumes
}
define service{
    use                   local-service
    host_name             onlinecard_cdc1
    service_description   Time
    check_command         check_nrpe!check_windows_time
}
define service{
    use                   generic-service
    host_name             onlinecard_cdc1
    service_description   Uptime
    check_command         check_nt!UPTIME -t 180
}
define service{
    use                   generic-service
    host_name             onlinecard_cdc1
    service_description   CPU Load
    check_command         check_nt!CPULOAD!-l 5,80,90
}
connect to address x.x.x.x and port 12489: Connection timed out
Attachments
nsclient.log (2.37 KiB)
nsclient.ini (15.48 KiB)
Re: how can solve Socket timeout problem
It looks like some of your checks that aren't failing are using NRPE, but the failing ones are using check_nt. Is anything blocking traffic on port 12489 periodically?
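One way to find out whether port 12489 is being blocked from time to time is to probe it repeatedly from the Nagios server and log each result with a timestamp. The following is a rough Python sketch of that idea. The function names, the 30-second interval and the number of rounds are assumptions chosen for illustration.

```python
import socket
import time
from datetime import datetime


def probe_port(host, port=12489, timeout=5.0):
    """Try one TCP connection and classify the result.

    "open"    : the connection was accepted
    "refused" : the host answered but nothing is listening on the port
    "timeout" : no answer at all, as when a firewall silently drops packets
    "error: ...": any other socket error
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return "open"
    except ConnectionRefusedError:
        return "refused"
    except (socket.timeout, TimeoutError):
        return "timeout"
    except OSError as exc:
        return f"error: {exc}"


def watch(host, port=12489, interval=30, rounds=40, timeout=5.0):
    """Probe repeatedly, print one timestamped line per attempt,
    and return the list of (timestamp, state) pairs."""
    results = []
    for i in range(rounds):
        state = probe_port(host, port, timeout)
        stamp = datetime.now().isoformat(timespec="seconds")
        print(f"{stamp} {host}:{port} {state}")
        results.append((stamp, state))
        if i < rounds - 1:
            time.sleep(interval)
    return results


# Example use (replace x.x.x.x with the address of the Windows host):
#   watch("x.x.x.x", interval=30, rounds=40)   # roughly 20 minutes of probing
```

If the log shows "timeout" entries at the same times Nagios reports the socket timeout, while the NRPE checks keep working, that points at something on the path dropping traffic to 12489 rather than at the plugin timeout.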
Former Nagios Employee
Re: how can solve Socket timeout problem
rkennedy wrote: It looks like some of your checks that aren't failing are using NRPE, but the failing ones are using check_nt. Is anything blocking traffic on port 12489 periodically?
All ports on my servers are open; my servers have full port access. I am really confused.
Re: how can solve Socket timeout problem
From the command line of Nagios, please run the following commands and post back the full output (install nmap if you do not have it already) -
(replace x.x.x.x with the IP of the Windows machine we're troubleshooting here.)
Code: Select all
nmap x.x.x.x -p 12489
nmap x.x.x.x
Former Nagios Employee
Re: how can solve Socket timeout problem
rkennedy wrote: From the command line of Nagios, please run the following commands and post back the full output (install nmap if you do not have it already). (Replace x.x.x.x with the IP of the Windows machine we're troubleshooting here.)
Code: Select all
nmap x.x.x.x -p 12489
nmap x.x.x.x
Code: Select all
nmap 10.4.1.144 -p 12489
Starting Nmap 5.21 ( http://nmap.org ) at 2016-08-18 00:08 IRDT
mass_dns: warning: Unable to determine any DNS servers. Reverse DNS is disabled. Try using --system-dns or specify valid servers with --dns-servers
Nmap scan report for 10.4.1.144
Host is up (0.00085s latency).
PORT STATE SERVICE
12489/tcp filtered unknown
Nmap done: 1 IP address (1 host up) scanned in 0.35 seconds
Code: Select all
nmap 10.0.4.123 -p 12489
Starting Nmap 5.21 ( http://nmap.org ) at 2016-08-18 00:11 IRDT
mass_dns: warning: Unable to determine any DNS servers. Reverse DNS is disabled. Try using --system-dns or specify valid servers with --dns-servers
Nmap scan report for 10.0.4.123
Host is up (0.00082s latency).
PORT STATE SERVICE
12489/tcp open unknown
MAC Address: 00:0C:29:4A:E1:B0 (VMware)
Nmap done: 1 IP address (1 host up) scanned in 0.17 seconds
Code: Select all
nmap 10.0.4.119 -p 12489
Starting Nmap 5.21 ( http://nmap.org ) at 2016-08-18 00:13 IRDT
mass_dns: warning: Unable to determine any DNS servers. Reverse DNS is disabled. Try using --system-dns or specify valid servers with --dns-servers
Nmap scan report for 10.0.4.119
Host is up (0.0015s latency).
PORT STATE SERVICE
12489/tcp open unknown
MAC Address: 00:50:56:86:28:0D (VMware)
Nmap done: 1 IP address (1 host up) scanned in 0.17 seconds
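To compare the three scans side by side, the port state can be pulled out of the nmap text output. This is a small Python sketch, assuming the plain-text output format shown above; the function name is made up for illustration, and the sample text is trimmed from the scans in this post.

```python
import re


def port_states(nmap_output):
    """Map (host, port) to the state nmap reported, e.g. "open" or "filtered".

    Assumes nmap's normal text output, where each host section starts with
    "Nmap scan report for <host>" and port lines look like
    "12489/tcp open unknown".
    """
    states = {}
    host = None
    for line in nmap_output.splitlines():
        line = line.strip()
        report = re.match(r"Nmap scan report for (\S+)", line)
        if report:
            host = report.group(1)
            continue
        port_line = re.match(r"(\d+)/(tcp|udp)\s+(\S+)", line)
        if port_line and host is not None:
            states[(host, int(port_line.group(1)))] = port_line.group(3)
    return states


# Trimmed copies of the three scans posted above.
SAMPLE = """\
Nmap scan report for 10.4.1.144
Host is up (0.00085s latency).
PORT STATE SERVICE
12489/tcp filtered unknown
Nmap scan report for 10.0.4.123
Host is up (0.00082s latency).
PORT STATE SERVICE
12489/tcp open unknown
Nmap scan report for 10.0.4.119
Host is up (0.0015s latency).
PORT STATE SERVICE
12489/tcp open unknown
"""

for (host, port), state in port_states(SAMPLE).items():
    print(f"{host} port {port}: {state}")
```

In nmap's terms, "filtered" means the probes got no usable answer, which is usually the sign of a firewall or packet filter in between. Only 10.4.1.144 is reported that way here, and it is also the only scan without a MAC address line, which suggests that host sits on a different network segment from the Nagios server.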
Re: how can solve Socket timeout problem
For the record, which machine are we troubleshooting? It looks like you posted three separate nmap scans.
Former Nagios Employee