CHECK_NRPE STATE CRITICAL: Socket timeout after 30 seconds.

nyakamaw
Posts: 1
Joined: Sun Apr 05, 2020 6:19 am

Post by nyakamaw »

Hi All

Can someone assist me?
I migrated my Nagios XI from CentOS 6.5 to a new server running CentOS 8.0.

Since then, NRPE communication to my hosts is giving me an error, although nagios.service is running fine:
nagios.service - Nagios
Loaded: loaded (/etc/systemd/system/nagios.service; enabled; vendor preset: disabled)
Active: active (running) since Fri 2021-07-30 08:55:05 SAST; 55min ago
Main PID: 989 (nagios)
Tasks: 14 (limit: 100688)
Memory: 69.7M
CGroup: /system.slice/nagios.service
├─ 989 /usr/local/nagios/bin/nagios /usr/local/nagios/etc/nagios.cfg
├─ 999 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
├─ 1000 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
├─ 1001 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
├─ 1002 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
├─ 1003 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
├─ 1004 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
├─ 1039 /usr/local/nagios/bin/nagios /usr/local/nagios/etc/nagios.cfg
├─59149 /usr/local/nagios/libexec/check_nrpe -H 10.10.21.72 -2 -t 30 -c check_cpu -a time=5m time=1m warn=load gt 80 crit=load gt 90
├─59154 /usr/local/nagios/libexec/check_nrpe -H 10.10.21.72 -2 -t 30 -c check_drivesize -a drive=C: drive=D: warn=used_pct gt 80 crit=used_pct gt 90
├─59200 /usr/local/nagios/libexec/check_nrpe -H 10.10.21.15 -2 -t 30 -c check_cpu -a time=5m time=1m warn=load gt 80 crit=load gt 90
├─59244 /usr/bin/snmpget -Le -t 3 -r 5 -m -v 1 -c 10.207.128.1:161 1.3.6.1.2.1.33.1.4.1.0
├─59300 /usr/local/nagios/libexec/check_snmp -H 10.207.128.1 -o 1.3.6.1.2.1.33.1.2.2.0 -C public -P 1 -l Seconds on Battery power
└─59301 /usr/bin/snmpget -Le -t 3 -r 5 -m -v 1 -c 10.207.128.1:161 1.3.6.1.2.1.33.1.2.2.0

Jul 30 09:50:58 nagios3cooper.tiremes.contiwan.com nagios[989]: wproc: host=ELTM09ENGMS06.tiremes.contiwan.com; service=Memory Usage; contact=cooperg
Jul 30 09:50:58 nagios3cooper.tiremes.contiwan.com nagios[989]: wproc: early_timeout=0; exited_ok=1; wait_status=512; error_code=0;
Jul 30 09:50:58 nagios3cooper.tiremes.contiwan.com nagios[989]: wproc: stderr line 01: execvp(/usr/local/nagios/libexec/notify-service_email, ...) failed. errno is 2: No such file or directory
Jul 30 09:50:58 nagios3cooper.tiremes.contiwan.com nagios[989]: wproc: early_timeout=0; exited_ok=1; wait_status=512; error_code=0;
Jul 30 09:50:58 nagios3cooper.tiremes.contiwan.com nagios[989]: wproc: stderr line 01: execvp(/usr/local/nagios/libexec/notify-service_email, ...) failed. errno is 2: No such file or directory
Jul 30 09:50:58 nagios3cooper.tiremes.contiwan.com nagios[1002]: job 2191 (pid=59421): read() returned error 11
Jul 30 09:50:58 nagios3cooper.tiremes.contiwan.com nagios[1002]: job 2191 (pid=59421): read() returned error 11
Jul 30 09:50:58 nagios3cooper.tiremes.contiwan.com nagios[1004]: job 2192 (pid=59428): read() returned error 11
Jul 30 09:50:59 nagios3cooper.tiremes.contiwan.com nagios[989]: HOST ALERT: ELTMVM940.tiremes.contiwan.com;UP;HARD;1;OK - 10.10.43.70: rta 0.084ms, lost 0%
Jul 30 09:50:59 nagios3cooper.tiremes.contiwan.com check_nrpe[59462]: Error: (!log_opts) Could not complete SSL handshake with 10.10.15.108: dh key too small


But there seems to be a problem with NRPE; see the output below.

[root@nagios3cooper nagios-plugins-release-2.2.1]# /usr/local/nagios/libexec/check_nrpe -H 127.0.0.1 -c check_load
OK - load average per CPU: 0.07, 0.06, 0.03|load1=0.075;0.150;0.300;0; load5=0.055;0.100;0.250;0; load15=0.030;0.050;0.200;0;
[root@nagios3cooper nagios-plugins-release-2.2.1]# /usr/local/nagios/libexec/check_nrpe -H 10.10.21.15 -c check_load
CHECK_NRPE STATE CRITICAL: Socket timeout after 10 seconds.
[root@nagios3cooper nagios-plugins-release-2.2.1]# /usr/local/nagios/libexec/check_nrpe -H 10.10.21.15 -2 -t 30 -c check_cpu -a time=5m time=1m "warn=load gt 80" "crit=load gt 90"
CHECK_NRPE STATE CRITICAL: Socket timeout after 30 seconds.


I do not know whether this is related to the SSL handshake error in the nagios.service log.
I changed the allowed hosts setting in the NSClient++.ini file to the IP of my new server, but I still cannot connect to the host.

I have attached the NSClient++.ini file.
gsmith
Posts: 1253
Joined: Tue Mar 02, 2021 11:15 am

Re: CHECK_NRPE STATE CRITICAL: Socket timeout after 30 secon

Post by gsmith »

Hi

On the Nagios server run:

nmap -p 5666 <ip of remote nrpe machine>
If nmap does not report the port as open, check the firewall settings on the remote NRPE machine.
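
If nmap is not installed on the Nagios server, the same reachability test can be approximated with bash alone. The following is a minimal sketch, assuming bash was built with /dev/tcp support and that the coreutils timeout command is available; check_port is a hypothetical helper name, not part of Nagios or NRPE.

```shell
#!/usr/bin/env bash
# check_port HOST PORT
# Hypothetical helper: returns 0 if a TCP connection to HOST:PORT can be
# opened within 5 seconds, non-zero otherwise (refused, filtered, timed out).
# Assumes bash with /dev/tcp support and the coreutils "timeout" command.
check_port() {
    local host="$1" port="$2"
    # Run the connection attempt in a child bash so a hang cannot block the caller.
    timeout 5 bash -c 'exec 3<>"/dev/tcp/$1/$2"' _ "$host" "$port" 2>/dev/null
}

# Usage (5666 is the default NRPE port; the IP is the host from the first post):
#   check_port 10.10.21.15 5666 && echo "reachable" || echo "not reachable"
```

This only shows whether the TCP port accepts connections. It says nothing about the SSL handshake or the allowed hosts setting, so a successful result does not by itself mean check_nrpe will work.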

On the remote nrpe machine:

vi /etc/nagios/nrpe.cfg
Verify that the IP of the Nagios server is listed as a value for the allowed_hosts property.
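
The allowed_hosts check can also be scripted instead of read by eye. This is a rough sketch, assuming a comma-separated allowed_hosts line in nrpe.cfg; ip_in_allowed_hosts is a hypothetical helper name and 192.0.2.10 is a placeholder for the Nagios server's IP. It only matches exact entries, so CIDR ranges and hostnames in allowed_hosts are not evaluated.

```shell
#!/usr/bin/env bash
# ip_in_allowed_hosts IP CONFIG_FILE
# Hypothetical helper: returns 0 if IP appears as an exact entry in the
# allowed_hosts line of CONFIG_FILE, non-zero otherwise.
# Limitation: CIDR ranges and hostnames are not expanded or resolved.
ip_in_allowed_hosts() {
    local ip="$1" cfg="$2"
    grep -E '^[[:space:]]*allowed_hosts[[:space:]]*=' "$cfg" \
        | cut -d= -f2- \
        | tr -d ' \t' \
        | tr ',' '\n' \
        | grep -Fxq "$ip"
}

# Usage (192.0.2.10 is a placeholder for the Nagios server's IP):
#   ip_in_allowed_hosts 192.0.2.10 /etc/nagios/nrpe.cfg && echo "listed" || echo "not listed"
```

If the IP turns out to be missing and gets added, the NRPE service on the remote machine generally has to be restarted before the change takes effect.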

Please let me know how things work out.