CHECK_NRPE STATE CRITICAL: Socket timeout after 30 seconds.

nyakamaw · Post by **nyakamaw** » Fri Jul 30, 2021 3:23 am

nsclient.ini

Hi all,

Can someone assist me?
After migrating my Nagios XI from CentOS 6.5 to a new server running CentOS 8.0,
my NRPE communication to hosts is giving me an error, although nagios.service is running fine:
● nagios.service - Nagios
Loaded: loaded (/etc/systemd/system/nagios.service; enabled; vendor preset: disabled)
Active: active (running) since Fri 2021-07-30 08:55:05 SAST; 55min ago
Main PID: 989 (nagios)
Tasks: 14 (limit: 100688)
Memory: 69.7M
CGroup: /system.slice/nagios.service
├─ 989 /usr/local/nagios/bin/nagios /usr/local/nagios/etc/nagios.cfg
├─ 999 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
├─ 1000 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
├─ 1001 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
├─ 1002 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
├─ 1003 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
├─ 1004 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
├─ 1039 /usr/local/nagios/bin/nagios /usr/local/nagios/etc/nagios.cfg
├─59149 /usr/local/nagios/libexec/check_nrpe -H 10.10.21.72 -2 -t 30 -c check_cpu -a time=5m time=1m warn=load gt 80 crit=load gt 90
├─59154 /usr/local/nagios/libexec/check_nrpe -H 10.10.21.72 -2 -t 30 -c check_drivesize -a drive=C: drive=D: warn=used_pct gt 80 crit=used_pct gt 90
├─59200 /usr/local/nagios/libexec/check_nrpe -H 10.10.21.15 -2 -t 30 -c check_cpu -a time=5m time=1m warn=load gt 80 crit=load gt 90
├─59244 /usr/bin/snmpget -Le -t 3 -r 5 -m -v 1 -c 10.207.128.1:161 1.3.6.1.2.1.33.1.4.1.0
├─59300 /usr/local/nagios/libexec/check_snmp -H 10.207.128.1 -o 1.3.6.1.2.1.33.1.2.2.0 -C public -P 1 -l Seconds on Battery power
└─59301 /usr/bin/snmpget -Le -t 3 -r 5 -m -v 1 -c 10.207.128.1:161 1.3.6.1.2.1.33.1.2.2.0

Jul 30 09:50:58 nagios3cooper.tiremes.contiwan.com nagios[989]: wproc: host=ELTM09ENGMS06.tiremes.contiwan.com; service=Memory Usage; contact=cooperg
Jul 30 09:50:58 nagios3cooper.tiremes.contiwan.com nagios[989]: wproc: early_timeout=0; exited_ok=1; wait_status=512; error_code=0;
Jul 30 09:50:58 nagios3cooper.tiremes.contiwan.com nagios[989]: wproc: stderr line 01: execvp(/usr/local/nagios/libexec/notify-service_email, ...) failed. errno is 2: No such file or directory
Jul 30 09:50:58 nagios3cooper.tiremes.contiwan.com nagios[989]: wproc: early_timeout=0; exited_ok=1; wait_status=512; error_code=0;
Jul 30 09:50:58 nagios3cooper.tiremes.contiwan.com nagios[989]: wproc: stderr line 01: execvp(/usr/local/nagios/libexec/notify-service_email, ...) failed. errno is 2: No such file or directory
Jul 30 09:50:58 nagios3cooper.tiremes.contiwan.com nagios[1002]: job 2191 (pid=59421): read() returned error 11
Jul 30 09:50:58 nagios3cooper.tiremes.contiwan.com nagios[1002]: job 2191 (pid=59421): read() returned error 11
Jul 30 09:50:58 nagios3cooper.tiremes.contiwan.com nagios[1004]: job 2192 (pid=59428): read() returned error 11
Jul 30 09:50:59 nagios3cooper.tiremes.contiwan.com nagios[989]: HOST ALERT: ELTMVM940.tiremes.contiwan.com;UP;HARD;1;OK - 10.10.43.70: rta 0.084ms, lost 0%
Jul 30 09:50:59 nagios3cooper.tiremes.contiwan.com check_nrpe[59462]: Error: (!log_opts) Could not complete SSL handshake with 10.10.15.108: dh key too small

But there seems to be a problem with NRPE; see the output below.

[root@nagios3cooper nagios-plugins-release-2.2.1]# /usr/local/nagios/libexec/check_nrpe -H 127.0.0.1 -c check_load
OK - load average per CPU: 0.07, 0.06, 0.03|load1=0.075;0.150;0.300;0; load5=0.055;0.100;0.250;0; load15=0.030;0.050;0.200;0;
[root@nagios3cooper nagios-plugins-release-2.2.1]# /usr/local/nagios/libexec/check_nrpe -H 10.10.21.15 -c check_load
CHECK_NRPE STATE CRITICAL: Socket timeout after 10 seconds.
[root@nagios3cooper nagios-plugins-release-2.2.1]# /usr/local/nagios/libexec/check_nrpe -H 10.10.21.15 -2 -t 30 -c check_cpu -a time=5m time=1m "warn=load gt 80" "crit=load gt 90"
CHECK_NRPE STATE CRITICAL: Socket timeout after 30 seconds.

I do not know whether this is related to the SSL handshake error logged by nagios.service.
I changed the allowed hosts setting in the NSClient++.ini file to the IP of my new server, but I still cannot connect to the host.

I have attached the NSClient++.ini file.

gsmith · Post by **gsmith** » Fri Jul 30, 2021 4:03 pm

Hi,

On the Nagios server run:

nmap -p 5666 <ip of remote nrpe machine>

If that fails, check the firewall settings on the remote NRPE machine.
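If nmap is not installed on the Nagios server, a rough equivalent of the port test above can be sketched with bash's built-in /dev/tcp redirection. The function name check_port and its defaults are made up for illustration, and it assumes bash and the coreutils timeout command are available:

```shell
# Hypothetical helper (not part of Nagios or NRPE): report whether a TCP
# port accepts connections. Arguments: host, port (default 5666, the NRPE
# port used in the nmap command above), wait time in seconds (default 5).
check_port() {
    local host="$1" port="${2:-5666}" wait="${3:-5}"
    # bash opens a TCP connection when redirecting to /dev/tcp/HOST/PORT;
    # timeout aborts the attempt if a firewall silently drops the packets.
    if timeout "$wait" bash -c "exec 3<>/dev/tcp/$host/$port" 2>/dev/null; then
        echo "open"
    else
        echo "closed or filtered"
    fi
}
```

For example, check_port 10.10.21.15 5666 would test the host from the original post. This only shows that the port is reachable; it says nothing about whether the SSL handshake or allowed hosts check will succeed afterwards.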

On the remote nrpe machine:

vi /etc/nagios/nrpe.cfg

Verify that the IP of the Nagios server is listed as a value for the allowed_hosts property.
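The same check can be scripted instead of reading the file by eye. This is only a sketch: host_allowed is a hypothetical helper, it compares exact entries only (CIDR ranges and hostnames in the list are not resolved), and accepting the spelling "allowed hosts" with a space is an assumption made for NSClient++ style ini files:

```shell
# Hypothetical helper: succeed (exit 0) if the given IP appears as an
# exact entry in the allowed_hosts line of an NRPE-style config file.
# Arguments: IP address, config path (default /etc/nagios/nrpe.cfg).
host_allowed() {
    local ip="$1" cfg="${2:-/etc/nagios/nrpe.cfg}"
    # Take the last uncommented allowed_hosts line, drop the key,
    # split the comma-separated value into one entry per line,
    # trim surrounding whitespace, then look for an exact match.
    grep -E '^[[:space:]]*allowed[_ ]hosts[[:space:]]*=' "$cfg" \
        | tail -n 1 \
        | cut -d= -f2- \
        | tr ',' '\n' \
        | sed 's/^[[:space:]]*//; s/[[:space:]]*$//' \
        | grep -Fxq -- "$ip"
}
```

Usage would look like: host_allowed <ip of Nagios server> /etc/nagios/nrpe.cfg && echo listed || echo "not listed". Remember that the agent normally has to be restarted before a changed allowed_hosts value takes effect.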

Please let me know how things work out.
