CHECK_NRPE STATE CRITICAL: Socket timeout after 30 seconds.
Posted: Fri Jul 30, 2021 3:23 am
Hi All
Can someone assist me.
After Migating my Nagios XI fro CentOS6.5 to new Server which is running CentOS8.0.
My NRPE commincation to host is giving me an error although nagios.service is running fine:
● nagios.service - Nagios
Loaded: loaded (/etc/systemd/system/nagios.service; enabled; vendor preset: disabled)
Active: active (running) since Fri 2021-07-30 08:55:05 SAST; 55min ago
Main PID: 989 (nagios)
Tasks: 14 (limit: 100688)
Memory: 69.7M
CGroup: /system.slice/nagios.service
├─ 989 /usr/local/nagios/bin/nagios /usr/local/nagios/etc/nagios.cfg
├─ 999 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
├─ 1000 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
├─ 1001 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
├─ 1002 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
├─ 1003 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
├─ 1004 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
├─ 1039 /usr/local/nagios/bin/nagios /usr/local/nagios/etc/nagios.cfg
├─59149 /usr/local/nagios/libexec/check_nrpe -H 10.10.21.72 -2 -t 30 -c check_cpu -a time=5m time=1m warn=load gt 80 crit=load gt 90
├─59154 /usr/local/nagios/libexec/check_nrpe -H 10.10.21.72 -2 -t 30 -c check_drivesize -a drive=C: drive=D: warn=used_pct gt 80 crit=used_pct gt 90
├─59200 /usr/local/nagios/libexec/check_nrpe -H 10.10.21.15 -2 -t 30 -c check_cpu -a time=5m time=1m warn=load gt 80 crit=load gt 90
├─59244 /usr/bin/snmpget -Le -t 3 -r 5 -m -v 1 -c 10.207.128.1:161 1.3.6.1.2.1.33.1.4.1.0
├─59300 /usr/local/nagios/libexec/check_snmp -H 10.207.128.1 -o 1.3.6.1.2.1.33.1.2.2.0 -C public -P 1 -l Seconds on Battery power
└─59301 /usr/bin/snmpget -Le -t 3 -r 5 -m -v 1 -c 10.207.128.1:161 1.3.6.1.2.1.33.1.2.2.0
Jul 30 09:50:58 nagios3cooper.tiremes.contiwan.com nagios[989]: wproc: host=ELTM09ENGMS06.tiremes.contiwan.com; service=Memory Usage; contact=cooperg
Jul 30 09:50:58 nagios3cooper.tiremes.contiwan.com nagios[989]: wproc: early_timeout=0; exited_ok=1; wait_status=512; error_code=0;
Jul 30 09:50:58 nagios3cooper.tiremes.contiwan.com nagios[989]: wproc: stderr line 01: execvp(/usr/local/nagios/libexec/notify-service_email, ...) failed. errno is 2: No such file or directory
Jul 30 09:50:58 nagios3cooper.tiremes.contiwan.com nagios[989]: wproc: early_timeout=0; exited_ok=1; wait_status=512; error_code=0;
Jul 30 09:50:58 nagios3cooper.tiremes.contiwan.com nagios[989]: wproc: stderr line 01: execvp(/usr/local/nagios/libexec/notify-service_email, ...) failed. errno is 2: No such file or directory
Jul 30 09:50:58 nagios3cooper.tiremes.contiwan.com nagios[1002]: job 2191 (pid=59421): read() returned error 11
Jul 30 09:50:58 nagios3cooper.tiremes.contiwan.com nagios[1002]: job 2191 (pid=59421): read() returned error 11
Jul 30 09:50:58 nagios3cooper.tiremes.contiwan.com nagios[1004]: job 2192 (pid=59428): read() returned error 11
Jul 30 09:50:59 nagios3cooper.tiremes.contiwan.com nagios[989]: HOST ALERT: ELTMVM940.tiremes.contiwan.com;UP;HARD;1;OK - 10.10.43.70: rta 0.084ms, lost 0%
Jul 30 09:50:59 nagios3cooper.tiremes.contiwan.com check_nrpe[59462]: Error: (!log_opts) Could not complete SSL handshake with 10.10.15.108: dh key too small
But there seem to be a problem with NRPE, see below output.
[root@nagios3cooper nagios-plugins-release-2.2.1]# /usr/local/nagios/libexec/check_nrpe -H 127.0.0.1 -c check_load
OK - load average per CPU: 0.07, 0.06, 0.03|load1=0.075;0.150;0.300;0; load5=0.055;0.100;0.250;0; load15=0.030;0.050;0.200;0;
[root@nagios3cooper nagios-plugins-release-2.2.1]# /usr/local/nagios/libexec/check_nrpe -H 10.10.21.15 -c check_load
CHECK_NRPE STATE CRITICAL: Socket timeout after 10 seconds.
[root@nagios3cooper nagios-plugins-release-2.2.1]# /usr/local/nagios/libexec/check_nrpe -H 10.10.21.15 -2 -t 30 -c check_cpu -a time=5m time=1m "warn=load gt 80" "crit=load gt 90"
CHECK_NRPE STATE CRITICAL: Socket timeout after 30 seconds.
I do not know whether is it related to the SSL handshake error on the nagios.service.
I did change the NSClient++.ini file allowed host to the ip of my new server but still cannot connect to the host.
I have attached the NSClient++.ini file.
Can someone assist me.
After Migating my Nagios XI fro CentOS6.5 to new Server which is running CentOS8.0.
My NRPE commincation to host is giving me an error although nagios.service is running fine:
● nagios.service - Nagios
Loaded: loaded (/etc/systemd/system/nagios.service; enabled; vendor preset: disabled)
Active: active (running) since Fri 2021-07-30 08:55:05 SAST; 55min ago
Main PID: 989 (nagios)
Tasks: 14 (limit: 100688)
Memory: 69.7M
CGroup: /system.slice/nagios.service
├─ 989 /usr/local/nagios/bin/nagios /usr/local/nagios/etc/nagios.cfg
├─ 999 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
├─ 1000 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
├─ 1001 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
├─ 1002 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
├─ 1003 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
├─ 1004 /usr/local/nagios/bin/nagios --worker /usr/local/nagios/var/rw/nagios.qh
├─ 1039 /usr/local/nagios/bin/nagios /usr/local/nagios/etc/nagios.cfg
├─59149 /usr/local/nagios/libexec/check_nrpe -H 10.10.21.72 -2 -t 30 -c check_cpu -a time=5m time=1m warn=load gt 80 crit=load gt 90
├─59154 /usr/local/nagios/libexec/check_nrpe -H 10.10.21.72 -2 -t 30 -c check_drivesize -a drive=C: drive=D: warn=used_pct gt 80 crit=used_pct gt 90
├─59200 /usr/local/nagios/libexec/check_nrpe -H 10.10.21.15 -2 -t 30 -c check_cpu -a time=5m time=1m warn=load gt 80 crit=load gt 90
├─59244 /usr/bin/snmpget -Le -t 3 -r 5 -m -v 1 -c 10.207.128.1:161 1.3.6.1.2.1.33.1.4.1.0
├─59300 /usr/local/nagios/libexec/check_snmp -H 10.207.128.1 -o 1.3.6.1.2.1.33.1.2.2.0 -C public -P 1 -l Seconds on Battery power
└─59301 /usr/bin/snmpget -Le -t 3 -r 5 -m -v 1 -c 10.207.128.1:161 1.3.6.1.2.1.33.1.2.2.0
Jul 30 09:50:58 nagios3cooper.tiremes.contiwan.com nagios[989]: wproc: host=ELTM09ENGMS06.tiremes.contiwan.com; service=Memory Usage; contact=cooperg
Jul 30 09:50:58 nagios3cooper.tiremes.contiwan.com nagios[989]: wproc: early_timeout=0; exited_ok=1; wait_status=512; error_code=0;
Jul 30 09:50:58 nagios3cooper.tiremes.contiwan.com nagios[989]: wproc: stderr line 01: execvp(/usr/local/nagios/libexec/notify-service_email, ...) failed. errno is 2: No such file or directory
Jul 30 09:50:58 nagios3cooper.tiremes.contiwan.com nagios[989]: wproc: early_timeout=0; exited_ok=1; wait_status=512; error_code=0;
Jul 30 09:50:58 nagios3cooper.tiremes.contiwan.com nagios[989]: wproc: stderr line 01: execvp(/usr/local/nagios/libexec/notify-service_email, ...) failed. errno is 2: No such file or directory
Jul 30 09:50:58 nagios3cooper.tiremes.contiwan.com nagios[1002]: job 2191 (pid=59421): read() returned error 11
Jul 30 09:50:58 nagios3cooper.tiremes.contiwan.com nagios[1002]: job 2191 (pid=59421): read() returned error 11
Jul 30 09:50:58 nagios3cooper.tiremes.contiwan.com nagios[1004]: job 2192 (pid=59428): read() returned error 11
Jul 30 09:50:59 nagios3cooper.tiremes.contiwan.com nagios[989]: HOST ALERT: ELTMVM940.tiremes.contiwan.com;UP;HARD;1;OK - 10.10.43.70: rta 0.084ms, lost 0%
Jul 30 09:50:59 nagios3cooper.tiremes.contiwan.com check_nrpe[59462]: Error: (!log_opts) Could not complete SSL handshake with 10.10.15.108: dh key too small
But there seem to be a problem with NRPE, see below output.
[root@nagios3cooper nagios-plugins-release-2.2.1]# /usr/local/nagios/libexec/check_nrpe -H 127.0.0.1 -c check_load
OK - load average per CPU: 0.07, 0.06, 0.03|load1=0.075;0.150;0.300;0; load5=0.055;0.100;0.250;0; load15=0.030;0.050;0.200;0;
[root@nagios3cooper nagios-plugins-release-2.2.1]# /usr/local/nagios/libexec/check_nrpe -H 10.10.21.15 -c check_load
CHECK_NRPE STATE CRITICAL: Socket timeout after 10 seconds.
[root@nagios3cooper nagios-plugins-release-2.2.1]# /usr/local/nagios/libexec/check_nrpe -H 10.10.21.15 -2 -t 30 -c check_cpu -a time=5m time=1m "warn=load gt 80" "crit=load gt 90"
CHECK_NRPE STATE CRITICAL: Socket timeout after 30 seconds.
I do not know whether is it related to the SSL handshake error on the nagios.service.
I did change the NSClient++.ini file allowed host to the ip of my new server but still cannot connect to the host.
I have attached the NSClient++.ini file.