For one server nrpe-2.14 works, nrpe-3.0 does not
Posted: Thu Aug 25, 2016 2:16 am
Hi.
I am using nagios-4.2.0 as the Nagios Core Server, it runs CentOS6.X - all latest patches
Most servers I can monitor but one, all monitored servers run the same CentOS6.X (all patches) using nrpe 3.0.
If I downgrade that one server to nrpe 2.14 (with ssl), it works, happily.
If I upgrade that one server to nrpe 3.0 (with ssl), it does NOT work.
The Nagios server sits on a local network running NRPE 3.0 behind a firewall.
The Nagios Server happily talks to all other servers on various networks on port 5666 - I know the port is open on ALL machines for access from the Nagios Core Server.
I, too, know the port 5666 is open on the machine that has problem using nrpe 3.0 - I can telnet to it on port 5666, but it than bails out when using 3.0 but does not bail out when using 2.4
It would not work at all if the port is NOT open.
If I downgrade that machine NRPE_CLIENT to nrpe 2.14 it happily works, but the server logs are full with
Aug 25 16:43:47 CORESERVER check_nrpe: Remote NRPE_CLIENT does not support Version 3 Packets
Aug 25 16:43:49 CORESERVER check_nrpe: Remote NRPE_CLIENT accepted a Version 2 Packet
and the logs of that NRPE_CLIENT report
Aug 25 16:45:05 NRPE_CLIENT nrpe[23013]: Error: Request packet type/version was invalid!
Aug 25 16:45:05 NRPE_CLIENT nrpe[23013]: Client request was invalid, bailing out...
I get when testing it:
[root@NAGIOSCORE /var/log] #>/usr/local/nagios/libexec/check_nrpe -H NRPE_CLIENT -c check_procs
PROCS OK : count 1706 |count=706;2000;2100 runqueue=1 blocked=0 running=1 new=0.00
Now if I upgrade the NRPE_CLIENT to nrpe 3.0, the logs are filled with
Aug 25 16:56:15 NAGIOSCORE check_nrpe: Error: Could not complete SSL handshake with NRPE_CLIENT: rc=0 SSL-error=5
and the logs of that NRPE_CLIENT report
Aug 25 16:55:17 NRPE_CLIENT nrpe[23815]: Host NAGIOSCORE is not allowed to talk to us!
What I see on the firewall is a "deny tcp (no connection) from NAGIOSCORE to NRPE_CLIENT flags on interface", which clearly means that the SSL handshake flow attempting to go through it does not seem to be following the correct TCP session flow (SYN, SYN ACK etc.) - It sees the second on but not the first one.
But my issue is that this does NOT happen when using nrpe-2.4
So what's makes this SSL thingo so different in 3.0 with respect to 2.4?
Does anybody have an idea how I can fix this?
thanks
Jobst
I am using nagios-4.2.0 as the Nagios Core Server, it runs CentOS6.X - all latest patches
Most servers I can monitor but one, all monitored servers run the same CentOS6.X (all patches) using nrpe 3.0.
If I downgrade that one server to nrpe 2.14 (with ssl), it works, happily.
If I upgrade that one server to nrpe 3.0 (with ssl), it does NOT work.
The Nagios server sits on a local network running NRPE 3.0 behind a firewall.
The Nagios Server happily talks to all other servers on various networks on port 5666 - I know the port is open on ALL machines for access from the Nagios Core Server.
I, too, know the port 5666 is open on the machine that has problem using nrpe 3.0 - I can telnet to it on port 5666, but it than bails out when using 3.0 but does not bail out when using 2.4
It would not work at all if the port is NOT open.
If I downgrade that machine NRPE_CLIENT to nrpe 2.14 it happily works, but the server logs are full with
Aug 25 16:43:47 CORESERVER check_nrpe: Remote NRPE_CLIENT does not support Version 3 Packets
Aug 25 16:43:49 CORESERVER check_nrpe: Remote NRPE_CLIENT accepted a Version 2 Packet
and the logs of that NRPE_CLIENT report
Aug 25 16:45:05 NRPE_CLIENT nrpe[23013]: Error: Request packet type/version was invalid!
Aug 25 16:45:05 NRPE_CLIENT nrpe[23013]: Client request was invalid, bailing out...
I get when testing it:
[root@NAGIOSCORE /var/log] #>/usr/local/nagios/libexec/check_nrpe -H NRPE_CLIENT -c check_procs
PROCS OK : count 1706 |count=706;2000;2100 runqueue=1 blocked=0 running=1 new=0.00
Now if I upgrade the NRPE_CLIENT to nrpe 3.0, the logs are filled with
Aug 25 16:56:15 NAGIOSCORE check_nrpe: Error: Could not complete SSL handshake with NRPE_CLIENT: rc=0 SSL-error=5
and the logs of that NRPE_CLIENT report
Aug 25 16:55:17 NRPE_CLIENT nrpe[23815]: Host NAGIOSCORE is not allowed to talk to us!
What I see on the firewall is a "deny tcp (no connection) from NAGIOSCORE to NRPE_CLIENT flags on interface", which clearly means that the SSL handshake flow attempting to go through it does not seem to be following the correct TCP session flow (SYN, SYN ACK etc.) - It sees the second on but not the first one.
But my issue is that this does NOT happen when using nrpe-2.4
So what's makes this SSL thingo so different in 3.0 with respect to 2.4?
Does anybody have an idea how I can fix this?
thanks
Jobst