Another NRPE timeout puzzle
Posted: Wed May 10, 2017 12:40 pm
The Nagios installation in question is 4.2.4 core.
The OS it's on is CentOS 6.7, it's nrpe version is nrpe-2.15-7.el6.x86_64
The client server is CentOS 6.6, it's nrpe version is nrpe-2.15-7.el6.x86_64
I've surfed the solutions I could find but I'm still running into a problem with timeouts on an iperf plugin.
SET UP
NAGIOS Server
From /usr/local/nagios/etc/objects/commands.cfg
From /usr/local/nagios/etc/objects/hosts/resource.cfg
NRPE Client
From the nrpe client (and the server that runs the iperf query)'
TESTING:
NAGIOS Server
The command nominally take about 50 seconds to set up and run.
FAIL: The command times out in the GUI.
FAIL: The command times out when run from the CLI.
SUCCESS: The command works from the CLI when the timout is manually applied;
From the NRPE Client
One thread suggested checking out the command from the config, but I'm not seeing anything "telling" with this, other than the time out seems to have been ignored.

So I see the timeout is set to 60 in the nagios and nrpe configs, yet NRPE is complaining after 10 seconds. I have it explicitly set in the command definition as well.
I'm hoping that I've spaced out and forgotten something or have a heinous typo somewhere. Any help is appreciated.
.
The OS it's on is CentOS 6.7, it's nrpe version is nrpe-2.15-7.el6.x86_64
The client server is CentOS 6.6, it's nrpe version is nrpe-2.15-7.el6.x86_64
I've surfed the solutions I could find but I'm still running into a problem with timeouts on an iperf plugin.
SET UP
NAGIOS Server
Code: Select all
cat /usr/local/nagios/etc/nagios.cfg | grep service_check_timeout=
service_check_timeout=60Code: Select all
cat /etc/nagios/nrpe.cfg | grep command_timeout=
command_timeout=60Code: Select all
define command {
command_name check_nrpe_iperf
command_line $USER1$/check_nrpe_iperf -H $HOSTADDRESS$ -c $ARG1$ -t 60
}Code: Select all
define service{
use generic-service,nagiosgraph
host_name resource
service_description Network stats
check_command check_nrpe!check_nrpe_iperfFrom the nrpe client (and the server that runs the iperf query)'
Code: Select all
/etc/nagios/nrpe.cfg
command[check_nrpe_iperf]=/usr/local/bin/check_iperf3.pl 10.6.117.4 50 60NAGIOS Server
The command nominally take about 50 seconds to set up and run.
FAIL: The command times out in the GUI.
FAIL: The command times out when run from the CLI.
Code: Select all
/usr/local/nagios/libexec/check_nrpe -H resource -c check_nrpe_iperf
CHECK_NRPE: Socket timeout after 10 seconds.Code: Select all
/usr/local/nagios/libexec/check_nrpe -H resource -c check_nrpe_iperf -t 90
Critical: iperf speed of '10.6.117.4' is 9,01 and [mincrit:50]|Bandwidth=9,01MB
<I'm snipping the rest of the data>Code: Select all
SUCCESS: [root@resource ~]# check_iperf3.pl 10.6.117.4 1 1
OK: iperf returns 2,29MB |Bandwidth=2,29MB
<I'm snipping this too>
So I see the timeout is set to 60 in the nagios and nrpe configs, yet NRPE is complaining after 10 seconds. I have it explicitly set in the command definition as well.
I'm hoping that I've spaced out and forgotten something or have a heinous typo somewhere. Any help is appreciated.
.