Page 1 of 2
since upgrading to 5.5.1 snmp checks timeout
Posted: Thu Jul 19, 2018 8:09 am
by rzietlow
Hi Support,
since upgrading to 5.5.1 we have some hosts that we get multiple hosts that are getting a general timeout. Some will just come and go quickly, others will stay in a "stable" state, but once we go in and force a check they go green.
Is there something else we can do other than recompile the kernel to get these to not have timeouts like that, we fear we will miss something critical because nagios isn't monitoring correctly/getting timeouts.
Re: since upgrading to 5.5.1 snmp checks timeout
Posted: Thu Jul 19, 2018 8:17 am
by scottwilkerson
Can you share the commands for some of these SNMP checks so we can determine which plugins you are experiencing it with?
Re: since upgrading to 5.5.1 snmp checks timeout
Posted: Thu Jul 19, 2018 9:17 am
by rzietlow
all tend to be "$USER1$/check_snmp_storage.pl -H $HOSTADDRESS$ $ARG1$" with our supplied args.
-C $COMMUNITY --v2c -m ^C: -w 85 -c 90 -f in this instance.
Re: since upgrading to 5.5.1 snmp checks timeout
Posted: Thu Jul 19, 2018 9:34 am
by JGCG
Sorry to jump in, but this looks to be the same issue we are having:
https://support.nagios.com/forum/viewto ... 43#p257243
Re: since upgrading to 5.5.1 snmp checks timeout
Posted: Thu Jul 19, 2018 9:52 am
by scottwilkerson
Out of curiosity, do you know what version you upgraded from?
Also, do you know if you have every needed to edit this plugin in previous versions?
Re: since upgrading to 5.5.1 snmp checks timeout
Posted: Thu Jul 19, 2018 12:02 pm
by rzietlow
I have not had to edit it in the 2-3 years I've been more of a primary admin.
we upgraded from 5.4.13
Re: since upgrading to 5.5.1 snmp checks timeout
Posted: Thu Jul 19, 2018 4:06 pm
by scottwilkerson
Can you execute the commands from the CLI?
Re: since upgrading to 5.5.1 snmp checks timeout
Posted: Fri Jul 20, 2018 9:18 am
by rzietlow
yes
[root@localhost ~]# /usr/local/nagios/libexec/check_snmp_storage.pl -H <SNIP> -C <SNIP> --v2c -m ^C: -w 85 -c 90 -f
ERROR: General time-out (Alarm signal)
[root@localhost ~]# /usr/local/nagios/libexec/check_snmp_storage.pl -H <SNIP> -C <SNIP> --v2c -m ^C: -w 85 -c 90 -f
ERROR: General time-out (Alarm signal)
[root@localhost ~]# /usr/local/nagios/libexec/check_snmp_storage.pl -H <SNIP> -C <SNIP> --v2c -m ^C: -w 85 -c 90 -f
C:\ Label:OS Serial Number 84e74ea0: 59%used(121529MB/204697MB) (<85%) : OK | 'C:\_Label:OS__Serial_Number_84e74ea0'=121529MB;173992;184227;0;204697
Re: since upgrading to 5.5.1 snmp checks timeout
Posted: Fri Jul 20, 2018 9:22 am
by rzietlow
[root@localhost ~]# time /usr/local/nagios/libexec/check_snmp_storage.pl -H <SNIP> -C <SNIP> --v2c -m ^C: -w 85 -c 90 -f
C:\ Label:OS Serial Number 84e74ea0: 59%used(121458MB/204697MB) (<85%) : OK | 'C:\_Label:OS__Serial_Number_84e74ea0'=121458MB;173992;184227;0;204697
real 0m12.412s
user 0m0.053s
sys 0m0.010s
[root@localhost ~]# time /usr/local/nagios/libexec/check_snmp_storage.pl -H <SNIP> -C <SNIP> --v2c -m ^C: -w 85 -c 90 -f
C:\ Label:OS Serial Number 84e74ea0: 59%used(121458MB/204697MB) (<85%) : OK | 'C:\_Label:OS__Serial_Number_84e74ea0'=121458MB;173992;184227;0;204697
real 0m14.694s
user 0m0.061s
sys 0m0.004s
[root@localhost ~]# time /usr/local/nagios/libexec/check_snmp_storage.pl -H <SNIP> -C <SNIP> --v2c -m ^C: -w 85 -c 90 -f
ERROR: General time-out (Alarm signal)
real 0m15.060s
user 0m0.058s
sys 0m0.006s
[root@localhost ~]# time /usr/local/nagios/libexec/check_snmp_storage.pl -H <SNIP> -C <SNIP> --v2c -m ^C: -w 85 -c 90 -f
C:\ Label:OS Serial Number 84e74ea0: 59%used(121458MB/204697MB) (<85%) : OK | 'C:\_Label:OS__Serial_Number_84e74ea0'=121458MB;173992;184227;0;204697
real 0m14.830s
user 0m0.059s
sys 0m0.006s
You have mail in /var/spool/mail/root
[root@localhost ~]# time /usr/local/nagios/libexec/check_snmp_storage.pl -H <SNIP> -C <SNIP> --v2c -m ^C: -w 85 -c 90 -f
ERROR: General time-out (Alarm signal)
real 0m15.062s
user 0m0.057s
sys 0m0.010s
[root@localhost ~]# time /usr/local/nagios/libexec/check_snmp_storage.pl -H <SNIP> -C <SNIP> --v2c -m ^C: -w 85 -c 90 -f
ERROR: General time-out (Alarm signal)
real 0m15.059s
user 0m0.052s
sys 0m0.011s
Re: since upgrading to 5.5.1 snmp checks timeout
Posted: Fri Jul 20, 2018 12:13 pm
by scottwilkerson
I see many are erroring at a 15 second mark.
If you add a -t 30 to your command to they succeed more frequently?