Hi Support,
Since upgrading to 5.5.1, multiple hosts are getting a general timeout. Some come and go quickly; others stay in a "stable" state, but once we go in and force a check they go green.
Is there anything we can do, other than recompiling the kernel, to stop these timeouts? We fear we will miss something critical because Nagios isn't monitoring correctly and keeps getting timeouts.
since upgrading to 5.5.1 snmp checks timeout
-
scottwilkerson
- DevOps Engineer
- Posts: 19396
- Joined: Tue Nov 15, 2011 3:11 pm
- Location: Nagios Enterprises
- Contact:
Re: since upgrading to 5.5.1 snmp checks timeout
Can you share the commands for some of these SNMP checks so we can determine which plugins you are experiencing it with?
Re: since upgrading to 5.5.1 snmp checks timeout
They all tend to be "$USER1$/check_snmp_storage.pl -H $HOSTADDRESS$ $ARG1$" with our supplied args:
-C $COMMUNITY --v2c -m ^C: -w 85 -c 90 -f in this instance.
Re: since upgrading to 5.5.1 snmp checks timeout
Sorry to jump in, but this looks to be the same issue we are having:
https://support.nagios.com/forum/viewto ... 43#p257243
-
scottwilkerson
Re: since upgrading to 5.5.1 snmp checks timeout
Out of curiosity, do you know what version you upgraded from?
Also, do you know if you have ever needed to edit this plugin in previous versions?
Re: since upgrading to 5.5.1 snmp checks timeout
I have not had to edit it in the 2-3 years I've been more of a primary admin.
We upgraded from 5.4.13.
-
scottwilkerson
Re: since upgrading to 5.5.1 snmp checks timeout
Can you execute the commands from the CLI?
Re: since upgrading to 5.5.1 snmp checks timeout
yes
[root@localhost ~]# /usr/local/nagios/libexec/check_snmp_storage.pl -H <SNIP> -C <SNIP> --v2c -m ^C: -w 85 -c 90 -f
ERROR: General time-out (Alarm signal)
[root@localhost ~]# /usr/local/nagios/libexec/check_snmp_storage.pl -H <SNIP> -C <SNIP> --v2c -m ^C: -w 85 -c 90 -f
ERROR: General time-out (Alarm signal)
[root@localhost ~]# /usr/local/nagios/libexec/check_snmp_storage.pl -H <SNIP> -C <SNIP> --v2c -m ^C: -w 85 -c 90 -f
C:\ Label:OS Serial Number 84e74ea0: 59%used(121529MB/204697MB) (<85%) : OK | 'C:\_Label:OS__Serial_Number_84e74ea0'=121529MB;173992;184227;0;204697
[root@localhost ~]# /usr/local/nagios/libexec/check_snmp_storage.pl -H <SNIP> -C <SNIP> --v2c -m ^C: -w 85 -c 90 -f
ERROR: General time-out (Alarm signal)
[root@localhost ~]# /usr/local/nagios/libexec/check_snmp_storage.pl -H <SNIP> -C <SNIP> --v2c -m ^C: -w 85 -c 90 -f
ERROR: General time-out (Alarm signal)
[root@localhost ~]# /usr/local/nagios/libexec/check_snmp_storage.pl -H <SNIP> -C <SNIP> --v2c -m ^C: -w 85 -c 90 -f
C:\ Label:OS Serial Number 84e74ea0: 59%used(121529MB/204697MB) (<85%) : OK | 'C:\_Label:OS__Serial_Number_84e74ea0'=121529MB;173992;184227;0;204697
Re: since upgrading to 5.5.1 snmp checks timeout
[root@localhost ~]# time /usr/local/nagios/libexec/check_snmp_storage.pl -H <SNIP> -C <SNIP> --v2c -m ^C: -w 85 -c 90 -f
C:\ Label:OS Serial Number 84e74ea0: 59%used(121458MB/204697MB) (<85%) : OK | 'C:\_Label:OS__Serial_Number_84e74ea0'=121458MB;173992;184227;0;204697
real 0m12.412s
user 0m0.053s
sys 0m0.010s
[root@localhost ~]# time /usr/local/nagios/libexec/check_snmp_storage.pl -H <SNIP> -C <SNIP> --v2c -m ^C: -w 85 -c 90 -f
C:\ Label:OS Serial Number 84e74ea0: 59%used(121458MB/204697MB) (<85%) : OK | 'C:\_Label:OS__Serial_Number_84e74ea0'=121458MB;173992;184227;0;204697
real 0m14.694s
user 0m0.061s
sys 0m0.004s
[root@localhost ~]# time /usr/local/nagios/libexec/check_snmp_storage.pl -H <SNIP> -C <SNIP> --v2c -m ^C: -w 85 -c 90 -f
ERROR: General time-out (Alarm signal)
real 0m15.060s
user 0m0.058s
sys 0m0.006s
[root@localhost ~]# time /usr/local/nagios/libexec/check_snmp_storage.pl -H <SNIP> -C <SNIP> --v2c -m ^C: -w 85 -c 90 -f
C:\ Label:OS Serial Number 84e74ea0: 59%used(121458MB/204697MB) (<85%) : OK | 'C:\_Label:OS__Serial_Number_84e74ea0'=121458MB;173992;184227;0;204697
real 0m14.830s
user 0m0.059s
sys 0m0.006s
You have mail in /var/spool/mail/root
[root@localhost ~]# time /usr/local/nagios/libexec/check_snmp_storage.pl -H <SNIP> -C <SNIP> --v2c -m ^C: -w 85 -c 90 -f
ERROR: General time-out (Alarm signal)
real 0m15.062s
user 0m0.057s
sys 0m0.010s
[root@localhost ~]# time /usr/local/nagios/libexec/check_snmp_storage.pl -H <SNIP> -C <SNIP> --v2c -m ^C: -w 85 -c 90 -f
ERROR: General time-out (Alarm signal)
real 0m15.059s
user 0m0.052s
sys 0m0.011s
-
scottwilkerson
Re: since upgrading to 5.5.1 snmp checks timeout
I see many are erroring at the 15-second mark.
If you add -t 30 to your command, do they succeed more frequently?
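One way to answer that with numbers rather than by eye is to run the check several times with and without the extra option and count the timeouts. The sketch below is only an illustration: `count_timeouts` is a hypothetical helper (not part of Nagios or the plugin), and it simply looks for the "General time-out" text shown in the output above. How `-t` is interpreted may vary by plugin version, so treat the comparison as a test, not a fix.

```shell
#!/usr/bin/env bash
# Hypothetical helper: run a command N times and report how many runs
# printed the plugin's "General time-out" error.

count_timeouts() {
    local runs=$1; shift
    local timeouts=0 i output
    for ((i = 0; i < runs; i++)); do
        # Capture stdout and stderr; the exit code is ignored on purpose,
        # since only the error text is being counted.
        output=$("$@" 2>&1) || true
        if [[ $output == *"General time-out"* ]]; then
            timeouts=$((timeouts + 1))
        fi
    done
    echo "$timeouts/$runs runs timed out"
}

# Example usage (replace the <SNIP> placeholders with real values):
# count_timeouts 10 /usr/local/nagios/libexec/check_snmp_storage.pl -H <SNIP> -C <SNIP> --v2c -m '^C:' -w 85 -c 90 -f
# count_timeouts 10 /usr/local/nagios/libexec/check_snmp_storage.pl -H <SNIP> -C <SNIP> --v2c -m '^C:' -w 85 -c 90 -f -t 30
```

If the second count is clearly lower, the same `-t 30` can be appended to the service's arguments in the check command definition.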