Page 1 of 1

Nagios Timeout

Posted: Mon Oct 01, 2018 9:53 am
by dtidwell1224
I am currently seeing frequent Nagios timeouts from certain hosts, but only in the GUI.
If I run the check on the command line it runs just fine, or if I go into the check in CCM and issue the "Run Check Command" from the UI, it also works. But in the standard check, it times out every single time.

The check in question is "check_snmp_storage.pl" with a timeout of 30 seconds defined in the command.

As an example:

Code: Select all

[nagios@server01 ~]$ /usr/local/nagios/libexec/check_snmp_storage.pl -H intranet.domain.com  -C REDACTED --v2c -m ^C: -w 80 -c 90 -f -t 30
C:\ Label:  Serial Number b683054e: 27%used(78183MB/285466MB) (<80%) : OK | 'C:\_Label:__Serial_Number_b683054e'=78183MB;228373;256919;0;285466
It almost seems as though Nagios is ignoring the static timeout value defined in the check.
Any suggestions on where to look would be greatly appreciated.

Thank you,

Re: Nagios Timeout

Posted: Mon Oct 01, 2018 2:50 pm
by scottwilkerson
About how log does the check take to execute?

Can you attach your /usr/local/nagios/etc/nagios.cfg so we can look at the service_check_timeout value.

Thanks

Re: Nagios Timeout

Posted: Wed Oct 03, 2018 1:28 pm
by dtidwell1224
We have discovered the source of the timeouts and it is on the host end.

An install of a commvault backup agent on numerous hosts caused a significant delay on the remote host when answering snmp calls, regardless of the originating destination of the snmp request.

We have opened a ticket with the backup vendor regarding this issue, but can rule out any Nagios related error at this time.

Re: Nagios Timeout

Posted: Wed Oct 03, 2018 1:48 pm
by benjaminsmith
Glad to know your were able to identify the root cause of the problem. I will go ahead and lock this post.