Page 2 of 2

Re: check_ifoperstatnag - No info being retrieved

Posted: Fri Feb 27, 2015 10:42 am
by brdr
After further analysis the issue on SNMP timeouts appear to be related to Cisco NX-OS (Nexus 5548 as example) devices at this time.

Issue:
The port status service check, within Nagios, checks a port every 2 minutes for configured device. Most of the time the service check successfully completes. On occasion, however, the first service check after the last 2 minute expires the ‘snmpwalk’ command fails due to a time-out. Nagios waits its retry period (1 min in this case) and tries again and this succeeds.

We get approx. 1500 timeouts in a given day. I’ve checked the CPU and memory on the devices at the time timeouts occur and I don’t see load as an issue. Nagios is monitoring ports for about 200 devices. These errors are coming from 12 devices.

I'm going to pass this up to our Network team.

Re: check_ifoperstatnag - No info being retrieved

Posted: Fri Feb 27, 2015 11:10 am
by lmiltchev
I am glad we got to the bottom of this.
I'm going to pass this up to our Network team.
Let us know if it is safe to lock this topic.

Re: check_ifoperstatnag - No info being retrieved

Posted: Fri Feb 27, 2015 11:22 am
by brdr
safe to lock. Thanks for ur help!

Re: check_ifoperstatnag - No info being retrieved

Posted: Fri Feb 27, 2015 11:24 am
by cmerchant
We'll go ahead and close the thread. Thanks.