Re: check_ifoperstatnag - No info being retrieved
Posted: Fri Feb 27, 2015 10:42 am
After further analysis the issue on SNMP timeouts appear to be related to Cisco NX-OS (Nexus 5548 as example) devices at this time.
Issue:
The port status service check, within Nagios, checks a port every 2 minutes for configured device. Most of the time the service check successfully completes. On occasion, however, the first service check after the last 2 minute expires the ‘snmpwalk’ command fails due to a time-out. Nagios waits its retry period (1 min in this case) and tries again and this succeeds.
We get approx. 1500 timeouts in a given day. I’ve checked the CPU and memory on the devices at the time timeouts occur and I don’t see load as an issue. Nagios is monitoring ports for about 200 devices. These errors are coming from 12 devices.
I'm going to pass this up to our Network team.
Issue:
The port status service check, within Nagios, checks a port every 2 minutes for configured device. Most of the time the service check successfully completes. On occasion, however, the first service check after the last 2 minute expires the ‘snmpwalk’ command fails due to a time-out. Nagios waits its retry period (1 min in this case) and tries again and this succeeds.
We get approx. 1500 timeouts in a given day. I’ve checked the CPU and memory on the devices at the time timeouts occur and I don’t see load as an issue. Nagios is monitoring ports for about 200 devices. These errors are coming from 12 devices.
I'm going to pass this up to our Network team.