check_ifoperstatnag - No info being retrieved

brdr
Posts: 312
Joined: Mon Jun 02, 2014 12:49 pm

check_ifoperstatnag - No info being retrieved

Post by brdr »

Hi,

We are using Nagios XI 2014R2.3

We get hundreds of the messages below per day for port status checks. The associated bandwidth check for the port does NOT have this issue. The error only ever appears on the first attempt of max_attempts; the second attempt always works. I can run check_ifoperstatnag from the command line and never see this error.

Any idea why we are getting this error? I looked at the boards and didn't see a solution. Thanks.

[1424890841] SERVICE ALERT: a1.bdfrma01;netapp controller 1 Status;UNKNOWN;SOFT;1;UNKNOWN - No info is being retrieved.
[1424890990] SERVICE ALERT: a2.bstnma01;san-port-channel-2 Status;UNKNOWN;SOFT;1;UNKNOWN - No info is being retrieved.
[1424890990] SERVICE ALERT: a3.bstnma01;fc1/28 Status;UNKNOWN;SOFT;1;UNKNOWN - No info is being retrieved.
[1424891001] SERVICE ALERT: a4.bstnma01;mgmt0 Status;UNKNOWN;SOFT;1;UNKNOWN - No info is being retrieved.
[1424891001] SERVICE ALERT: a8.bstnma01;fc1/32 Status;UNKNOWN;SOFT;1;UNKNOWN - No info is being retrieved.
[1424891001] SERVICE ALERT: a7.bstnma01;san-port-channel-2 Status;UNKNOWN;SOFT;1;UNKNOWN - No info is being retrieved.
[1424891001] SERVICE ALERT: a2.bstnma01;fc1/28 Status;UNKNOWN;SOFT;1;UNKNOWN - No info is being retrieved.
[1424891011] SERVICE ALERT: a1.bdfrma01;Member of SAN-Port-Channel to MDS-01 Status;UNKNOWN;SOFT;1;UNKNOWN - No info is being retrieved.
abrist
Red Shirt
Posts: 8334
Joined: Thu Nov 15, 2012 1:20 pm

Re: check_ifoperstatnag - No info being retrieved

Post by abrist »

It could be that the initial SNMP check times out, and that the device then caches the info temporarily, causing the second check to return much faster. Try running the check from the CLI with "time" in order to time a cold check. If the cold check takes longer than the timeout allows but returns faster on the second attempt, this is most likely the issue, and increasing the timeout should resolve it for you.
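
A minimal sketch of the cold-versus-warm timing test described above, assuming a POSIX shell. The helper name `time_runs` is hypothetical and not part of Nagios XI. It runs whatever command it is given twice in a row and prints the wall-clock seconds for each run.

```shell
# Hypothetical helper (not part of Nagios XI): run a command twice and
# print how many whole seconds each run took.
time_runs() {
    for run in 1 2; do
        start=$(date +%s)        # seconds since the epoch
        "$@" >/dev/null 2>&1     # discard plugin output; only the timing matters here
        end=$(date +%s)
        echo "run $run: $((end - start))s"
    done
}

# Usage, with the placeholders replaced by your own index, credentials, and address:
# time_runs /usr/local/nagios/libexec/check_ifoperstatnag <index> -v3 -u <username> -A <passphrase> -a MD5 -l authNoPriv <ip address>
```

The first run only counts as "cold" if the device has not been polled for a while, so it is worth trying this after the check has been idle. A slow first run followed by a fast second run would point at the timeout.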
brdr
Posts: 312
Joined: Mon Jun 02, 2014 12:49 pm

Re: check_ifoperstatnag - No info being retrieved

Post by brdr »

I can see the timeout in XI for the snmpwalk in the check_ifoperstatnag script, and sometimes on the subsequent snmpget as well. What timeout value does this script use? Does it use max_execution_time from php.ini?

I picked one service check from yesterday morning that illustrates the UNKNOWN behavior. This port status service check is set to run every 2 minutes. It appears from the nagios.log that something happens about every hour that forces the check to time out. Where is this cache?

[Wed Feb 25 00:23:51 2015] SERVICE ALERT: x.bstnma01;fc1/30 Status;UNKNOWN;SOFT;1;UNKNOWN - No info is being retrieved.
[Wed Feb 25 00:24:51 2015] SERVICE ALERT: x.bstnma01;fc1/30 Status;OK;SOFT;2;OK - Interface fc1/30 (index 16896000) is up.
[Wed Feb 25 01:43:31 2015] SERVICE ALERT: x.bstnma01;fc1/30 Status;UNKNOWN;SOFT;1;UNKNOWN - No info is being retrieved.
[Wed Feb 25 01:44:21 2015] SERVICE ALERT: x.bstnma01;fc1/30 Status;OK;SOFT;2;OK - Interface fc1/30 (index 16896000) is up.
[Wed Feb 25 02:43:31 2015] SERVICE ALERT: x.bstnma01;fc1/30 Status;UNKNOWN;SOFT;1;UNKNOWN - No info is being retrieved.
[Wed Feb 25 02:44:12 2015] SERVICE ALERT: x.bstnma01;fc1/30 Status;OK;SOFT;2;OK - Interface fc1/30 (index 16896000) is up.
[Wed Feb 25 03:43:21 2015] SERVICE ALERT: x.bstnma01;fc1/30 Status;UNKNOWN;SOFT;1;UNKNOWN - No info is being retrieved.
[Wed Feb 25 03:44:12 2015] SERVICE ALERT: x.bstnma01;fc1/30 Status;OK;SOFT;2;OK - Interface fc1/30 (index 16896000) is up.
[Wed Feb 25 04:33:31 2015] SERVICE ALERT: x.bstnma01;fc1/30 Status;UNKNOWN;SOFT;1;UNKNOWN - No info is being retrieved.
[Wed Feb 25 04:34:12 2015] SERVICE ALERT: x.bstnma01;fc1/30 Status;OK;SOFT;2;OK - Interface fc1/30 (index 16896000) is up.
[Wed Feb 25 05:33:12 2015] SERVICE ALERT: x.bstnma01;fc1/30 Status;UNKNOWN;SOFT;1;UNKNOWN - No info is being retrieved.
[Wed Feb 25 05:34:01 2015] SERVICE ALERT: x.bstnma01;fc1/30 Status;OK;SOFT;2;OK - Interface fc1/30 (index 16896000) is up.
[Wed Feb 25 06:13:31 2015] SERVICE ALERT: x.bstnma01;fc1/30 Status;UNKNOWN;SOFT;1;UNKNOWN - No info is being retrieved.
[Wed Feb 25 06:14:22 2015] SERVICE ALERT: x.bstnma01;fc1/30 Status;OK;SOFT;2;OK - Interface fc1/30 (index 16896000) is up.
[Wed Feb 25 07:13:21 2015] SERVICE ALERT: x.bstnma01;fc1/30 Status;UNKNOWN;SOFT;1;UNKNOWN - No info is being retrieved.
[Wed Feb 25 07:14:12 2015] SERVICE ALERT: x.bstnma01;fc1/30 Status;OK;SOFT;2;OK - Interface fc1/30 (index 16896000) is up.
[Wed Feb 25 08:13:21 2015] SERVICE ALERT: x.bstnma01;fc1/30 Status;UNKNOWN;SOFT;1;UNKNOWN - No info is being retrieved.
[Wed Feb 25 08:14:12 2015] SERVICE ALERT: x.bstnma01;fc1/30 Status;OK;SOFT;2;OK - Interface fc1/30 (index 16896000) is up.
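
To put numbers on the "about every hour" pattern, here is a small sketch that reads log lines in the format pasted above from standard input and prints the gap between consecutive first-attempt UNKNOWN alerts. The function name `unknown_gaps` and the file name in the usage comment are hypothetical. It assumes the timestamps are already in the human-readable form shown here and that all lines fall on the same day, as they do in this excerpt.

```shell
# Hypothetical helper: print the time of each first-attempt UNKNOWN alert
# and the gap, in whole minutes, since the previous one.
unknown_gaps() {
    awk '
        /;UNKNOWN;SOFT;1;/ {
            # Field 4 is HH:MM:SS in lines like
            # "[Wed Feb 25 00:23:51 2015] SERVICE ALERT: ..."
            split($4, t, ":")
            now = t[1] * 3600 + t[2] * 60 + t[3]   # seconds since midnight
            if (seen) printf "%s  +%d min\n", $4, int((now - prev) / 60)
            else      printf "%s  (first)\n", $4
            prev = now
            seen = 1
        }'
}

# Usage: save the lines above to a file and feed it in.
# unknown_gaps < excerpt.txt
```

Fed the lines above, this reports gaps of 79, 60, 59, 50, 59, 40, 59, and 60 minutes, so the failures cluster around one hour apart but range from 40 to 79 minutes.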
scottwilkerson
DevOps Engineer
Posts: 19396
Joined: Tue Nov 15, 2011 3:11 pm
Location: Nagios Enterprises

Re: check_ifoperstatnag - No info being retrieved

Post by scottwilkerson »

Can you confirm which version of the Switch / Router Wizard you are running? You can find it under:
Admin -> Manage Config Wizards
brdr
Posts: 312
Joined: Mon Jun 02, 2014 12:49 pm

Re: check_ifoperstatnag - No info being retrieved

Post by brdr »

Version: 2.1.5
lmiltchev
Bugs find me
Posts: 13589
Joined: Mon May 23, 2011 12:15 pm

Re: check_ifoperstatnag - No info being retrieved

Post by lmiltchev »

Can you show us the actual command that you are running from the command line, along with the output of it? (Hide sensitive info)

Example:

/usr/local/nagios/libexec/check_ifoperstatnag 13 -v3 -u <username> -A <auth protocol passphrase> -x DES -X <privacy protocol passphrase> -a MD5 -l authPriv <ip address>
OK - Interface Adaptive (index 13) is up.
brdr
Posts: 312
Joined: Mon Jun 02, 2014 12:49 pm

Re: check_ifoperstatnag - No info being retrieved

Post by brdr »

/usr/local/nagios/libexec/check_ifoperstatnag 16887808 -v3 -u xxxx -A xxxx -a MD5 -l authNoPriv x.x.x.x
OK - Interface fc1/28 (index 16887808) is up.

Please lemme know if you need more info.
lmiltchev
Bugs find me
Posts: 13589
Joined: Mon May 23, 2011 12:15 pm

Re: check_ifoperstatnag - No info being retrieved

Post by lmiltchev »

The command seems correct. As this is an intermittent issue, it is possible that your device fails to respond in a timely fashion. I would recommend modifying your check command (in the CCM) for the problem interfaces by adding a "-t" flag (timeout value). You can start with "-t 15", for example, and increase the value if needed. Let us know if this fixes your issue.
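
As a rough illustration of where the flag would go, the sketch below prints the command line from earlier in the thread with "-t" added. The helper name `check_cmd_with_timeout` is hypothetical, and placing "-t" with the other options ahead of the address is an assumption; adjust it to match how your check command passes its arguments.

```shell
# Hypothetical helper: print the check command with a timeout value inserted.
# Assumption: -t is accepted alongside the other options, ahead of the address.
check_cmd_with_timeout() {
    timeout=$1
    index=$2
    address=$3
    echo "/usr/local/nagios/libexec/check_ifoperstatnag $index -v3 -u xxxx -A xxxx -a MD5 -l authNoPriv -t $timeout $address"
}

# Print the command for the interface index used above with a timeout value of 15.
check_cmd_with_timeout 15 16887808 x.x.x.x
```

Running the printed command by hand is a quick way to confirm the plugin accepts the flag before changing the command definition in the CCM.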
brdr
Posts: 312
Joined: Mon Jun 02, 2014 12:49 pm

Re: check_ifoperstatnag - No info being retrieved

Post by brdr »

Sure. I will pick a couple of offenders, set the timeout, see if this fixes it, and circle back with you. Thanks.
cmerchant
Posts: 546
Joined: Wed Sep 24, 2014 11:19 am

Re: check_ifoperstatnag - No info being retrieved

Post by cmerchant »

Let us know how that works out. Thanks.
Locked