Page 1 of 2

SNMP windows issue - services timing out, but performance OK

Posted: Tue Dec 04, 2018 3:24 pm
by jacek
Hi,

I have a very weird issue with SNMP and an windows server.

The service checks are testing disk space, cpu, ram, and some services.
All performance related checks are working (disk, cpu, ram), but all service checks are returning "ERROR: Alarm signal (Nagios time-out)".

If I run an SNMP walk against the server I see all services with their names, the same if I run a wizard against it.

I raised the service check timeout to 300, but with no result.
If I run the command via ssh it errors out after less then 10 seconds, so this can't be a timeout?

I'm attaching my plugin just in case.

Re: SNMP windows issue - services timing out, but performanc

Posted: Tue Dec 04, 2018 5:16 pm
by ssax
Did you adjust the service check timeout in the nagios.cfg file or did you specify the -t 300 option in your command?

You'll want to define it on the command then you will need to modify your /usr/local/nagios/etc/nagios.cfg and update service_check_timeout to match otherwise it will timeout at 60 seconds.

Re: SNMP windows issue - services timing out, but performanc

Posted: Tue Dec 04, 2018 5:27 pm
by jacek
I set it in the core config via web.
But this doesn't change the fact, that when running the command via ssh I get the timeout after less then 10s (I'm not providing the -t parameter via the command).

Re: SNMP windows issue - services timing out, but performanc

Posted: Tue Dec 04, 2018 5:31 pm
by ssax
The default timeout for that script is 5 seconds, please try specifying the -t 300 on the command when running through SSH and see how long it takes to timeout.

Re: SNMP windows issue - services timing out, but performanc

Posted: Wed Dec 05, 2018 2:58 am
by jacek
The plugin doesn't allow to specify more than 60 seconds:

Code: Select all

ERROR: Invalid timeout value [300], range 1.0 - 60.0 seconds.
When I specified 60 seconds it really waits 60 seconds and then timeouts.
Any thoughts? Maybe I have some old version of the plugin (I attached it to the first post)?

Re: SNMP windows issue - services timing out, but performanc

Posted: Wed Dec 05, 2018 9:44 am
by scottwilkerson
Really it would be odd for the service to take longer than that anyways.

I'm starting to think that maybe your Windows server isn't allowing access to the SNMP OID

Can you run this replacing the community string and Windows server IP from your XI server?

Code: Select all

snmpwalk -v 2c -c public xxx.xxx.xxx.xxx 1.3.6.1.4.1.77.1.2.3.1

Re: SNMP windows issue - services timing out, but performanc

Posted: Wed Dec 05, 2018 12:33 pm
by jacek
I'm getting output from snmpwalk, as I already mentioned.
Snippet below:

Code: Select all

SNMPv2-SMI::enterprises.77.1.2.3.1.1.5.80.111.119.101.114 = STRING: "Power"
SNMPv2-SMI::enterprises.77.1.2.3.1.1.6.83.101.114.118.101.114 = STRING: "Server"
SNMPv2-SMI::enterprises.77.1.2.3.1.1.6.84.104.101.109.101.115 = STRING: "Themes"
SNMPv2-SMI::enterprises.77.1.2.3.1.1.8.78.101.116.108.111.103.111.110 = STRING: "Netlogon"
SNMPv2-SMI::enterprises.77.1.2.3.1.1.9.73.80.32.72.101.108.112.101.114 = STRING: "IP Helper"
SNMPv2-SMI::enterprises.77.1.2.3.1.1.10.68.78.83.32.67.108.105.101.110.116 = STRING: "DNS Client"
SNMPv2-SMI::enterprises.77.1.2.3.1.1.11.68.72.67.80.32.67.108.105.101.110.116 = STRING: "DHCP Client"
SNMPv2-SMI::enterprises.77.1.2.3.1.1.11.84.105.109.101.32.66.114.111.107.101.114 = STRING: "Time Broker"
SNMPv2-SMI::enterprises.77.1.2.3.1.1.11.87.111.114.107.115.116.97.116.105.111.110 = STRING: "Workstation"
SNMPv2-SMI::enterprises.77.1.2.3.1.1.12.83.78.77.80.32.83.101.114.118.105.99.101 = STRING: "SNMP Service"
SNMPv2-SMI::enterprises.77.1.2.3.1.1.12.85.115.101.114.32.77.97.110.97.103.101.114 = STRING: "User Manager"
SNMPv2-SMI::enterprises.77.1.2.3.1.1.12.86.77.119.97.114.101.32.84.111.111.108.115 = STRING: "VMware Tools"
SNMPv2-SMI::enterprises.77.1.2.3.1.1.12.87.105.110.100.111.119.115.32.84.105.109.101 = STRING: "Windows Time"
SNMPv2-SMI::enterprises.77.1.2.3.1.1.13.67.111.114.101.77.101.115.115.97.103.105.110.103 = STRING: "CoreMessaging"
SNMPv2-SMI::enterprises.77.1.2.3.1.1.13.80.108.117.103.32.97.110.100.32.80.108.97.121 = STRING: "Plug and Play"

Re: SNMP windows issue - services timing out, but performanc

Posted: Wed Dec 05, 2018 12:41 pm
by scottwilkerson
Can you show us the check_snmp_win.pl command you are running from the command line (obfuscating any sensitive information)?

Re: SNMP windows issue - services timing out, but performanc

Posted: Wed Dec 05, 2018 1:00 pm
by jacek
Sure, that's what the CCM is building when I click "Run Check Command", and that's what I'm testing with:

Code: Select all

/usr/local/nagios/libexec/check_snmp_win.pl -H HOSTNAME -C COMMUNITY --v2c -r -n 'Windows Deployment Services Server'
I also tested with short service names like "Server" and with the "-t 60" parameter without success.

Re: SNMP windows issue - services timing out, but performanc

Posted: Wed Dec 05, 2018 3:19 pm
by scottwilkerson
Can you run the command line this

Code: Select all

/usr/local/nagios/libexec/check_snmp_win.pl -H HOSTNAME -C COMMUNITY --v2c -r -n -vvv