Monitoring Openmanage Issue
Posted: Thu Aug 03, 2017 8:10 am
by CJIndustries
I'm able to get Nagios XI to monitor a PowerEdge 2900, Windows Server 2003, OpenManage 5.2. When I go through the same process with a PowerVault DL2100, Windows Server 2008, OpenManage 7.4, I get an Unknown with "ERROR: Alarm signal (Nagios time-out)". SNMP is set up identically on both. Firewalls are turned off.
What would be the next step in troubleshooting this problem?
Thanks in advance,
Mike
Re: Monitoring Openmanage Issue
Posted: Thu Aug 03, 2017 8:42 am
by mcapra
Which plugin are you using for this? I assume you're using check_openmanage? If so, which version of check_openmanage are you using? The plugin hasn't been updated in a while, and the last OMSA version I see mentioned in the changelog is 7.2.0.
Can you share the specific check you are using, and its output, executed from the CLI of your Nagios XI machine? Include any verbose output (the --debug option if using the above plugin), if available.
My initial hunch is that the SNMP request is timing out for one reason or another.
Re: Monitoring Openmanage Issue
Posted: Thu Aug 03, 2017 9:22 am
by CJIndustries
I attached the version I'm using and according to the comments, the PowerVault DL2100 is supported. I did increase the timeout, but I'm still getting the same results.
Re: Monitoring Openmanage Issue
Posted: Thu Aug 03, 2017 10:52 am
by cdienger
The only difference I can see in the script between the two models is that one additional OID is pulled for the DL2100. I'd be curious to see what happens if you run the command against the DL2100 with the option "-T pe_2900".
Re: Monitoring Openmanage Issue
Posted: Thu Aug 03, 2017 11:47 am
by CJIndustries
I get the same result.
[root@CJNAGIOS libexec]# ./check_dell_openmanage.pl -H host -C password -T pe_2900
ERROR: Alarm signal (Nagios time-out)
Re: Monitoring Openmanage Issue
Posted: Thu Aug 03, 2017 5:07 pm
by dwhitfield
What happens if you add a -2 at the end of your command?
Also, can you run an snmpwalk on that host?
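For anyone following along, here is a minimal sketch of what such a walk could look like with the Net-SNMP command-line tools. It assumes SNMP v2c, and the host address and community string are placeholders that do not come from this thread. The SNMPWALK variable is a hypothetical override added only so the function can be dry-run; it is not a Net-SNMP feature.

```shell
# Walk the standard "system" subtree (1.3.6.1.2.1.1) on one host.
# $1 = host or IP address, $2 = SNMP community string.
# -t 5 waits 5 seconds per request, -r 1 retries once before giving up.
# SNMPWALK is a hypothetical override for dry runs; it defaults to snmpwalk.
probe_snmp() {
    ${SNMPWALK:-snmpwalk} -v 2c -c "$2" -t 5 -r 1 "$1" 1.3.6.1.2.1.1
}

# Example call (placeholder address and community):
#   probe_snmp 192.0.2.10 public
```

If the walk prints nothing or times out, the agent is probably not answering on that address at all, which points away from the plugin itself.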
Re: Monitoring Openmanage Issue
Posted: Fri Aug 04, 2017 6:03 am
by CJIndustries
It worked once with the -2 at the end, and then it errored out on every run after that.
[root@CJNAGIOS libexec]# ./check_dell_openmanage.pl -H HOST -C PASSWORD -T pv_dl2100 -2
OK:
[root@CJNAGIOS libexec]# ./check_dell_openmanage.pl -H HOST -C PASSWORD -T pv_dl2100 -2
ERROR: Alarm signal (Nagios time-out)
[root@CJNAGIOS libexec]# ./check_dell_openmanage.pl -H HOST -C PASSWORD -T pv_dl2100 -2
ERROR: Alarm signal (Nagios time-out)
It also looks like the SNMP walk returns nothing. Maybe that is the issue?
Update: Figured this one out. The server had multiple interfaces with different IP addresses. I guess SNMP only wanted to talk through one interface instead of all of them. I changed the host in Nagios to point at that one interface and it worked. Case closed.
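In case it helps someone with a similar multi-homed server, here is a rough sketch of how the answering interface could be found from the Nagios box. It assumes the Net-SNMP snmpget tool and SNMP v2c; the addresses and community string in the example are placeholders, and SNMPGET is a hypothetical override for dry runs rather than anything Net-SNMP defines.

```shell
# Print the first address that answers an SNMP GET for sysDescr.0
# (1.3.6.1.2.1.1.1.0). Returns non-zero if none of them answer.
# $1 = community string, remaining arguments = candidate IP addresses.
# SNMPGET is a hypothetical override for dry runs; it defaults to snmpget.
first_responding_ip() {
    community="$1"
    shift
    for ip in "$@"; do
        # -t 3: 3 second timeout, -r 0: no retries, so dead addresses fail fast
        if ${SNMPGET:-snmpget} -v 2c -c "$community" -t 3 -r 0 "$ip" 1.3.6.1.2.1.1.1.0 >/dev/null 2>&1; then
            echo "$ip"
            return 0
        fi
    done
    return 1
}

# Example call (placeholder community and addresses):
#   first_responding_ip public 192.0.2.10 192.0.2.11
```

Whichever address it prints would be the one to use as the host address in Nagios.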
Thanks for the ideas people.
Re: Monitoring Openmanage Issue
Posted: Fri Aug 04, 2017 10:32 am
by bolson
Closing topic as resolved.
Thank you for using the Nagios Support Forum.