Page 1 of 2

check_dell IPMI (Return code of 255 is out of bounds)

Posted: Tue Jun 09, 2015 3:33 am
by Richard_Lian
Hi all,

I am currently using the check_dell script written by sircharlo.
https://exchange.nagios.org/directory/P ... ck/details

I am having an issue with my new Poweredge R630's and NX3230. The checks return the error (Return code of 255 is out of bounds). I have tried executing the script via CLI but get the same response. Still works fine on my R610's and R620's.
I tried contacting the owner but no joy as of yet.

If anyone has any ideas or suggestions they would be greatly appreciated.

Many thanks.

Regards,
Richard

Re: check_dell IPMI (Return code of 255 is out of bounds)

Posted: Tue Jun 09, 2015 10:13 am
by jdalrymple
Unfortunately, the developers website seems to be down at the moment. No doubt this will require figuring out what about the newer hardware is different from the old hardware and how it interacts with the script differently to get the proper results. My problems:

1) I can't even look at the plugin right now because the developers site is down
2) Even if I did have the plugin, I don't have your hardware so I couldn't test in realtime

My suggestions:

1) See if there is a verbose output option and use it
2) Verify using racadm or whatever the proper tool is that the device is providing output to the plugin at all

You can also send the plugin to us to take a look at, but without the hardware it is definitely going to be difficult to troubleshoot.

Re: check_dell IPMI (Return code of 255 is out of bounds)

Posted: Wed Jun 10, 2015 2:05 am
by Richard_Lian
Hi,

Yeah I tired accessing the devs website and contacting them first but no joy.

I have attached the check_dell script none the less in the hope that you might be able to help.

I will also try to get a verbose output if at all possible.

Thanks in advance.

Re: check_dell IPMI (Return code of 255 is out of bounds)

Posted: Wed Jun 10, 2015 9:59 am
by tgriep
It looks like the plugin uses ipmi-sensors to connect to the Dell systems. Can you run the following and post back the output? Replace $ip, $username, $password with the correct values.

Code: Select all

ipmi-sensors -h $ip -u $username -p $password -C -Q

Re: check_dell IPMI (Return code of 255 is out of bounds)

Posted: Wed Jun 10, 2015 10:52 am
by Richard_Lian
When I run that I get: ipmi_ctx_open_outofband: session timeout

If I run the same command against one of my R610 servers for example I get a full list.

Re: check_dell IPMI (Return code of 255 is out of bounds)

Posted: Wed Jun 10, 2015 11:03 am
by tgriep
Can you verify that IPMI is enabled on the new systems?

Re: check_dell IPMI (Return code of 255 is out of bounds)

Posted: Wed Jun 10, 2015 11:12 am
by Richard_Lian
Yes - as far as I can tell.

They are using iDRAC enterprise 8 and I have enabled IPMI the same as my older servers running version 6/7.

Re: check_dell IPMI (Return code of 255 is out of bounds)

Posted: Wed Jun 10, 2015 11:40 am
by tgriep
You may want to run the ipmi-sensors command with --verbose and --debug enabled to get a better idea of what is not working but it could be a compatibility issue with the newer systems that you will have to workout.
Check this site that developed the tool.
http://www.gnu.org/software/freeipmi/

Re: check_dell IPMI (Return code of 255 is out of bounds)

Posted: Wed Jun 10, 2015 11:59 am
by Richard_Lian
The text file attached contains the output using --verbose and --debug together.

Re: check_dell IPMI (Return code of 255 is out of bounds)

Posted: Wed Jun 10, 2015 1:22 pm
by tgriep
Here is what the man page says for the ipmi-sensors for session timeout says.
"session timeout" - The IPMI session has timed out. Please reconnect. If this error occurs often, you may wish to increase the retransmission timeout.
Some remote BMCs are considerably slower than others.
You may want to add these 2 arguments to the command to see if they help out.
--session-timeout=MILLISECONDS
Specify the session timeout in milliseconds. Defaults to 20000 milliseconds (20 seconds) if not specified.

--retransmission-timeout=MILLISECONDS
Specify the packet retransmission timeout in milliseconds. Defaults to 1000 milliseconds (1 second) if not specified. The retransmission timeout cannot be larger than
the session timeout.
If you resolve the timeout problem, you can change the check command so it will function for you.

We don't have a system to test with so there isn't much else we can do.