CHECK_BY_SSH intermittent

redeming
Posts: 8
Joined: Mon Mar 04, 2013 11:31 am

CHECK_BY_SSH intermittent

Post by redeming »

I am intermittently getting the following message from check_by_ssh checks against remote Linux systems: (Return code of 255 is out of bounds)

This is an intermittent problem that occurs with a number of hosts but not all of them and only from time to time.

Running the full check from the command line manually works every time. The systems are not overly busy and respond well to other ssh requests.

check_by_ssh -H $HOSTADDRESS$ -l <username> -C "/nfs/imo/imofile2/user/tools/nagios/libexec/check_disk -w20% -c10% -p/var"

<username> is an account that exists on the remote system.
abrist
Red Shirt
Posts: 8334
Joined: Thu Nov 15, 2012 1:20 pm

Re: CHECK_BY_SSH intermittent

Post by abrist »

Do you experience the issue when running the command from the CLI? If so, could you increase the timeout and try to reproduce the issue again?

check_by_ssh -H $HOSTADDRESS$ -l <username> -t 60 -C "/nfs/imo/imofile2/user/tools/nagios/libexec/check_disk -w20% -c10% -p/var"
Former Nagios employee
redeming
Posts: 8
Joined: Mon Mar 04, 2013 11:31 am

Re: CHECK_BY_SSH intermittent

Post by redeming »

As noted previously, it works EVERY time.
lmiltchev
Bugs find me
Posts: 13589
Joined: Mon May 23, 2011 12:15 pm

Re: CHECK_BY_SSH intermittent

Post by lmiltchev »

You mean it works every time from the CLI but only intermittently in the GUI, correct? Can you post the service definition?
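
One way to narrow this down is to repeat the check many times from the shell and log only the runs whose exit code falls outside the 0-3 range that Nagios accepts. The following is a minimal sketch, not an official Nagios tool: `repeat_check` is a hypothetical helper name, and the example invocation in the comments assumes the monitoring user is called `nagios` and that the plugins live under `/usr/local/nagios/libexec`, both of which may differ on your system.

```shell
#!/bin/sh
# Hypothetical helper: run a command COUNT times and report every run
# whose exit code is greater than 3 (outside the range Nagios accepts).
# Usage: repeat_check COUNT COMMAND [ARGS...]
repeat_check() {
    count=$1
    shift
    failures=0
    i=1
    while [ "$i" -le "$count" ]; do
        # Capture stdout and stderr together so ssh error text is preserved.
        if output=$("$@" 2>&1); then
            rc=0
        else
            rc=$?
        fi
        if [ "$rc" -gt 3 ]; then
            failures=$((failures + 1))
            echo "run $i: rc=$rc output=$output"
        fi
        i=$((i + 1))
    done
    echo "failures=$failures"
}

# Example invocation (assumed plugin path; <host> and <username> are
# placeholders). Run it as the account Nagios uses, assumed here to be nagios:
#   repeat_check 100 /usr/local/nagios/libexec/check_by_ssh -H <host> -l <username> \
#       -C "/nfs/imo/imofile2/user/tools/nagios/libexec/check_disk -w20% -c10% -p/var"
```

Running the loop as the monitoring user rather than your own account matters because scheduled checks typically use that user's SSH keys and known_hosts file, so a test from a personal login may not exercise the same path.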
redeming
Posts: 8
Joined: Mon Mar 04, 2013 11:31 am

Re: CHECK_BY_SSH intermittent

Post by redeming »

Yes, every time I run the service check from the command line using check_by_ssh, it works. It only fails when it is run automatically, and even then it usually succeeds, just not every time. Changing the timeout had no effect.

Here is the configuration.

###############################################################################
#
# Service configuration file
#
# Created by: Nagios QL Version 3.0.3
# Date: 2014-02-27 13:16:50
# Version: Nagios 3.x config file
#
# --- DO NOT EDIT THIS FILE BY HAND ---
# Nagios QL will overwite all manual settings during the next update
#
###############################################################################

define service {
service_description Unix - Check Var Directory
use generic-service
hostgroup_name Var Monitoring
display_name Unix - Check Var Directory
servicegroups Linux - Standard
check_command check_xi_by_ssh!<username>!"/nfs/imo/imofile2/user/tools/nagios/libexec/check_disk -w20% -c10% -p/var"!!!!!!
contact_groups Var System Monitoring
register 1
}

###############################################################################
#
# Service configuration file
#
# END OF FILE
#
###############################################################################


Thanks
cunningrat
Posts: 29
Joined: Mon Nov 18, 2013 5:51 pm

Re: CHECK_BY_SSH intermittent

Post by cunningrat »

Redeming, I had a similar issue: checks would alert with the message "Plugin timed out while executing system call". Increasing the timeout value (the -t flag to check_by_ssh) fixed the problem.
scottwilkerson
DevOps Engineer
Posts: 19396
Joined: Tue Nov 15, 2011 3:11 pm
Location: Nagios Enterprises

Re: CHECK_BY_SSH intermittent

Post by scottwilkerson »

@cunningrat - Thanks. I would have suggested the same thing...
Former Nagios employee
redeming
Posts: 8
Joined: Mon Mar 04, 2013 11:31 am

Re: CHECK_BY_SSH intermittent

Post by redeming »

The error I get does not seem to reflect a timeout. Additionally, as mentioned in my last post, I tried increasing the timeout.
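
For reference, Nagios only accepts plugin exit codes 0 through 3, and the ssh man page documents 255 as the exit status ssh uses when ssh itself hits an error, as opposed to the status of the remote command. A small sketch of that mapping follows. `nagios_state` is a hypothetical helper written purely for illustration, and whether a given check_by_ssh version passes ssh's 255 through unchanged is an assumption here.

```shell
#!/bin/sh
# Hypothetical helper: translate a plugin exit code into the state Nagios
# would assign, flagging anything outside 0-3 as out of bounds.
nagios_state() {
    case "$1" in
        0) echo "OK" ;;
        1) echo "WARNING" ;;
        2) echo "CRITICAL" ;;
        3) echo "UNKNOWN" ;;
        # 255 is the status ssh documents for its own errors
        # (for example, a connection or authentication failure).
        255) echo "OUT_OF_BOUNDS (possible ssh error)" ;;
        *) echo "OUT_OF_BOUNDS" ;;
    esac
}

# Example: classify the exit code of the most recent command.
#   some_check_command; nagios_state $?
```

Under this reading, a 255 points at the ssh session rather than at check_disk on the remote host, so any stderr text from ssh captured on a failing run would be the most useful thing to post.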
slansing
Posts: 7698
Joined: Mon Apr 23, 2012 4:28 pm
Location: Travelling through time and space...

Re: CHECK_BY_SSH intermittent

Post by slansing »

Is this happening with every plugin you are trying to execute via check_by_ssh?
redeming
Posts: 8
Joined: Mon Mar 04, 2013 11:31 am

Re: CHECK_BY_SSH intermittent

Post by redeming »

Yes.